LexFlow is a human-readable, AI-interpretable format for encoding meaning itself.
It combines the clarity of a dictionary, the structure of a programming language, and the precision of an embedding model into one unified file type.
Every LexFlow file (.copr) describes how ideas, words, or code structures connect to one another through both text and vectors.
LexFlow is part of the FlowOS ecosystem, designed by Don Sauber.
At its simplest, a LexFlow file is a list of concepts.
Each line holds:
code : word : alt_text : sign_code : speech_hint = vector_list
Example:
B1: run : to move quickly on foot : ASL_RUN : rhymes with sun = 0.12, -0.04, 0.33, 0.21, -0.10, 0.02, 0.18, -0.07
-
Everything before
=is for humans and accessibility. -
Everything after
=is for AIs and computation.
Today, language and data live in two incompatible worlds.
Text is readable but inefficient; vectors are powerful but opaque.
LexFlow fuses them into a single, transparent layer that both people and machines can understand.
Problems it addresses:
| Problem | LexFlow Solution |
|---|---|
| Redundant text storage | Single semantic entry replaces dozens of word forms |
| Unaligned embeddings | Vectors stored beside their meanings—no lookup errors |
| Accessibility gaps | Built-in fields for alt text, sign codes, and speech hints |
| Punctuation duplication | Unicode handled natively; no extra tokens required |
| File bloat | Variable-length codes keep data compact |
| Fragmented parsing | One universal Copper-based structure for all languages |
-
Readability First
Anyone can open and edit a LexFlow file in plain text.
No special IDE, no hidden binary data. -
Machine-Parsable by Design
Each line follows one strict pattern—parsers stay tiny and robust. -
Accessibility as Core Infrastructure
Alternative text, sign-language references, and pronunciation cues are not add-ons—they’re part of the data model. -
Universal Scope
LexFlow doesn’t just describe human language.
It can represent programming syntax, scientific notation, or abstract symbols—any system where meaning has structure. -
Inline Vectors
Embedding vectors live beside definitions, eliminating alignment errors and allowing immediate AI ingestion. -
Context-Scoped Headers
Each file defines its language, version, and vector schema once at the top; all entries inherit that context.
-
Create open, shared semantic dictionaries for AI training.
-
Compress multilingual corpora by using concept IDs instead of full words.
-
Build explainable embeddings where every coordinate has a readable anchor.
-
Enable screen readers, sign-language interfaces, and text-to-speech to share one common vocabulary layer.
-
Support high-contrast color coding, haptic feedback, and audio cues for each entry.
-
Treat programming keywords, libraries, or templates as LexFlow units.
-
Allow AI agents to “read” code and documentation in the same format.
-
Use vectors to find semantically similar functions or APIs.
-
Map meaning to color or motion, creating semantic heat maps of text and code.
-
Identify patterns of tone, rhythm, or conceptual density visually.
-
Simplify reading or translation for language learners.
-
Build bilingual LexFlow sets with one-to-one concept mapping.
-
Use LexFlow as an intermediate layer for natural-language programming.
Cross-AI Semantic Alignment
Different AI models can share LexFlow bundles to synchronize meaning, making collaboration and transfer learning trivial.
Knowledge Portability
Vectors and definitions can be version-controlled like code—meaning becomes reproducible.
Semantic Compression
Documents transmitted as LexFlow references become drastically smaller while retaining clarity.
Transparency and Trust
Because every number lives beside its definition, LexFlow eliminates “black box” embeddings.
Universal Accessibility
Every LexFlow entry can be expressed through sight, sound, text, or touch—bridging human and machine communication seamlessly.
/spec/LexFlow_Spec.copr # Core specification
/examples/LexFlow_English_Sample.copr
/tools/parser.py # Minimal parser / validator
/LICENSE # CC BY-SA 4.0 + MIT for code
`from pathlib import Path`
`import numpy as np`
`for line in Path("examples/LexFlow_English_Sample.copr").read_text().splitlines():`
`if "=" not in line or line.strip().startswith("#"):`
`continue`
`left, vec = line.split("=", 1)`
`fields = [f.strip() for f in left.split(":", 4)]`
`code, word, alt, sign, speech = (fields + [None]*5)[:5]`
`vector = np.fromstring(vec, sep=",")`
`print(code, word, vector[:3])`
LexFlow is designed for open collaboration.
You can contribute by:
-
Proposing new spec sections in
/spec/. -
Adding language or code-domain dictionaries in
/examples/. -
Providing accessibility enhancements.
-
Sharing averaged or consensus vector sets.
-
Building visualization or editing tools.
All contributions remain open and attributed under the repository license.
Text and Data: CC BY-SA 4.0
Code Examples: MIT License
© 2025 Don Sauber
LexFlow is the beginning of a universal semantic layer —
a bridge where words, code, and meaning share the same language.