MailPlus Intelligence is the intelligence layer for email, not the canonical mail warehouse.
- Canonical raw store: the operator's MailPlus deployment
- Structured recall layer: metadata + thread index
- Semantic layer: selected high-value summaries, entities, obligations, and events
- Durable memory surfaces: wiki,
memory/,MEMORY.md, and approved reminders/tasks
The system should reference raw mail from MailPlus rather than duplicating raw mail into long-term memory surfaces.
See privacy, secrets, and redaction boundaries for the raw-mail, metadata, cache, semantic-output, and durable-memory rules.
The canonical-store boundary is documented in
docs/architecture/canonical-store-boundary.md.
That document is the implementation guardrail for keeping MailPlus as the raw
archive while this repo stores only structured metadata and reviewed derived
intelligence.
MailPlus remains the source of truth for:
- raw message storage
- attachments
- threading truth
- retention/compliance/history
The index layer should cover all mail metadata needed for fast retrieval and thread reconstruction, including:
message-idthread-idfrom/to/ccsubjectdate- mailbox / folder / labels
- attachment flags
- normalized thread relationships
- MailPlus locator information when available
This layer exists for:
- fast filtering
- thread reconstruction
- dedupe
- source discovery such as “find all mail from X about Y”
Only selected, high-value mail should be promoted into derived intelligence such as:
- thread summaries
- people/company relationship summaries
- commitments / obligations
- decisions made in email
- travel / billing / legal / account events
- project correspondence summaries
Those distilled outputs can then feed:
- wiki pages
memory/MEMORY.md- future entity/concept pages
- reminders/tasks when explicitly approved
v0.1 is a fixture-mode release. It supports metadata fixture sync, SQLite
schema bootstrap, deterministic thread reconstruction, lane classification,
noise suppression, deterministic semantic extraction, LLM extraction when the
optional llm extra and API key are configured, promotion queue review,
dry-run exporters, scheduler locks, CLI inspection, and mpi doctor.
- Live MailPlus/IMAP ingestion is not connected yet. The public adapter shape is documented in the live MailPlus adapter guide and tracked for Phase 2 in issue #75.
- Production export to wiki,
memory/, or reminders is dry-run only. The future live export contract is documented in the live export guide and tracked for Phase 2 in issue #75.
Avoid dumping:
- all raw email bodies
- newsletters, marketing, and promotions
- bulk notifications
- automated logs
- repetitive receipts unless categorized
- attachment binaries
That would poison recall quality and blur the canonical/archive boundary.
This repo is targeting the medium architecture first:
- full metadata/thread index
- incremental sync
- selective semantic extraction
- wiki/entity promotion
- raw email remains only in MailPlus
This gives high value without memory bloat or premature overbuilding.
Start with the fixture-mode quickstart to seed a local database, search fixture metadata, review extraction candidates, and run a dry-run export without live MailPlus credentials.
M0 uses a Python 3.12 package with SQLite-friendly local foundations.
Python is the initial runtime because indexing, classification, and extraction work will be data-heavy and benefits from the Python ecosystem. SQLite is the initial storage foundation for local metadata/thread indexing before any live MailPlus, DSM, or NAS access is introduced.
Local setup:
python3.12 -m venv .venv
source .venv/bin/activate
python -m pip install -e .Run the fast unit-test baseline:
bash scripts/ci/run-fast-checks.shPhase 1 should support practical operator questions like:
- What’s my history with this person?
- Did I already commit to this?
- What admin, travel, financial, or legal follow-ups are pending?
The v0.1 public-release blockers are tracked with the
release-v0.1 label.
Phase 2 live integration work is tracked in
issue #75.
Primary epics include:
- architecture boundary and canonical-store model
- metadata/thread indexing
- incremental sync/export
- search and on-demand raw fetch
- classification lanes
- semantic extraction
- memory/wiki promotion
- phase-1 medium-architecture delivery
Release history is tracked in CHANGELOG.md, and versioning policy is documented in docs/versioning.md.
MailPlus stores the mail. MailPlus Intelligence stores reviewed derived intelligence.