Skip to content

docs: cutting-edge search + memory research report for Flatbread + Proof#185

Merged
tonyketcham merged 4 commits into
mainfrom
toeknee/search-research-dag-719a
May 11, 2026
Merged

docs: cutting-edge search + memory research report for Flatbread + Proof#185
tonyketcham merged 4 commits into
mainfrom
toeknee/search-research-dag-719a

Conversation

@tonyketcham
Copy link
Copy Markdown
Collaborator

@tonyketcham tonyketcham commented May 10, 2026

Summary of changes

Strategic research report on performant retrieval / agent memory for Flatbread + Proof, plus clone-safe citations and honest CLI wording.

Layout (updated)

Review fixes (prior commits)


  • pnpm exec prettier --write on touched markdown/json
  • DAG JSON parses

Does this introduce any non-backwards compatible changes?

  • Yes
  • No

Does this include any user config changes?

  • Yes
  • No
Open in Web Open in Cursor 

cursoragent and others added 2 commits May 10, 2026 07:36
… Proof

Synthesizes a /proof DAG run that executed in three ranks:

- Rank 1 (parallel research streams):
  - Audit of Flatbread's current retrieval surface (filter DSL, refs, transformers).
  - Audit of @flatbread/proof's parent->child context-passing pipeline.
  - SOTA dossiers on dense/sparse/hybrid retrieval, structural/graph retrieval,
    agent-memory architectures, and embeddable runtimes (sqlite-vec, Orama, etc.).
- Rank 2 (parallel synthesis):
  - Proof-track opportunity catalog (O1..O11).
  - Flatbread-track opportunity catalog (F1..F12).
  - Novel/industry-shifting bets (N1..N12).
- Rank 3:
  - Composes the final 13-section weighing report at repo root.

Recommends Posture B (Effort-Graph + Hybrid Retrieval), shows three phased
trajectories (V0 foundation / V1 minimum credible / V2 industry-shifting), and
proposes new packages (@flatbread/index, @flatbread/embed, @flatbread/mcp,
@flatbread/memory) plus an MCP surface that lets Proof use Flatbread as its
retrieval backend over the host repo.

Co-authored-by: Tony <tonyketcham@users.noreply.github.com>
Co-authored-by: Tony <tonyketcham@users.noreply.github.com>
Copy link
Copy Markdown

@cursor cursor Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Found two review-worthy issues in flatbread-search-and-memory-research.md.

  1. The report cites /tmp/proof-search/* artifacts and file:///tmp/... links that are not checked into the repo, so readers cannot inspect the evidence chain from GitHub or a fresh clone.
  2. It presents flatbread index verify / flatbread index build as runnable CLI commands, but the current flatbread CLI only exposes start, init, and codegen.

Because this PR is docs-only, I did not identify any test coverage asks beyond keeping the citations and command surface accurate.

Open in Web View Automation 

Sent by Cursor Automation: Flatbread PR Review

Comment thread flatbread-search-and-memory-research.md Outdated

### 2.1 Frontmatter-only retrieval is structurally insufficient

The audit ([`/tmp/proof-search/audit-flatbread-retrieval.md`](file:///tmp/proof-search/audit-flatbread-retrieval.md) §2.4–2.5, §3.1, §5) is unambiguous about today's retrieval surface in [`packages/core`](packages/core):
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This link target is local to the author's machine/runtime, not to the repo. None of the cited audit-*, sota-*, or synthesis-* markdown files are checked in anywhere under this tree, so a reader opening the PR or a fresh clone cannot inspect the source material behind the report's claims.

Please either commit those dossiers under a stable repo path and link them relatively, or rewrite these as plain-text citations to committed/external sources instead of file:///tmp/... links.

Comment thread flatbread-search-and-memory-research.md Outdated

#### F9 — Index artifacts + verify CLI

chunks/edges jsonl plus gitignored sqlite/orama binaries; `flatbread index verify` checks hashes **Pros:** CI drift detection; clone size control **Hypothesis:** Support load shifts from mystique to actionable rebuild commands.
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

flatbread index verify is written here as if it already exists, but the current CLI in packages/flatbread/src/cli/index.ts only defines start, init, and codegen. Copy/pasting this from the doc will fail today.

If this is future work, please mark it explicitly as proposed/not-yet-shipped instead of command voice. Otherwise, land the command and link the implementation/docs from this report.

Comment thread flatbread-search-and-memory-research.md Outdated

**Query surfaces:** GraphQL remains the introspectable contract for apps; MCP exposes tool-shaped facades over the identical resolver stack; codegen grows typed accessors (document nodes + thin SDK) so agents rarely hand-author GraphQL strings.

**Index format:** Source markdown stays canonical. **Tier A:** committed `chunks.jsonl` / `edges.jsonl` manifests for CI and small repos. **Tier B (default at scale):** derived `sqlite` with FTS5 + `sqlite-vec` (or Orama snapshot) under `.flatbread/index/`, gitignored, rebuilt by `flatbread index build` with manifest hashes recorded. Optional **Tier C:** team policy commits quantized vectors per N3 when audit demands byte-level review.
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same issue here for flatbread index build: the report treats it as an existing command, but there is no such subcommand in the current CLI. That makes the "rebuilt by flatbread index build" workflow unverifiable for readers.

Please either implement and document the subcommand before describing it as runnable, or rephrase this section as a proposed future design rather than present-day operational guidance.

@tonyketcham tonyketcham marked this pull request as ready for review May 11, 2026 03:09
…x CLI

- Add docs/research/performant-context-search/README.md describing the Proof DAG
  methodology and why intermediate dossiers are not vendored; link to root
  report and packages/proof README (.flatbread/artifacts default).
- Remove file:///tmp and /tmp/proof-search links from the research report;
  point readers at the methodology README, repo paths, and §13 references.
- Mark flatbread index build/verify as proposed future CLI; cite shipped
  commands in packages/flatbread/src/cli/index.ts (start/init/codegen only).
- Soften embeddable-runtime guidance to regenerated .flatbread/index outputs
  vs committing manifests where desired.

Co-authored-by: Tony <tonyketcham@users.noreply.github.com>
@tonyketcham tonyketcham force-pushed the toeknee/search-research-dag-719a branch from 49018d1 to af7e118 Compare May 11, 2026 06:45
@tonyketcham tonyketcham merged commit efbb36c into main May 11, 2026
37 checks passed
@tonyketcham tonyketcham deleted the toeknee/search-research-dag-719a branch May 11, 2026 06:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants