hyperpolymath
diff --git a/‎PROOF-NEEDS.md‎
Lines changed: 35 additions & 0 deletions b/‎PROOF-NEEDS.md‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎TEST-NEEDS.md‎
Lines changed: 68 additions & 0 deletions b/‎TEST-NEEDS.md‎
Lines changed: 68 additions & 0 deletions
diff --git a/‎lithoglyph/analytics/src/abi/Foreign.idr‎
Lines changed: 0 additions & 226 deletions b/‎lithoglyph/analytics/src/abi/Foreign.idr‎
Lines changed: 0 additions & 226 deletions
@@ -0,0 +1,35 @@
+# PROOF-NEEDS.md — nextgen-databases
+
+## Current State
+
+- **src/abi/*.idr**: YES (in lithoglyph) — `BofigEntities.idr`, `GQLdt/ABI/Foreign.idr`
+- **Dangerous patterns**: 0 (4 references are documentation asserting "no believe_me" invariant)
+- **LOC**: ~202,000 (Rust + Idris2)
+- **ABI layer**: Lithoglyph has Idris2 ABI with constructive proofs, explicit no-believe_me policy
+
+## What Needs Proving
+
+| Component | What | Why |
+|-----------|------|-----|
+| VeriSimDB WAL correctness | Write-ahead log guarantees durability and ordering | WAL bugs cause data loss — the worst database bug |
+| VeriSimDB transaction isolation | ACID properties hold under concurrent access | Isolation violations corrupt data silently |
+| VeriSimDB ZKP bridge | Zero-knowledge proof generation is sound | Unsound ZKP breaks semantic verification |
+| VeriSimDB HNSW index | Vector similarity search returns correct nearest neighbours | Wrong results from vector search corrupt ML pipelines |
+| VeriSimDB query planner | Query optimization preserves result equivalence | Optimized query must return same results as unoptimized |
+| VeriSimDB drift detection | Drift detector correctly identifies schema changes | Missed drift causes silent data corruption |
+| Lithoglyph entity proofs | Extend BofigEntities constructive proofs | Current proofs cover basic entities; need full coverage |
+| QuandleDB algebraic laws | Quandle operations satisfy rack/quandle axioms | Mathematical structure must be correct by construction |
+
+## Recommended Prover
+
+**Idris2** — Lithoglyph already has constructive Idris2 proofs. VeriSimDB WAL and transaction proofs fit naturally. ZKP soundness may need **Coq** for deeper cryptographic proofs.
+
+## Priority
+
+**HIGH** — VeriSimDB is the standard database across the ecosystem. WAL correctness and transaction isolation are critical — bugs here corrupt data in every downstream project. The ZKP bridge is security-critical.
+
+## Template ABI Cleanup (2026-03-29)
+
+Template ABI removed -- was creating false impression of formal verification.
+The removed files (Types.idr, Layout.idr, Foreign.idr) contained only RSR template
+scaffolding with unresolved {{PROJECT}}/{{AUTHOR}} placeholders and no domain-specific proofs.
@@ -0,0 +1,68 @@
+# TEST-NEEDS.md — nextgen-databases
+
+> Generated 2026-03-29 by punishing audit.
+
+## Current State
+
+| Category     | Count | Notes |
+|-------------|-------|-------|
+| Unit tests   | ~40   | VeriSimDB Elixir: consensus (kraft_node, kraft_wal, kraft_recovery, kraft_transport), federation adapters (mongodb, redis, duckdb, clickhouse, surrealdb, sqlite, neo4j, vector_db, influxdb, object_storage), resolver, adapter + base tests |
+| Integration  | ~12   | Federation adapter integration tests (mongodb, redis, neo4j, clickhouse, surrealdb, influxdb) |
+| E2E          | 0     | None |
+| Benchmarks   | 2     | verisimdb/benches/modality_benchmarks.rs (Rust), lithoglyph core-factor benchmarks.factor |
+
+**Source modules:** ~833 across 2 major subsystems. verisimdb: ~248 files (Rust core, Elixir orchestration, Gleam, Idris2 ABI, Zig FFI, ReScript). lithoglyph: ~212 files (Gleam, Rust, Factor).
+
+## What's Missing
+
+### P2P (Property-Based) Tests
+- [ ] Kraft consensus: property tests for leader election, log replication, partition tolerance
+- [ ] CRDT convergence: property tests for VeriSimDB's CRDT operations
+- [ ] VQL query parsing: arbitrary query fuzzing
+- [ ] Federation: property tests for data consistency across adapters
+- [ ] lithoglyph: data structure invariant tests
+
+### E2E Tests
+- [ ] VeriSimDB: full write -> replicate -> read across nodes
+- [ ] Federation: write through adapter -> verify in external DB -> read back
+- [ ] Kraft consensus: cluster formation -> leader election -> write -> node failure -> recovery
+- [ ] lithoglyph: full lifecycle (create -> write -> query -> archive)
+- [ ] VQL: complex query execution with joins/aggregations
+
+### Aspect Tests
+- **Security:** No tests for authentication bypass, unauthorized federation access, injection through VQL, data exfiltration across adapters
+- **Performance:** Rust modality benchmark exists. Missing: Elixir orchestration throughput, Kraft consensus latency, federation adapter comparison benchmarks
+- **Concurrency:** No tests for concurrent writes across Kraft nodes, federation adapter connection pooling, VQL query contention
+- **Error handling:** No tests for adapter connection failure, Kraft split-brain recovery, malformed VQL, storage corruption
+
+### Build & Execution
+- [ ] `mix test` for VeriSimDB Elixir
+- [ ] `cargo test` for VeriSimDB Rust
+- [ ] `gleam test` for lithoglyph
+- [ ] Zig FFI tests
+- [ ] Container-based multi-node tests
+
+### Benchmarks Needed
+- [ ] Write throughput (single node, cluster)
+- [ ] Read latency (hot, cold, cache miss)
+- [ ] Kraft consensus round-trip time
+- [ ] Federation adapter roundtrip per backend
+- [ ] VQL query execution time by complexity
+- [ ] lithoglyph query performance
+- [ ] Replication lag measurement
+
+### Self-Tests
+- [ ] Cluster health self-check
+- [ ] Federation adapter connectivity verification
+- [ ] Data integrity checksums
+- [ ] WAL consistency validation
+
+## Priority
+
+**CRITICAL.** Two database systems with 833 source files and ~52 tests (6.2%). The consensus layer (Kraft) has 4 tests for a distributed consensus protocol — that is dangerously low. Federation adapters have decent unit coverage but zero E2E. lithoglyph appears to have no dedicated tests at all. A database with no concurrency tests is a ticking time bomb.
+
+## FAKE-FUZZ ALERT
+
+- `tests/fuzz/placeholder.txt` is a scorecard placeholder inherited from rsr-template-repo — it does NOT provide real fuzz testing
+- Replace with an actual fuzz harness (see rsr-template-repo/tests/fuzz/README.adoc) or remove the file
+- Priority: P2 — creates false impression of fuzz coverage