grounded_evolution/ROADMAP.md at main · NullLabTests/grounded_evolution

Eval harness rewrite — parameterized benchmarks so the score is meaningful across runs

Population persistence v2 — SQLite backend instead of JSON for concurrent access

Execution cost tracking — log LLM tokens, wall time, and per-cycle cost to enable regression-aware pruning

Prompt diversity enforcement — deduplicate near-identical prompts before insertion

Multi-provider LLM support — graceful fallback if one provider is rate-limited

Explicit mutation hyper-parameters — expose mutation rate, crossover rate, tournament size as CLI/env overrides

Regression tests in CI — run the evaluator/ suite on every PR
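
As an illustration of the "Explicit mutation hyper-parameters" item above, here is a minimal sketch of how CLI flags and environment variables might override defaults, with CLI taking precedence over env, and env over defaults. Everything named here is an assumption for illustration and is not taken from the repository: the `MutationConfig` class, the `load_config` function, the `GE_` environment prefix, the flag names, and the default values.

```python
import argparse
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class MutationConfig:
    mutation_rate: float
    crossover_rate: float
    tournament_size: int


# Hypothetical defaults and env prefix; the real project may use other values.
DEFAULTS = {"mutation_rate": 0.1, "crossover_rate": 0.7, "tournament_size": 3}
CASTS = {"mutation_rate": float, "crossover_rate": float, "tournament_size": int}
ENV_PREFIX = "GE_"


def load_config(argv=None, env=None):
    """Resolve each hyper-parameter as: CLI flag, then env var, then default."""
    env = os.environ if env is None else env

    parser = argparse.ArgumentParser(description="Evolution hyper-parameters")
    for name, cast in CASTS.items():
        # Default of None lets us tell "flag not given" apart from a real value.
        parser.add_argument("--" + name.replace("_", "-"), type=cast, default=None)
    args = parser.parse_args(argv)

    resolved = {}
    for name, cast in CASTS.items():
        cli_value = getattr(args, name)
        env_value = env.get(ENV_PREFIX + name.upper())
        if cli_value is not None:
            resolved[name] = cli_value
        elif env_value is not None:
            resolved[name] = cast(env_value)
        else:
            resolved[name] = DEFAULTS[name]

    # Basic sanity checks so a bad override fails fast instead of mid-run.
    for rate in ("mutation_rate", "crossover_rate"):
        if not 0.0 <= resolved[rate] <= 1.0:
            raise ValueError(f"{rate} must be in [0, 1], got {resolved[rate]}")
    if resolved["tournament_size"] < 1:
        raise ValueError("tournament_size must be at least 1")

    return MutationConfig(**resolved)


if __name__ == "__main__":
    print(load_config())
```

Under these assumptions, running the script with `GE_TOURNAMENT_SIZE=5` set and `--mutation-rate 0.25` passed would take the mutation rate from the CLI, the tournament size from the environment, and the crossover rate from the defaults.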
