Add --mareforma-export post-run flag for signed claim receipts by felipeyanez · Pull Request #1 · mims-harvard/AutoScientists

felipeyanez · 2026-05-29T17:25:39Z

Hi. I read your AutoScientists paper and noticed how every research_insights.md ends with some version of the "single-run, no independent replication" caveat. I've been working on an epistemic substrate (mareforma) where agents write signed, cross-referenced claims, and I think it can help here.

This PR is intentionally small: a single --mareforma-export [RUN_DIR] post-run mode added to launch.py. When set, it walks RUN_DIR for autoscientists_submission/research_insights.md files and signs each one (Ed25519, content-addressable) into a local graph at RUN_DIR/.mareforma/graph.db. When a second lab runs the same task and signs with their own enrolled key, the substrate flags REPLICATED when both converge on the same finding. Today "independent validation" is a phrase in your paper; with this in place it becomes a query: graph.query(text, min_support="REPLICATED").

Tested against all 24 reference submissions in task-biomlbench/: 24 of 24 signed on first run, 0 of 24 re-signed on the second pass (content-keyed idempotency, so editing a file produces a new claim and preserves the old one for audit). Round-trips via mareforma ask return findings with PRELIMINARY trust level and per-task source attribution.

43 lines added to launch.py. Single file changed. Opt-in flag, lazy import, no required deps. The normal bootstrap path is untouched.

The full integration also signs autoscientists.py and autoscientists_submission.csv for tamper-evidence across the whole submission directory, runs as a live event hook during the experiment (instead of post-run only), and supports validator enrollment for the REPLICATED -> ESTABLISHED promotion. Happy to send a follow-up if this direction is useful. Otherwise this stands alone.

Cheers.

Adds a single --mareforma-export [RUN_DIR] flag to launch.py. When set, walks RUN_DIR for autoscientists_submission/research_insights.md files and signs each one (Ed25519, content-addressable) into a local mareforma graph at RUN_DIR/.mareforma/graph.db. Content-keyed idempotency: re-runs on unchanged files are no-ops; editing a file produces a new claim and preserves the old one for audit. The flag is opt-in, the mareforma import is lazy, and the normal bootstrap path is untouched when the flag is absent. Requires: pip install mareforma[clawinstitute]>=0.3.3 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add --mareforma-export post-run flag for signed claim receipts#1

Add --mareforma-export post-run flag for signed claim receipts#1
felipeyanez wants to merge 1 commit into
mims-harvard:mainfrom
felipeyanez:mareforma-export

felipeyanez commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

felipeyanez commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant