Add --mareforma-export post-run flag for signed claim receipts #1

Open
felipeyanez wants to merge 1 commit into mims-harvard:main from felipeyanez:mareforma-export

@felipeyanez

Hi. I read your AutoScientists paper and noticed how every research_insights.md ends with some version of the "single-run, no independent replication" caveat. I've been working on an epistemic substrate (mareforma) where agents write signed, cross-referenced claims, and I think it can help here.

This PR is intentionally small: a single --mareforma-export [RUN_DIR] post-run mode added to launch.py. When set, it walks RUN_DIR for autoscientists_submission/research_insights.md files and signs each one (Ed25519, content-addressable) into a local graph at RUN_DIR/.mareforma/graph.db. If a second lab runs the same task and signs with its own enrolled key, the substrate marks the finding REPLICATED once both converge on it. Today "independent validation" is a phrase in your paper; with this in place it becomes a query: graph.query(text, min_support="REPLICATED").
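To make the walk-and-address step concrete, here is a minimal sketch using only the standard library. It is not the code in this PR and does not call mareforma: `find_insight_files` and `content_address` are hypothetical helper names, SHA-256 is an assumed choice of content hash, and the Ed25519 signing step is left out entirely.

```python
import hashlib
from pathlib import Path

# Layout taken from the PR description: <RUN_DIR>/.../autoscientists_submission/research_insights.md
SUBMISSION_DIR = "autoscientists_submission"
INSIGHTS_FILE = "research_insights.md"


def find_insight_files(run_dir):
    """Return every research_insights.md that sits directly inside an
    autoscientists_submission/ directory under run_dir, sorted by path."""
    root = Path(run_dir)
    return sorted(
        p
        for p in root.rglob(INSIGHTS_FILE)
        if p.is_file() and p.parent.name == SUBMISSION_DIR
    )


def content_address(path):
    """Hex SHA-256 of the file bytes (assumed hash; hypothetical helper).
    Identical content always maps to the same address."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()
```

A real export would hand each file, or its address, to the signing layer; the sketch stops at discovery and hashing so that nothing about the mareforma API is assumed.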

Tested against all 24 reference submissions in task-biomlbench/: 24 of 24 signed on first run, 0 of 24 re-signed on the second pass. Idempotency is content-keyed: re-runs on unchanged files are no-ops, while editing a file produces a new claim and preserves the old one for audit. Round-trips via mareforma ask return findings with PRELIMINARY trust level and per-task source attribution.
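The idempotency behavior described above can be illustrated with a small standalone sketch. This is not mareforma's storage layer: the `claims` table, `open_store`, and `record_claim` are hypothetical, and the store is keyed on a SHA-256 digest of the content alone, which is an assumption made for illustration.

```python
import hashlib
import sqlite3


def open_store(db_path=":memory:"):
    """Open (or create) a tiny claim store keyed by content digest.
    Hypothetical schema, not the real graph.db layout."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS claims ("
        " digest TEXT PRIMARY KEY,"
        " source TEXT NOT NULL,"
        " body BLOB NOT NULL)"
    )
    return conn


def record_claim(conn, source, body):
    """Store the claim unless a claim with the same digest already exists.
    Returns True when a new row was written, False on a no-op."""
    digest = hashlib.sha256(body).hexdigest()
    cur = conn.execute(
        "INSERT OR IGNORE INTO claims (digest, source, body) VALUES (?, ?, ?)",
        (digest, source, body),
    )
    conn.commit()
    return cur.rowcount == 1
```

Because the primary key is the digest, a second pass over unchanged bytes writes nothing, while edited bytes hash to a new key and land in a new row next to the old one. One simplification to note: with the digest as the only key, byte-identical files from two different sources would collapse into a single row.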

43 lines added to launch.py. Single file changed. Opt-in flag, lazy import, no required deps. The normal bootstrap path is untouched.
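For readers who want to see the opt-in, lazy-import pattern in isolation, here is a minimal sketch. It is not the 43 lines in this PR: `build_parser` and `maybe_export` are hypothetical names, and defaulting RUN_DIR to "." when the flag is given without a value is an assumption about what the optional argument means.

```python
import argparse
import importlib


def build_parser():
    """Parser with an opt-in flag whose RUN_DIR argument is optional."""
    parser = argparse.ArgumentParser(prog="launch.py")
    parser.add_argument(
        "--mareforma-export",
        nargs="?",
        const=".",      # assumed fallback when the flag is given without RUN_DIR
        default=None,   # flag absent: export is skipped entirely
        metavar="RUN_DIR",
    )
    return parser


def maybe_export(args, importer=importlib.import_module):
    """Import mareforma only when the flag was passed.
    Returns None when the flag is absent, else (module, run_dir)."""
    if args.mareforma_export is None:
        return None  # normal bootstrap path: no import is attempted
    try:
        module = importer("mareforma")
    except ImportError as exc:
        raise SystemExit(
            'mareforma is not installed; try: '
            'pip install "mareforma[clawinstitute]>=0.3.3"'
        ) from exc
    return module, args.mareforma_export
```

Keeping the import inside the flag branch means users who never pass the flag do not need the package installed, and the `importer` parameter lets the branch be tested without it.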

The full integration also signs autoscientists.py and autoscientists_submission.csv for tamper-evidence across the whole submission directory, runs as a live event hook during the experiment (instead of post-run only), and supports validator enrollment for the REPLICATED -> ESTABLISHED promotion. Happy to send a follow-up if this direction is useful. Otherwise this stands alone.

Cheers.

Adds a single --mareforma-export [RUN_DIR] flag to launch.py. When set,
walks RUN_DIR for autoscientists_submission/research_insights.md files
and signs each one (Ed25519, content-addressable) into a local mareforma
graph at RUN_DIR/.mareforma/graph.db. Content-keyed idempotency: re-runs
on unchanged files are no-ops; editing a file produces a new claim and
preserves the old one for audit.

The flag is opt-in, the mareforma import is lazy, and the normal
bootstrap path is untouched when the flag is absent.

Requires: pip install "mareforma[clawinstitute]>=0.3.3"

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>