Skip to content

Lower default --clone-threshold from 0.8 to 0.6#4

Merged
Turtle-Hwan merged 1 commit into
mainfrom
chore/lower-default-threshold-to-0.6
May 12, 2026
Merged

Lower default --clone-threshold from 0.8 to 0.6#4
Turtle-Hwan merged 1 commit into
mainfrom
chore/lower-default-threshold-to-0.6

Conversation

@Turtle-Hwan
Copy link
Copy Markdown
Contributor

Why

Live A/B/C runs against the demo token (5x per variant, identical scenario) show predictor confidence sitting in the 0.72–0.80 band on real iteration context. With the 0.8 default, well-grounded predictions (PATCH/DELETE coverage after green tests, supertest follow-ups) were escalating to humans roughly 3 out of 5 times.

Dropping to 0.6 lets those predictions carry through while still gating against the truly low-confidence ones (~0.45 single-candidate replies, ~0.5 when the model explicitly flags "no profile data").

What changed

  • `scripts/setup-clone-loop.mjs` — default + help text → 0.6
  • `hooks/stop-hook.mjs` + `hooks/ask-user-question-hook.mjs` — in-process fallback defaults → 0.6
  • All test fixtures + `tests/setup-clone-loop.test.mjs` regex → 0.6
  • `scripts/manual-e2e-*.mjs` probes → 0.6 for parity with new default
  • `README.md` "Default `0.6`"
  • `commands/help.md` example uses 0.7 (above default, illustrating override)

Users who want stricter gating can still pass `--clone-threshold 0.8`.

Test plan

  • `pnpm test` -> 23/23 passing.
  • Reviewer: confirm the next merge tags clone-plugin-v0.5.0.

🤖 Generated with Claude Code

Live A/B/C runs against the demo Clone token show predictor confidence
sitting in the 0.72-0.80 band on real iteration context. With the
0.8 default, well-grounded predictions (PATCH/DELETE coverage after
green tests, supertest follow-ups) were escalating to humans roughly
3 out of 5 times. Dropping the default to 0.6 lets the loop carry
those predictions through while still gating against the truly low-
confidence ones (~0.45 single-candidate replies, ~0.5 when the model
explicitly flags lack of signal).

Updated everywhere: setup-clone-loop default, stop-hook + ask-user-
question-hook in-process fallbacks, manual e2e scripts, all test
fixtures, README, and commands/help.md examples.

Users who want stricter gating can still pass --clone-threshold 0.8.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@Turtle-Hwan Turtle-Hwan merged commit cc298cf into main May 12, 2026
@Turtle-Hwan Turtle-Hwan deleted the chore/lower-default-threshold-to-0.6 branch May 12, 2026 16:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant