Commit 91ed94e
Add select_subset.py for stratified benchmark task sampling
Selects representative subsets across suite effect-size buckets
(high-positive, near-zero, negative, mixed), stratified by language,
difficulty, and codebase size band. Includes --seed for reproducibility,
--power-report for DOE validation, and outputs both JSON (for
run_selected_tasks.sh --selection-file) and plain-text task lists.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 80b927e commit 91ed94e
7 files changed
Lines changed: 7374 additions & 1 deletion
0 commit comments