Commit 91ed94e

and

committed

Add select_subset.py for stratified benchmark task sampling

Selects representative subsets across suite effect-size buckets (high-positive, near-zero, negative, mixed), stratified by language, difficulty, and codebase size band. Includes --seed for reproducibility, --power-report for DOE validation, and outputs both JSON (for run_selected_tasks.sh --selection-file) and plain-text task lists. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

1 parent 80b927e commit 91ed94eCopy full SHA for 91ed94e

7 files changed

configs
- subset_tasks_n80.json
- subset_tasks_n80.txt
docs/ops
- SCRIPT_INDEX.md
scripts

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit 91ed94e

File tree

0 commit comments