Model options: llama3.1-8b-it, llama3.2-3b-it, llama3.2-1b-it, qwen2.5-7b-it, qwen2.5-3b-it
Dataset options: gsm8k, arithmetic, mbpp, humaneval_instruct, hellaswag, boolq, arc_challenge, mmlu, swearing, rhyming
Get baseline accuracy of GSM8K on the Llama 3.1 8B model:
python run_evals.py --single-run --model llama3.1-8b-it --task gsm8k

Accuracy with heads L16H21 and L15H13 ablated:
python run_evals.py --single-run --model llama3.1-8b-it --task gsm8k --layerid 16 15 --headid 21 13

Perform compressed sensing with 100 masks, each with 0.02 sparsity. Include the --stratified flag for stratified sampling in the measurement matrix; otherwise it defaults to Bernoulli sampling:
python compressed_sensing.py --model llama3.1-8b-it --task gsm8k --nmasks 100 --sparsity 0.02 --num-samples 100 --stratified

Greedy search via comprehensive ablation of all heads:
python run_evals.py --model llama3.1-8b-it --task gsm8k --num-samples 100

Greedy search that additionally always knocks out L16H21 (i.e., for subsequent iterations of iterative greedy search), with checkpointing to save intermediate results:
python run_evals.py --model llama3.1-8b-it --task gsm8k --num-samples 100 --extra-layers 16 --extra-heads 21 --checkpoint

After performing greedy search, print the list of heads whose ablation yielded the lowest accuracy on the target task:
python get_heads.py --filename {saved filepath here}
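For intuition, here is a minimal sketch of the two mask-sampling schemes the compressed-sensing flags suggest. This is illustrative, not the actual internals of compressed_sensing.py: it assumes "Bernoulli" means each head is ablated independently with probability equal to the sparsity, and that "stratified" means a fixed number of heads is ablated in every layer. The layer/head counts are hypothetical.

```python
import numpy as np

def bernoulli_masks(n_masks, n_layers, n_heads, sparsity, rng):
    # Each entry is 1 (ablate this head) independently with prob `sparsity`.
    return (rng.random((n_masks, n_layers * n_heads)) < sparsity).astype(int)

def stratified_masks(n_masks, n_layers, n_heads, sparsity, rng):
    # Ablate the same number of heads in every layer, so no layer is
    # over- or under-sampled by chance (assumed meaning of --stratified).
    k = max(1, round(sparsity * n_heads))
    masks = np.zeros((n_masks, n_layers, n_heads), dtype=int)
    for m in range(n_masks):
        for layer in range(n_layers):
            masks[m, layer, rng.choice(n_heads, k, replace=False)] = 1
    return masks.reshape(n_masks, n_layers * n_heads)

rng = np.random.default_rng(0)
# 100 masks over a hypothetical 32-layer, 32-head model at 0.02 sparsity:
bern = bernoulli_masks(100, 32, 32, 0.02, rng)    # ~20 heads ablated per mask
strat = stratified_masks(100, 32, 32, 0.02, rng)  # exactly 1 head per layer
```

Each mask would then be applied during evaluation, and the resulting per-mask accuracies fed to a sparse-recovery solver (e.g., lasso regression) to estimate each head's contribution to task accuracy.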