huggingface / lighteval Public

Notifications You must be signed in to change notification settings
Fork 444
Star 2.4k

Code
Issues 209
Pull requests 86
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: huggingface/lighteval

Labels 14 Milestones 0

New pull request New

86 Open 643 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add --load-tasks-multilingual and fix --custom-tasks for inspect backend

#1199 opened Mar 25, 2026 by dzautner

Loading…

4 tasks done

[Bugfix] Check all responses when n>1 instead of only the first one

#1197 opened Mar 23, 2026 by eldarkurtic

Loading…

[Litellm Enhancement] Enable extra sampling args for litellm backend

#1195 opened Mar 20, 2026 by eldarkurtic

Loading…

[Bugfix] presence_penalty is silently dropped from sampling args in litellm backend

#1193 opened Mar 18, 2026 by eldarkurtic

Loading…

[Bugfix] litellm backend should iterate over docs in a split not entire dataset

#1192 opened Mar 18, 2026 by eldarkurtic

Loading…

Remove deprecated prompt_token_ids wrapping in vLLM backend

#1191 opened Mar 18, 2026 by sihyeonn

Loading…

Fix litellm connection pool limiting concurrent_requests

#1190 opened Mar 18, 2026 by sihyeonn

Loading…

feat(utils): show count of evaluated samples in Markdown summary table

#1188 opened Mar 13, 2026 by anzzyspeaksgit

Loading…

Fix typos in math_comparison.py and sample_comparison.py

#1186 opened Mar 12, 2026 by joshuaswanson

Loading…

squad_v2: include unanswerable questions in evaluation

#1185 opened Mar 9, 2026 by Matteovanypersele

Loading…

Update vllm version requirement to 0.17.0

#1183 opened Mar 9, 2026 by NathanHB

Loading…

Fail fast on non-retriable LiteLLM status codes

#1182 opened Mar 8, 2026 by yangbaechu

Loading…

Redact model config credentials in saved and returned results

#1181 opened Mar 8, 2026 by yangbaechu

Loading…

fix(normalizations): guard against index out of range in LogProbToken…

#1180 opened Mar 6, 2026 by inakiLakunza

Loading…

Korean completed and Basque fixed

#1179 opened Mar 6, 2026 by inakiLakunza

Loading…

[LiteLLM] Add cross-provider reasoning_effort support (first step) + token budget fixes

#1178 opened Mar 5, 2026 by dyurchenko98

Loading…

Fix LiteLLM split iteration in greedy_until to avoid duplicate API requests

#1177 opened Mar 5, 2026 by dyurchenko98

Loading…

Fix corpus reference orientation for chrF/chrF++/TER metrics

#1176 opened Mar 5, 2026 by dyurchenko98

Loading…

FIX : handle empty choices in Doc.get_golds() to prevent IndexError

#1174 opened Feb 23, 2026 by nandeanie

Loading…

Fix: pass through custom_tasks and enable multilingual in eval command

#1172 opened Feb 19, 2026 by dzautner

Loading…

2 tasks done

Fix IndexError in LogProbTokenNorm when choices_tokens is shorter than choices_logprob

#1171 opened Feb 18, 2026 by worksbyfriday

Loading…

Add jfinqa: Japanese Financial Numerical Reasoning QA

#1169 opened Feb 17, 2026 by ajtgjmdjp

Loading…

2 of 3 tasks

fix: restore task list display logic

#1166 opened Feb 10, 2026 by s1eeping-king

Loading…

fix: Transformers Model no template cast stop_sequences to list

#1165 opened Feb 7, 2026 by mrsndmn

Loading…

Fix TypeError in aa_omniscience_prompt

#1161 opened Jan 22, 2026 by pjavanrood

Loading…

Previous 1 2 3 4 Next

Previous Next

ProTip! Filter pull requests by the default branch with base:main.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!