Pull requests: huggingface/lighteval

Pull requests list

Upgrade vLLM from 0.10.1.1 to 0.14.1
#1173 opened Feb 19, 2026 by NathanHB
Fix: pass through custom_tasks and enable multilingual in eval command
#1172 opened Feb 19, 2026 by dzautner
2 tasks done
Add jfinqa: Japanese Financial Numerical Reasoning QA
#1169 opened Feb 17, 2026 by ajtgjmdjp
2 of 3 tasks
fix: restore task list display logic
#1166 opened Feb 10, 2026 by s1eeping-king
Fix TypeError in aa_omniscience_prompt
#1161 opened Jan 22, 2026 by pjavanrood
Fix split loading error in bigbench
#1159 opened Jan 22, 2026 by pjavanrood
Fix RecursionError in imdb_contrastset_prompt
#1155 opened Jan 22, 2026 by pjavanrood
Fix non-existent evaluation splits in lextreme
#1151 opened Jan 22, 2026 by pjavanrood
Fix evaluation split config in lsat_qa
#1149 opened Jan 22, 2026 by pjavanrood
Improve NarrativeQA metrics and prompt structure
#1147 opened Jan 22, 2026 by pjavanrood
Fix key mismatch and context access in PubMedQA
#1143 opened Jan 22, 2026 by pjavanrood
Fix TypeError in real_toxicity_prompts
#1141 opened Jan 22, 2026 by pjavanrood
Fix column mismatch and metric in SimpleQA
#1139 opened Jan 22, 2026 by pjavanrood
Fix subset names in StoryCloze
#1137 opened Jan 22, 2026 by pjavanrood
Fix hardcoded path in tiny_benchmarks
#1133 opened Jan 22, 2026 by pjavanrood
Fix KeyError in truthful_qa_generative_prompt
#1131 opened Jan 22, 2026 by pjavanrood
Fix specific error in truthfulqa
#1127 opened Jan 22, 2026 by ChenZiHong-Gavin