Skip to content

Issues: EleutherAI/lm-evaluation-harness

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[QUESTION] hello, I has a problem…
#2477 opened Nov 10, 2024 by sofiaserkhir
task load return error
#2466 opened Nov 7, 2024 by pod2c
Why is using vLLM via lm-eval-harness slower than using vLLM directly? asking questions For asking for clarification / support on library usage.
#2445 opened Oct 30, 2024 by WuXnkris
Improve preprocessing for paws-x and xnli tasks feature request A feature that isn't implemented yet. good first issue Good for newcomers
#2442 opened Oct 30, 2024 by zxcvuser
GPU with GGFU LLM
#2429 opened Oct 25, 2024 by Znbne
Llama3.1-8B-Instruct evaluation fails asking questions For asking for clarification / support on library usage.
#2428 opened Oct 25, 2024 by Isaaclgz
How is the MMLU accuracy calculated here? asking questions For asking for clarification / support on library usage.
#2425 opened Oct 24, 2024 by yuqinan
test speculative decode accuracy asking questions For asking for clarification / support on library usage.
#2424 opened Oct 24, 2024 by baoqianmagik
Question related to how to use the validation and training splits. asking questions For asking for clarification / support on library usage.
#2423 opened Oct 24, 2024 by sorobedio
bbh_zeroshot fails during to a custom filter issue. bug Something isn't working.
#2422 opened Oct 23, 2024 by shamanez
How to evaluation openai model?
#2416 opened Oct 21, 2024 by 9mean2
ProTip! Type g i on any issue or pull request to go back to the issue listing page.