Problem Description
When calling `TaskManager._initialize_tasks`, I noticed that tasks which are managed by their own Python class enter the `_task_index` with `type="python_task"`:
lm-evaluation-harness/lm_eval/tasks/__init__.py
Lines 452 to 457 in 928e8bb
As a consequence, when printing the task list with `lm-eval --tasks list`, these tasks are effectively omitted from every constructed table, since the tables only consider the types `group`, `task`, or `tag`:
lm-evaluation-harness/lm_eval/tasks/__init__.py
Lines 39 to 47 in 928e8bb
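To make the omission concrete, here is a minimal, self-contained sketch. It does not reproduce the code in `lm_eval/tasks/__init__.py`: the `task_index` dictionary, its entry names, and the helper `names_of_type` are hypothetical stand-ins, and only the four `type` values are taken from the description above.

```python
# Hypothetical, simplified stand-in for the task index (not the real lm_eval structure).
task_index = {
    "example_group": {"type": "group"},
    "example_task": {"type": "task"},
    "example_tag": {"type": "tag"},
    "fda": {"type": "python_task"},  # a task managed by its own Python class
}


def names_of_type(index, wanted_type):
    """Return the sorted names whose entry has exactly the given type."""
    return sorted(name for name, entry in index.items() if entry["type"] == wanted_type)


# A listing that builds one table per type and only considers these three types.
listed = {t: names_of_type(task_index, t) for t in ("group", "task", "tag")}

# Anything registered in the index but absent from every table is silently dropped.
all_listed = {name for names in listed.values() for name in names}
missing = sorted(set(task_index) - all_listed)
print(missing)  # ['fda']
```

Because the filter matches on exact type, any entry registered as `python_task` falls through all three tables even though it can still be run by name.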
How to reproduce
A simple way to notice this: take a task that is managed by a Python class, such as `fda`, managed by `task.FDA`. I can run the following:

    lm_eval --model hf \
        --model_args pretrained=EleutherAI/pythia-160m,dtype="float" \
        --tasks fda \
        --device mps \
        --limit 10

However, when I try to find `fda` in the list of tasks, e.g. `lm-eval --tasks list | grep fda`, it doesn't appear in the table.

I would be happy to work on this, but I'd like to hear a maintainer's opinion on whether these tasks should appear in the tables and be treated the same way `task`-type tasks are treated, including their inclusion in the tag handling.
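If maintainers agree that these tasks should be listed, one possible direction is to let the task table accept both types. The sketch below is only an illustration under that assumption: `TASK_LIKE_TYPES`, `task_table_names`, and `example_index` are hypothetical names, not part of lm-evaluation-harness.

```python
# Hypothetical sketch of one possible fix; names and structure are assumptions.
TASK_LIKE_TYPES = ("task", "python_task")


def task_table_names(index, task_types=TASK_LIKE_TYPES):
    """Return sorted names of entries whose type should appear in the task table."""
    return sorted(name for name, entry in index.items() if entry["type"] in task_types)


# Simplified stand-in for the task index.
example_index = {
    "example_task": {"type": "task"},
    "fda": {"type": "python_task"},
    "example_group": {"type": "group"},
}

# With both types accepted, the Python-class task shows up in the table.
print(task_table_names(example_index))  # ['example_task', 'fda']

# Restricting to "task" alone reproduces the omission reported here.
print(task_table_names(example_index, task_types=("task",)))  # ['example_task']
```

Whether `python_task` entries should also take part in tag handling is a separate decision that this sketch does not address.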