Tasks of type `python_task` are not listed in `lm-eval --tasks list` #2268

giuliolovisotto · 2024-08-30T15:08:15Z

Problem Description

When calling TaskManager._initialize_tasks I noticed that tasks which are managed by their own python class enter the _task_index with type="python_task":

lm-evaluation-harness/lm_eval/tasks/__init__.py

Lines 452 to 457 in 928e8bb

    
           if self._config_is_python_task(config): 
        
               # This is a python class config 
        
               tasks_and_groups[config["task"]] = { 
        
                   "type": "python_task", 
        
                   "yaml_path": yaml_path, 
        
               }

As a consequence, when printing the task list lm-eval --tasks list these are effectively omitted from any constructed table, which only consider types group, task or tag

lm-evaluation-harness/lm_eval/tasks/__init__.py

Lines 39 to 47 in 928e8bb

    
           self._all_groups = sorted( 
        
               [x for x in self._all_tasks if self._task_index[x]["type"] == "group"] 
        
           ) 
        
           self._all_subtasks = sorted( 
        
               [x for x in self._all_tasks if self._task_index[x]["type"] == "task"] 
        
           ) 
        
           self._all_tags = sorted( 
        
               [x for x in self._all_tasks if self._task_index[x]["type"] == "tag"] 
        
           )

How to reproduce

Simple way to notice this, take a task that is managed by a Python class, such as fda managed by task.FDA, I can run the following:

lm_eval --model hf  \
    --model_args pretrained=EleutherAI/pythia-160m,dtype="float" \
    --tasks fda \
    --device mps \
    --limit 10

Allthough, when I try to see fda in the list of tasks, e.g. lm-eval --tasks list | grep fda this doesn't appear in the table.

I would be happy to work on this but I'd like to hear a maintainer's opinion regarding whether these tasks should appear in tables and be treated the same way task-type tasks are treated, including their inclusion in the tag handling.

The text was updated successfully, but these errors were encountered:

haileyschoelkopf · 2024-09-09T12:50:20Z

Hi! This check

 self._all_subtasks = sorted( 
     [x for x in self._all_tasks if self._task_index[x]["type"] == "task"] 
 )

should indeed be

 self._all_subtasks = sorted( 
     [x for x in self._all_tasks if self._task_index[x]["type"] in ["task", "python_task"]] 
 )

.

If you have the bandwidth to PR and test a fix, we'd be grateful, but will fix this ourselves if not!

giuliolovisotto linked a pull request Sep 10, 2024 that will close this issue

Treat tags in python tasks the same as yaml tasks #2288

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tasks of type `python_task` are not listed in `lm-eval --tasks list` #2268

Tasks of type `python_task` are not listed in `lm-eval --tasks list` #2268

giuliolovisotto commented Aug 30, 2024 •

edited

Loading

haileyschoelkopf commented Sep 9, 2024

Tasks of type python_task are not listed in lm-eval --tasks list #2268

Tasks of type python_task are not listed in lm-eval --tasks list #2268

Comments

giuliolovisotto commented Aug 30, 2024 • edited Loading

Problem Description

How to reproduce

haileyschoelkopf commented Sep 9, 2024

Tasks of type `python_task` are not listed in `lm-eval --tasks list` #2268

Tasks of type `python_task` are not listed in `lm-eval --tasks list` #2268

giuliolovisotto commented Aug 30, 2024 •

edited

Loading