
Ifeval: Download punkt_tab on rank 0 #2267

Open
wants to merge 4 commits into base: main
Conversation

baberabb
Contributor

Closes #2266. Also removed the pkg_resources dependency, as that's deprecated.
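As a hedged aside on the pkg_resources removal: the standard-library importlib.metadata is the usual replacement when pkg_resources was only used to look up installed distribution versions. A minimal sketch under that assumption (installed_version is an illustrative helper, not code from this PR):

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent.

    Replaces pkg_resources.get_distribution(package).version without
    pulling in setuptools at runtime.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None
```

If pkg_resources was instead used for bundled data files, importlib.resources would be the analogous stdlib replacement.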

nltk.download("punkt_tab")
print("Downloaded punkt_tab")
else:
time.sleep(5)
Contributor Author

baberabb commented Aug 30, 2024


This isn't really necessary, as the code runs at the beginning (before generations), but it couldn't hurt.


Suggestion: if the sleep is not needed, we should not add it.


al093 commented Aug 30, 2024

Sharing a slightly more verbose version.
I would do this:

import logging
import os

import nltk
import torch

logger = logging.getLogger(__name__)


def download_nltk_resources_guarded() -> None:
    """Download the 'punkt_tab' tokenizer on local rank 0 only.

    Guard the download with a distributed barrier; otherwise a race
    condition can occur when multiple processes try to download the
    same resource at once.
    """
    local_rank = int(os.environ.get("LOCAL_RANK", "0"))

    if local_rank == 0:
        try:
            nltk.data.find("tokenizers/punkt_tab")
        except LookupError:
            logger.info(f"Local rank {local_rank}: Downloading NLTK 'punkt_tab' resource.")
            nltk.download("punkt_tab")
            logger.info(f"Local rank {local_rank}: Downloaded NLTK 'punkt_tab' resource.")

    if torch.distributed.is_initialized():
        torch.distributed.barrier()
    try:
        nltk.data.find("tokenizers/punkt_tab")
    except LookupError:
        logger.error(
            f"Local rank {local_rank}: NLTK 'punkt_tab' resource not found. "
            "This should have been downloaded by local rank 0."
        )
        raise

al093 commented Aug 30, 2024

Suggestion: I would rethink the current download-on-import behaviour.

@baberabb
Contributor Author

@al093, thanks very much for your suggestion. I'm hesitant to add a torch dependency here, as none of the tasks currently require it. I removed time.sleep() as you suggested, and we do have a barrier later in the evaluation loop. As an extra layer, we could add a condition asking the user to manually download and cache the data: please run python -c "import nltk; nltk.download('punkt_tab')". Thoughts @haileyschoelkopf ?
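For the torch-free direction discussed above, one possible sketch is a cross-process guard built on an atomic mkdir lock, so no distributed framework is required. The helper below (ensure_resource is an illustrative name, not code from this PR) runs the download callable in exactly one process and makes the others poll until the resource exists:

```python
import os
import time
from typing import Callable


def ensure_resource(
    check: Callable[[], bool],
    download: Callable[[], None],
    lock_dir: str,
    timeout: float = 60.0,
    poll: float = 0.1,
) -> None:
    """Run `download` in exactly one process; others wait until `check` passes.

    Uses os.mkdir as a cross-process lock: mkdir is atomic, so exactly
    one process succeeds and performs the download. No torch dependency.
    """
    if check():
        return
    try:
        os.mkdir(lock_dir)  # atomic: only one process gets here
    except FileExistsError:
        # Another process holds the lock; poll until the resource appears.
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            if check():
                return
            time.sleep(poll)
        raise TimeoutError("resource did not appear before the timeout")
    try:
        download()
    finally:
        os.rmdir(lock_dir)
```

For this PR the callables could be, e.g., a check wrapping nltk.data.find("tokenizers/punkt_tab") in a try/except LookupError, and download=lambda: nltk.download("punkt_tab"). Note the lock directory would need to live on a filesystem shared by all local processes.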


Successfully merging this pull request may close these issues.

IFEval fails when multiple gpus are used (for DDP)
2 participants