-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
IFEval fails when multiple gpus are used (for DDP) #2266
Comments
One workaround for this issue is to download the nltk resources in a desired safer manner before. |
Hi! Thanks for the reporting the issue! The PR should handle this. Thought the simplest way is to check for the |
Thanks for the PR. I gave suggestion in the PR. I am not sure about the need of downloading nltk tokenizers when the module is imported. If possible it should be refactored. |
I also ran into the punkt_tab problem
I have the data in my local folder, and I can even load the tokenizer locally
But I got the above error when I ran it with Is it related to a race condition? |
can we get an update on this (either merge the existing PR for fixing this issue or create a new one if needed?) happy to work on it but this issue is blocking multi-GPU evals for me |
#2267 should fix it. As a workaround you could run |
While doing IFEval, the lib downloads NLTK tokenizers. This is an issue when multiple processes are used (for eg. in a DDP inference), because the download is done by each process. i think this leads to race conditions and causes the following issue:
I think,
I used the main to produce this issue: (commit: 8138fd5)
The text was updated successfully, but these errors were encountered: