Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added TurkishMMLU to LM Evaluation Harness #2283

Open
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

ArdaYueksel
Copy link

@ArdaYueksel ArdaYueksel commented Sep 6, 2024

In this pull request, I would like to add our work TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish to LM Evaluation Harness.
You can find the details of our work in our repository:
https://github.com/ArdaYueksel/TurkishMMLU
Also, our dataset is made available in HuggingFace: https://huggingface.co/datasets/AYueksel/TurkishMMLU

Key Features:

  • MMLU variant in Turkish Language
  • Has a separate Development Set.
  • Chain-of-Thought Configuration is available.

@CLAassistant
Copy link

CLAassistant commented Sep 6, 2024

CLA assistant check
All committers have signed the CLA.

Copy link
Contributor

@lintangsutawika lintangsutawika left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks real good. Thanks!

Copy link
Collaborator

@haileyschoelkopf haileyschoelkopf left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi there! One last thing: could you also add an entry to lm_eval/tasks/README.md describing the task in 1 sentence as is done for the other entries in that table (mentioning in this sentence that your dataset is not translated from MMLU and not machine-translated!), and note that the language is Turkish?

We want to make sure others can discover this task and your work more easily!

lm_eval/tasks/turkishmmlu/README.md Outdated Show resolved Hide resolved
lm_eval/tasks/turkishmmlu/README.md Outdated Show resolved Hide resolved
lm_eval/tasks/turkishmmlu/README.md Show resolved Hide resolved
Copy link
Author

@ArdaYueksel ArdaYueksel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated Readme addresses the raised issues

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants