Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
282 changes: 0 additions & 282 deletions docs/evaluation/eval-kit.md

This file was deleted.

1 change: 0 additions & 1 deletion docs/evaluation/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,6 @@ We support many popular benchmarks and it's easy to add new in the future. The f
- [**Multilingual**](./multilingual.md): e.g. [mmlu-prox](./multilingual.md#mmlu-prox), [flores-200](./multilingual.md#flores-200), [wmt24pp](./multilingual.md#wmt24pp)
- [**Speech & Audio**](./speech-audio.md): e.g. [asr-leaderboard](./speech-audio.md#asr-leaderboard), [mmau-pro](./speech-audio.md#mmau-pro)
- [**Vision-Language Models (VLM)**](./vlm.md): e.g. [mmmu-pro](./vlm.md#mmmu-pro)
- [**VLMEvalKit Integration (eval_kit)**](./eval-kit.md): Run VLMEvalKit benchmarks via Megatron in-process or vLLM
- [**Speculative Decoding (SD)**](./speculative-decoding.md): e.g. [SPEED-Bench](./speculative-decoding.md#SPEED-Bench)

See [nemo_skills/dataset](https://github.com/NVIDIA-NeMo/Skills/blob/main/nemo_skills/dataset) where each folder is a benchmark we support.
Expand Down
45 changes: 0 additions & 45 deletions nemo_skills/dataset/eval_kit/__init__.py

This file was deleted.

7 changes: 0 additions & 7 deletions nemo_skills/dataset/utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -161,13 +161,6 @@ def _load_external_dataset(dataset_path):

def get_default_dataset_module(dataset):
data_path = "/nemo_run/code/nemo_skills/dataset"

# For dotted names like eval_kit.MMBench_DEV_EN, import the parent package.
# The sub-benchmark part is handled by the module's get_extra_generation_args().
if dataset.startswith("eval_kit."):
dataset_module = importlib.import_module("nemo_skills.dataset.eval_kit")
return dataset_module, data_path

dataset_module = importlib.import_module(f"nemo_skills.dataset.{dataset}")

return dataset_module, data_path
Expand Down
Loading