First pass over evaluator support by marcromeyn · Pull Request #63 · NVIDIA-NeMo/Nemotron

marcromeyn · 2026-01-23T10:04:05Z

No description provided.

Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>

github-actions · 2026-01-23T10:04:28Z

Documentation Preview

📖 Preview URL: https://NVIDIA-NeMo.github.io/Nemotron/pr-63/

Built from commit 13d3916

athitten · 2026-01-23T19:46:52Z

docs/train/nano3/eval.md

@@ -0,0 +1,267 @@
+# Stage 3: Evaluation
+
+This stage evaluates trained models using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), running standard NLP benchmarks to measure model capabilities.


Suggested change

This stage evaluates trained models using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), running standard NLP benchmarks to measure model capabilities.

This stage evaluates trained models using [NeMo Evaluator](https://github.com/NVIDIA-NeMo/Evaluator/tree/main), running standard NLP benchmarks to measure model capabilities.

athitten · 2026-01-23T19:47:19Z

docs/train/nano3/eval.md

+| `truthfulqa` | Truthfulness evaluation |
+| `gsm8k` | Grade school math |
+
+See [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher) for the full list of available tasks.


Suggested change

See [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher) for the full list of available tasks.

See [NeMo Evaluator](https://github.com/NVIDIA-NeMo/Evaluator/tree/main) for the full list of available tasks.

athitten · 2026-01-23T19:47:55Z

docs/train/nano3/eval.md

+## Reference
+
+- [Evaluation Framework](../evaluator.md) — Full evaluator documentation
+- [NeMo Evaluator Documentation](https://github.com/NVIDIA/nemo-evaluator-launcher) — Launcher reference


Suggested change

- [NeMo Evaluator Documentation](https://github.com/NVIDIA/nemo-evaluator-launcher) — Launcher reference

- [NeMo Evaluator Documentation](https://github.com/NVIDIA-NeMo/Evaluator/tree/main) — Launcher reference

athitten · 2026-01-23T19:48:15Z

docs/train/evaluator.md

@@ -0,0 +1,393 @@
+# Evaluation Framework
+
+The Nemotron evaluation framework provides model evaluation capabilities using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), enabling benchmark testing of trained models on standard NLP tasks.


Suggested change

The Nemotron evaluation framework provides model evaluation capabilities using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), enabling benchmark testing of trained models on standard NLP tasks.

The Nemotron evaluation framework provides model evaluation capabilities using [NeMo Evaluator](https://github.com/NVIDIA-NeMo/Evaluator/tree/main), enabling benchmark testing of trained models on standard NLP tasks.

athitten · 2026-01-23T19:48:36Z

docs/train/evaluator.md

+
+- [Execution through NeMo-Run](./nemo-run.md) — Execution profiles and env.toml
+- [W&B Integration](./wandb.md) — Credentials and artifact tracking
+- [NeMo Evaluator Documentation](https://github.com/NVIDIA/nemo-evaluator-launcher) — Launcher reference


Suggested change

- [NeMo Evaluator Documentation](https://github.com/NVIDIA/nemo-evaluator-launcher) — Launcher reference

- [NeMo Evaluator Documentation](https://github.com/NVIDIA-NeMo/Evaluator/tree/main) — Launcher reference

First pass over evaluator support

0b40b6e

Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>

github-actions bot added a commit that referenced this pull request Jan 23, 2026

docs: preview for PR #63 13d3916

323d4ee

athitten reviewed Jan 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First pass over evaluator support#63

First pass over evaluator support#63
marcromeyn wants to merge 1 commit intodevfrom
romeyn/evaluator

marcromeyn commented Jan 23, 2026

Uh oh!

github-actions bot commented Jan 23, 2026

Uh oh!

athitten Jan 23, 2026

Uh oh!

athitten Jan 23, 2026

Uh oh!

athitten Jan 23, 2026

Uh oh!

athitten Jan 23, 2026

Uh oh!

athitten Jan 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,267 @@
		# Stage 3: Evaluation

		This stage evaluates trained models using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), running standard NLP benchmarks to measure model capabilities.

	See [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher) for the full list of available tasks.
	See [NeMo Evaluator](https://github.com/NVIDIA-NeMo/Evaluator/tree/main) for the full list of available tasks.

	- [NeMo Evaluator Documentation](https://github.com/NVIDIA/nemo-evaluator-launcher) — Launcher reference
	- [NeMo Evaluator Documentation](https://github.com/NVIDIA-NeMo/Evaluator/tree/main) — Launcher reference

		@@ -0,0 +1,393 @@
		# Evaluation Framework

		The Nemotron evaluation framework provides model evaluation capabilities using [NeMo Evaluator](https://github.com/NVIDIA/nemo-evaluator-launcher), enabling benchmark testing of trained models on standard NLP tasks.

Conversation

marcromeyn commented Jan 23, 2026

Uh oh!

github-actions bot commented Jan 23, 2026

Documentation Preview

Uh oh!

athitten Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

athitten Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

athitten Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

athitten Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

athitten Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants