Vmendelev/2512 s2s eval by wprazuch · Pull Request #1246 · NVIDIA-NeMo/Skills

wprazuch · 2026-02-18T10:47:06Z

No description provided.

Signed-off-by: Valentin Mendelev <vmendelev@nvidia.com>

- Remove 727 audio files (712MB) from git tracking - Add data/ directory to .gitignore - Data will be prepared on cluster using prepare.py - Significantly reduces git packaging time

- Update to use evaluation/evaluate.py (not root evaluate.py) - Map subtest names to FDB task names (pause -> pause_handling, etc.) - Add note about ASR transcript requirement

- Add --audio_output_dir to save audio to mmkrtchyan's directory - Fixes Permission denied error when saving to vmendelev's directory

- Set TMPDIR to mmkrtchyan's directory to override hardcoded vmendelev path - Update config to use custom inference yaml

- unified_server.py already supports AUDIO_SAVE_DIR environment variable - Set to mmkrtchyan's directory instead of vmendelev's hardcoded path - This will fix Permission denied errors when saving audio Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

gwarmstrong · 2026-02-27T22:06:21Z

please do not commit cluster configs to this repo

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

… preparation Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

karpnv and others added 30 commits December 20, 2025 07:00

Added audio requests to vLLM models

99ef50a

Intorduced vLLM_multimodal model to save multimodal outputs

05dba7d

Signed-off-by: Valentin Mendelev <vmendelev@nvidia.com>

generation.py to respect separate server type for the client

8621313

Signed-off-by: Valentin Mendelev <vmendelev@nvidia.com>

Unified server to work with NeMo models not supported by vLLM

32daf07

Signed-off-by: Valentin Mendelev <vmendelev@nvidia.com>

s2s incremental backend and session based backend

d4c7ece

s2s_demo test set and evaluation script

9c66bf6

No special handling of data_dir in vllm constructor

da6c481

Fixed VLLM_multimodal

bb67e07

Metircs calculation

991f215

LLM judge

4de7342

Eval starter script, config, comparator, documentation

5635a21

Example cluster config

29a0b87

Voicebench

f984ed1

Added session related fields to incremental config

77ba6c2

Lock for session backend in unified server

90ce0a6

Race in Mamba triton kernels fix in session backend

ef5287c

Removed the lock from unified server

13b0b01

Session backend with unified debug output and fixed audio output

f93e262

Voicebench sd_qa_usa scoring fix

c081ff3

Voicebench related scripts

56964d4

Documentation and minor changes in s2s_demo test set scripts

ec99c5c

Support for the standard OpenAI Chat Completion API audio input

5630986

Generation detection parameter into serve_unified

c0b7f84

Fixed session mechanism to deprecate session_id

0a82387

A switch to get rid of debug info

5c6c727

Return only 1 turn text in multi turn requests

a32779b

Documentation on how to run the server only

b6512f2

Fixed documentation and return asr output with the debug_info

177a93c

Documentation updated

2c17ae3

Voicebench config to run external models

a01e794

melllinia added 13 commits February 6, 2026 18:12

Update cluster paths for fullduplexbench

4507b44

Remove audio data from git, keep only on cluster

bed02f6

- Remove 727 audio files (712MB) from git tracking - Add data/ directory to .gitignore - Data will be prepared on cluster using prepare.py - Significantly reduces git packaging time

Fix FDB scoring to use correct evaluation script path

7ce8c9f

- Update to use evaluation/evaluate.py (not root evaluate.py) - Map subtest names to FDB task names (pause -> pause_handling, etc.) - Add note about ASR transcript requirement

Fix audio output directory permission issue

ce0b41a

- Add --audio_output_dir to save audio to mmkrtchyan's directory - Fixes Permission denied error when saving to vmendelev's directory

Add TMPDIR env var to fix audio save permission issue

4f6a994

- Set TMPDIR to mmkrtchyan's directory to override hardcoded vmendelev path - Update config to use custom inference yaml

cheanup

daf1715

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

cheanup

37a3f97

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

Adding FDB v1.5 and restructuring dirs

58e25fd

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

splitting pause subtask

1975ae1

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

add stereo audio option

be614d6

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

add stereo audio option

0ecc58a

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

Trimming to per-sample input duration to remove batch-padding

a9cbefb

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

melllinia force-pushed the vmendelev/2512_s2s_eval branch from 392e767 to a9cbefb Compare February 18, 2026 16:50

vmendelev and others added 3 commits February 21, 2026 03:39

Script to parallelize execution with s2s_voicechat backend

023cdc6

adding missing scores to FDB v1.5

ba66eb3

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

adding system prompt for MCQ

a4b6435

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

melllinia force-pushed the vmendelev/2512_s2s_eval branch from 4f89364 to a4b6435 Compare February 26, 2026 12:21

adding incremental backend

2b151ed

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

gwarmstrong added the not ready label Feb 27, 2026

gwarmstrong reviewed Feb 27, 2026

View reviewed changes

Comment thread cluster_configs/s2s_eval_oci_iad.yaml

Copy link
Copy Markdown

Collaborator

gwarmstrong Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please do not commit cluster configs to this repo

melllinia added 8 commits March 2, 2026 18:58

hf asr leaderboard and feb26 configs

ce0e4d0

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

adding asr text

b645376

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

adding corpus level wer calculation

cbb6777

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

adding hf normalization

83a65c4

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

trimming back the audio to match the input length

0002426

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

trimming back the audio to match the input length

b1fe941

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

fixing incremental backend postprocessing

ffce093

Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

add S/I/D breakdown to ASR evaluation metrics and fix earnings22 data…

eb78128

… preparation Signed-off-by: mmkrtchyan <mmkrtchyan@nvidia.com>

melllinia force-pushed the vmendelev/2512_s2s_eval branch from 8ce3638 to eb78128 Compare March 9, 2026 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vmendelev/2512 s2s eval#1246

Vmendelev/2512 s2s eval#1246
wprazuch wants to merge 79 commits intomainfrom
vmendelev/2512_s2s_eval

wprazuch commented Feb 18, 2026

Uh oh!

gwarmstrong Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wprazuch commented Feb 18, 2026

Uh oh!

gwarmstrong Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants