
Update Qwen3 235B B300 Configs to match Qwen3 B200 Configs#2669

Open
rhmukundan wants to merge 3 commits into main from rmukundan/match_qwen3_235_b300_b200_configs

Conversation

@rhmukundan
Contributor

rhmukundan commented Mar 5, 2026

Summary by CodeRabbit

  • Chores
    • Updated configuration settings for model workload optimization.

Note: This release contains only internal configuration adjustments with no user-facing changes.

Signed-off-by: Raghav Hrishikeshan Mukundan <rmukundan@nvidia.com>
@copy-pr-bot

copy-pr-bot bot commented Mar 5, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

rhmukundan self-assigned this Mar 5, 2026
rhmukundan marked this pull request as ready for review March 5, 2026 at 21:50
@coderabbitai
Contributor

coderabbitai bot commented Mar 5, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

  • Configuration used: .coderabbit.yaml
  • Review profile: CHILL
  • Plan: Pro
  • Run ID: ebcb689e-d287-4108-aa45-34de7ea40655

📥 Commits

Reviewing files that changed from the base of the PR and between b730ab6 and a759cdf.

📒 Files selected for processing (1)
  • scripts/performance/configs/qwen/qwen3_workload_base_configs.py

📝 Walkthrough

This PR modifies QWEN3 workload configurations by disabling moe_a2a_overlap across two configuration variants and removing an explicit virtual_pipeline_model_parallel_size setting from the FP8_CS_V1 configuration.

Changes

Cohort / File(s): QWEN3 Workload Configuration — scripts/performance/configs/qwen/qwen3_workload_base_configs.py
Summary: Disabled moe_a2a_overlap (changed from True to False) in the QWEN3_235B_A22B_PRETRAIN_CONFIG_B300_BF16_V1 and QWEN3_235B_A22B_PRETRAIN_CONFIG_B300_FP8_CS_V1 configurations. Removed virtual_pipeline_model_parallel_size=4 from the FP8_CS_V1 variant.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes
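For illustration only, a minimal sketch of what the two B300 configs described in the walkthrough might look like after this PR. The real QWEN3_235B_A22B_PRETRAIN_CONFIG_B300_* objects in qwen3_workload_base_configs.py are not necessarily plain dicts, and any field not shown in this PR's diff excerpt is an assumption.

```python
# Hedged sketch of the post-PR state; field names are taken from the
# diff excerpt in this PR, everything else is assumed for illustration.

QWEN3_235B_A22B_PRETRAIN_CONFIG_B300_BF16_V1 = dict(
    expert_model_parallel_size=8,
    global_batch_size=1024,
    moe_a2a_overlap=False,  # was True; disabled to match the B200 config
)

QWEN3_235B_A22B_PRETRAIN_CONFIG_B300_FP8_CS_V1 = dict(
    expert_model_parallel_size=8,  # assumed; not shown in the PR excerpt
    global_batch_size=1024,        # assumed; not shown in the PR excerpt
    moe_a2a_overlap=False,         # was True
    # virtual_pipeline_model_parallel_size=4 was removed entirely,
    # so the framework default now applies.
)
```

Note that removing virtual_pipeline_model_parallel_size (rather than setting it) means the FP8_CS_V1 variant now inherits whatever default the training framework uses, which is how the B200 config behaves.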

Suggested labels

performance

Suggested reviewers

  • malay-nagda
  • dingqingy-nv
  • ko3n1g
🚥 Pre-merge checks | ✅ 4

✅ Passed checks (4 passed)
  • Description Check — Passed. Check skipped: CodeRabbit's high-level summary is enabled.
  • Title Check — Passed. The title accurately describes the main change: updating Qwen3 235B B300 configurations to match B200 configurations, which aligns with the modifications to the moe_a2a_overlap and virtual_pipeline_model_parallel_size settings.
  • Docstring Coverage — Passed. No functions found in the changed files to evaluate docstring coverage; skipping the check.
  • Test Results For Major Changes — Passed. Configuration-only changes standardizing B300 to match B200 parameters; minor adjustments to parallelism settings without code logic modifications.


dingqingy-nv added labels Mar 5, 2026: performance, performance/release (Performance items related with NeMo release), r0.3.0 (Cherry-pick label for r0.3.0 release branch)
  expert_model_parallel_size=8,
  global_batch_size=1024,
- moe_a2a_overlap=True,
+ moe_a2a_overlap=False,

Explain why in the PR description?

yaoyu-33 previously approved these changes Mar 6, 2026
Signed-off-by: Raghav Hrishikeshan Mukundan <rmukundan@nvidia.com>

