Skip to content

Parity with VLLM over the reasoning field#3873

Merged
tdene merged 4 commits intoNVIDIA:mainfrom
tdene:tde/reasoning_parity_vllm
Mar 18, 2026
Merged

Parity with VLLM over the reasoning field#3873
tdene merged 4 commits intoNVIDIA:mainfrom
tdene:tde/reasoning_parity_vllm

Conversation

@tdene
Copy link
Contributor

@tdene tdene commented Mar 15, 2026

What does this PR do ?

⚠️ For major changes (either in lines of code or in its impact), please make sure to first share a design doc with the team. If you're unsure what's the best way to do so, contact the @mcore-oncall.

Contribution process

Pre-checks

  • I have added relevant unit tests
  • I have added relevant functional tests
  • I have added proper typing to my code Typing guidelines
  • I have added relevant documentation
  • I have run the autoformatter.sh on my PR

Code review

Feel free to message or comment the @mcore-oncall to help accelerate your merge into main. The less complex your PR is, the faster it will be approved and merged!

All PRs start as draft. If you open a non-draft PR, it will be automatically converted to draft.

Step 1: Mark PR as "Ready for Review"

  1. When your PR is ready, click Ready for Review.
  2. An oncall reviewer is auto-assigned and expert reviewers are notified based on your changes.
    • Some PRs may jump straight to step 2. This is determined by .github/CODEOWNERS.

⚠️ Only mark as ready once merge-conflicts are resolved and the CI is passing.
Final Review might get declined if these requirements are not fulfilled.

Step 2: Final Review

For PRs that change megatron/core, once all expert reviewers have approved, the Final Review label is applied automatically and final reviewers are assigned.

For PRs outside megatron/core, this step is skipped.

Step 3: Approved

Once all required reviewers have approved, the Approved label is applied automatically.

Merge

Any member of mcore-engineers will be able to merge your PR.

For MRs into `dev` branch The proposed review process for `dev` branch is under active discussion.

MRs are mergable after one approval by either eharper@nvidia.com or zijiey@nvidia.com.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Mar 15, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@tdene tdene marked this pull request as ready for review March 15, 2026 17:31
@tdene tdene requested review from a team as code owners March 15, 2026 17:31
@svcnvidia-nemo-ci svcnvidia-nemo-ci requested a review from a team March 15, 2026 17:31
@svcnvidia-nemo-ci svcnvidia-nemo-ci added this to the Core 0.16 milestone Mar 15, 2026
@tdene
Copy link
Contributor Author

tdene commented Mar 16, 2026

/claude review

@claude
Copy link
Contributor

claude bot commented Mar 16, 2026

No critical issues found. One observation:

  • There are no unit tests for _reconstruct_reasoning_content (or any of the other helpers in chat_completions.py). The function is simple, but a couple of test cases (e.g. message with reasoning_content + existing content, message without reasoning_content, non-assistant message with reasoning_content) would help prevent regressions as this logic evolves.

LGTM otherwise.

@tdene
Copy link
Contributor Author

tdene commented Mar 17, 2026

No critical issues found. One observation:

  • There are no unit tests for _reconstruct_reasoning_content (or any of the other helpers in chat_completions.py). The function is simple, but a couple of test cases (e.g. message with reasoning_content + existing content, message without reasoning_content, non-assistant message with reasoning_content) would help prevent regressions as this logic evolves.

LGTM otherwise.

Understood. While unit tests are very important, there is - in general - a lack of unit tests for this particular section of the code. Such unit tests will be added for the entire text_generation_server in a follow-up PR, by no later than March 24th.

Copy link
Contributor

@santhnm2 santhnm2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM but please take a look @i-riyad

@svcnvidia-nemo-ci svcnvidia-nemo-ci added the Final Review PR is in the "final review" stage label Mar 17, 2026
@svcnvidia-nemo-ci svcnvidia-nemo-ci added Approved All necessary approvals have been made and removed Final Review PR is in the "final review" stage labels Mar 17, 2026
@tdene tdene enabled auto-merge March 17, 2026 17:20
@tdene tdene added this pull request to the merge queue Mar 17, 2026
@svcnvidia-nemo-ci
Copy link

🔄 Merge queue validation started!

You can track the progress here: https://github.com/NVIDIA/Megatron-LM/actions/runs/23212926764

@svcnvidia-nemo-ci
Copy link

🔄 Merge queue validation started!

You can track the progress here: https://github.com/NVIDIA/Megatron-LM/actions/runs/23214453991

@svcnvidia-nemo-ci
Copy link

🔄 Merge queue validation started!

You can track the progress here: https://github.com/NVIDIA/Megatron-LM/actions/runs/23215938152

@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 17, 2026
@tdene tdene added this pull request to the merge queue Mar 18, 2026
@svcnvidia-nemo-ci
Copy link

🔄 Merge queue validation started!

You can track the progress here: https://github.com/NVIDIA/Megatron-LM/actions/runs/23238234634

Merged via the queue into NVIDIA:main with commit 77c2095 Mar 18, 2026
77 of 81 checks passed
@tdene tdene deleted the tde/reasoning_parity_vllm branch March 18, 2026 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Approved All necessary approvals have been made complexity: low

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants