Skip to content

feat: context-parallel with nano-v3#1441

Draft
adil-a wants to merge 2 commits intomainfrom
adil/nano-v3-cp
Draft

feat: context-parallel with nano-v3#1441
adil-a wants to merge 2 commits intomainfrom
adil/nano-v3-cp

Conversation

@adil-a
Copy link
Collaborator

@adil-a adil-a commented Mar 3, 2026

Adds CP for Mamba layers (hidden-shard) and Attention layers (sequence-shard).

Convergence runs to follow after #1416 is merged.

adil-a added 2 commits March 3, 2026 15:26
Signed-off-by: adil-a <adil.asif2000@hotmail.com>
Signed-off-by: adil-a <adil.asif2000@hotmail.com>
@adil-a adil-a self-assigned this Mar 3, 2026
@copy-pr-bot
Copy link

copy-pr-bot bot commented Mar 3, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@adil-a adil-a linked an issue Mar 3, 2026 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature Request] Context Parallelism Support for NemotronV3

1 participant