Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

chore: nightly sync main into dev (18_06_2026) Run functional tests Run MBridge tests Attach this for testing this PR against MBridge main
#5402 opened Jun 18, 2026 by svcnvidia-nemo-ci Draft
Update goldens for weekly tests after pytorch and TE bumps. complexity: high
#5399 opened Jun 18, 2026 by balasaajay Contributor Loading…
1 of 6 tasks
Fix fast-cache-load rank synchronization guard community-request waiting-on-customer Waiting on the original author to respond
#5398 opened Jun 18, 2026 by sandyhouse Loading…
1 task
[main] moe(perf): Refactor GDN A2A helper flow complexity: medium
#5392 opened Jun 17, 2026 by yuzhongw-nvidia Contributor Loading…
1 of 6 tasks
Add experimental decoupled compact LayerWise DDP layout for Muon (main)
#5391 opened Jun 17, 2026 by Wohox Contributor Draft
3 of 6 tasks
Test stacked PRs
#5390 opened Jun 17, 2026 by wujingyue Contributor Draft
[dev] Add experimental decoupled compact LayerWise DDP layout for Muon complexity: medium
#5388 opened Jun 17, 2026 by Wohox Contributor Loading…
3 of 6 tasks
Add experimental Megatron-FSDP fully_shard implementation complexity: medium Final Review PR is in the "final review" stage MFSDPv2 Run tests
#5387 opened Jun 17, 2026 by wujingyue Contributor Loading…
Fix fused MLA down projection with tensor parallelism complexity: low Final Review PR is in the "final review" stage
#5383 opened Jun 16, 2026 by sraman-rgb Contributor Loading…
6 tasks
Add generic interface for SSM inference
#5382 opened Jun 16, 2026 by santhnm2 Contributor Draft
6 tasks
ProTip! What’s not been updated in a month: updated:<2026-05-18.