Pull requests: THUDM/slime
fix(qwen3_next): use torch.get_default_dtype() — get_current_dtype do…
#1883
opened Apr 30, 2026 by
HeatherLiuzh
fix ppo value offload bugs
run-ci-megatron
#1882
opened Apr 30, 2026 by
lilei199908
Collaborator
fix: add fallback for --save-hf when Megatron-Bridge lacks model support
#1881
opened Apr 30, 2026 by
WangHong-yang
Contributor
3 tasks done
[Fix] Fix distributed POST actor concurrency split
#1880
opened Apr 29, 2026 by
kaysonyu
Contributor
feat(profile): safer torch.profiler defaults + per-grad-step capture
#1879
opened Apr 29, 2026 by
leofan-lab
Contributor
Add Megatron-Bridge LoRA support for GRPO actor training
#1865
opened Apr 26, 2026 by
taivu1998
fix: harden retool rollout against multi-turn / retry desync
#1861
opened Apr 24, 2026 by
leofan-lab
Contributor
fix: guard DP-imbalance empty micro-batches under dynamic batching
#1860
opened Apr 24, 2026 by
leofan-lab
Contributor
fix: rebind asyncio Semaphore and HTTP client on event-loop change
#1858
opened Apr 24, 2026 by
leofan-lab
Contributor
fix: make no_sync_func install idempotent across train() calls
#1857
opened Apr 24, 2026 by
leofan-lab
Contributor
feat(gemma4): add Gemma4 26B-A4B MoE and 31B dense support
#1855
opened Apr 24, 2026 by
leofan-lab
Contributor
Fix double prepare_grads / loss-scaler-double-update in train_one_step
#1842
opened Apr 17, 2026 by
jthomy
fix(sft): enable max-length filtering for messages datasets
#1841
opened Apr 17, 2026 by
none0663
Contributor
3 tasks done
Fix missing activation checkpointing (recompute) parameters in bridge mode
#1833
opened Apr 14, 2026 by
XJL010622
[build] Add A100 support: patch set, offline-friendly conda build, and examples
#1832
opened Apr 14, 2026 by
jason9693
fix(gemma3): use GeGLU activation instead of SwiGLU
#1825
opened Apr 10, 2026 by
leofan-lab
Contributor
fix: auto-fallback to flash_attn for Qwen3.5 on pre-Hopper GPUs (head_dim=256)
#1808
opened Apr 6, 2026 by
dadiaomengmeimei
feat: delta compression for weight sync
run-ci-megatron
#1806
opened Apr 5, 2026 by
nanjiangwill
Collaborator
3 of 5 tasks
Supporting FIPO (Future-KL Influenced Policy Optimization)
#1801
opened Apr 3, 2026 by
SeungyounShin
feat: add checkpoint retention limit to automatically clean up old checkpoints
#1798
opened Apr 2, 2026 by
stevewx
Contributor
4 tasks done