-
Notifications
You must be signed in to change notification settings - Fork 502
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: fix nvidia-modelopt version specifier in build_conda.sh
#1554
opened Feb 6, 2026 by
liujiahua123123
Loading…
fix: remove double-counting of LR scheduler steps on checkpoint resume
#1547
opened Feb 5, 2026 by
Surya-Gunukula
Loading…
[Draft] Add bridge mode support for distributed weight update
#1537
opened Feb 3, 2026 by
coding-famer
Loading…
fix: resolve OOM in long-sequence training via conditional entropy gradient tracking
run-ci-megatron
#1524
opened Jan 30, 2026 by
ppraneth
Loading…
fix: pp broadcast_object_list patch bug of megatron bridge
#1517
opened Jan 29, 2026 by
Yangruipis
Loading…
refactor: replace internal Ray API calls with public interfaces
#1508
opened Jan 28, 2026 by
HappyCpp
Loading…
[FIX] model provider compatibility with Megatron-LM
#1500
opened Jan 27, 2026 by
simondong1
Loading…
[Fix] Allow default TIS function usage when get_mismatch_metrics is set
#1483
opened Jan 23, 2026 by
eecspan
Loading…
[Fix] Fix some tiny bugs in fault tolerance
run-ci-megatron
#1480
opened Jan 23, 2026 by
yitianlian
Loading…
[Feature] Distribute Prefill on different node
run-ci-megatron
#1478
opened Jan 22, 2026 by
yitianlian
Loading…
fix(rollout): raise error when buffer is insufficient without global dataset
#1474
opened Jan 21, 2026 by
JackXu0
Loading…
perf(rollout): compress loss masks with run-length encoding
#1473
opened Jan 20, 2026 by
JackXu0
Loading…
fix: remove enforced CPU initialization assertion for AMD GPUs
#1471
opened Jan 20, 2026 by
Vivicai1005
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-02-04.