Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

support Opencua click dataset
#1393 opened Jan 12, 2026 by kafkayu Loading…
3 of 4 tasks
[FSDP] Optimize context parallel
#1383 opened Jan 12, 2026 by Beichen-Ma Loading…
[FSDP] Add argument validation for FA3 with cp
#1367 opened Jan 9, 2026 by Beichen-Ma Loading…
fix: fix sglang regression
#1363 opened Jan 8, 2026 by nanjiangwill Loading…
Feat: multi-threads data fetching for sft data
#1355 opened Jan 7, 2026 by UbeCc Loading…
[FSDP][Fix] Fix redundant import
#1354 opened Jan 7, 2026 by Hecate0821 Loading…
[WIP] add fault torlance
#1311 opened Jan 3, 2026 by lilei199908 Loading…
[data][feat] add large dataset support
#1298 opened Dec 31, 2025 by SwordFaith Loading…
Handle deepscaler answers without markers
#1226 opened Dec 26, 2025 by cklxx Loading…
Add Qwen3-Coder-30B-A3B-Instruct model script
#1213 opened Dec 25, 2025 by maoquan-ms Loading…
Megatron VLM Support (Qwen2.5-VL series) (3/N)
#1210 opened Dec 25, 2025 by Zhuohao-Li Loading…
Fix ruff hook and update pre-commit hooks
#1206 opened Dec 24, 2025 by ParagEkbote Loading…
Integrate Sonic-Moe in FSDP
#1176 opened Dec 22, 2025 by ChangyiYang Draft
tau-bench: offline stub user + tool parsing fallback
#1158 opened Dec 19, 2025 by Fengzdadi Loading…
ProTip! Follow long discussions with comments:>50.