-
Notifications
You must be signed in to change notification settings - Fork 91
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add activation, norm and rope ops for cuda device.
cuda
#485
opened Dec 4, 2025 by
XuZhang99
Loading…
feat: support iluvatar backend qwen3 0.6b run through
ilu
#481
opened Dec 4, 2025 by
laneeeee
Loading…
refactor: breakdown fused moe kernel for deepseek all2all setup.
mlu
#476
opened Dec 3, 2025 by
a120092009
Loading…
feat: add wrappers for ATB and ACLNN fused operators.
#474
opened Dec 2, 2025 by
yingxudeng
Loading…
refactor: separate mlu and cuda version Qwen model implementation.
cuda
#468
opened Dec 1, 2025 by
XuZhang99
Loading…
refactor: optimize unique token count preparation of batch input builder.
#449
opened Nov 27, 2025 by
RobbieLeung
Loading…
[WIP] feat: support loading model weights and forward overlap.
#441
opened Nov 26, 2025 by
Clement-Wang26
Loading…
feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.
#399
opened Nov 18, 2025 by
xanecdotex
Loading…
feat: enable torch_npu graph mode for Qwen-3 dense with TP support.
#325
opened Nov 6, 2025 by
yingxudeng
Loading…
ProTip!
What’s not been updated in a month: updated:<2025-11-04.