Pull requests: alibaba/rtp-llm

Pull requests list

feat: quark format model support
#316 opened Nov 3, 2025 by zhaoan12-prc
[wip]Develop/embedding grpc server
#315 opened Nov 3, 2025 by wanglining97
refactor: refactor post checkout
#314 opened Nov 3, 2025 by wht21
docs - add cli reference
#313 opened Nov 3, 2025 by wht21
feat: rmsnorm fuse quant and unitest
#312 opened Nov 3, 2025 by zhaoan12-prc
refactor: refactor cutlass groupgemm fp8
#307 opened Oct 31, 2025 by MMadhatter
feature: new mtp framework
#305 opened Oct 31, 2025 by Vinkle-hzt (Draft, 3 tasks)
[WIP]fix: moe out of range
#303 opened Oct 31, 2025 by Bruce-Lee-LY
Feature/cuda129
#302 opened Oct 31, 2025 by zerozw
refactor: clean sampler, suppor cuda random_seed
#300 opened Oct 30, 2025 by LLLLKKKK
Features/token processor
#293 opened Oct 29, 2025 by siluzhou
feature - add reuse cache in py mla
#292 opened Oct 29, 2025 by Nancheng-11
Feature/reuse_cache
#290 opened Oct 29, 2025 by zerozw
test: ci test only, don't review
#287 opened Oct 29, 2025 by JackTan25
feature - adapter requirement for roll
#280 opened Oct 28, 2025 by jianglan89
Feature/reorder kvcache
#276 opened Oct 27, 2025 by alibaba-miji
feat: some features and optimize for rocm pymodel
#268 opened Oct 23, 2025 by liaocz
feat: improve pymodel bert perf
#260 opened Oct 21, 2025 by JackTan25
refactor - remove async_model
#258 opened Oct 21, 2025 by jianglan89