Pull requests: vllm-project/vllm-ascend
[WIP]Add Func: npugraph_batch_size auto-adjust to different model
module:tests
#739
opened Apr 30, 2025 by
chris668899
Add qwen2.5 vl multimodal feature for vllm-ascend v1
module:tests
#736
opened Apr 30, 2025 by
RookieChenTaoYu
[Doc] Add release note for 0.7.3
documentation
#735
opened Apr 30, 2025 by
wangxiyuan
support 32K model len on deepseek r1 W8A8
module:quantization
#728
opened Apr 29, 2025 by
flying632
V1 parallel fix: bug fix to enable DP in V1
module:core
module:ops
#710
opened Apr 28, 2025 by
HanlinDu
[WIP][Build][0.7.3] Integrate MindIE Turbo into vLLM Ascend
documentation
#708
opened Apr 28, 2025 by
MengqingCao
[Feature] Impl the connector based on the llmdatadist for v1
module:core
#684
opened Apr 27, 2025 by
jianzs
1 of 5 tasks
feat: performance optimization for deepseek
module:quantization
#683
opened Apr 27, 2025 by
zzzzwwjj
update chunk prefill torch
module:ops
module:tests
#679
opened Apr 27, 2025 by
ttanzhiqiang
[WIP] Add support for custom DeepSeek modelling in ACL Graph mode
module:core
module:ops
#677
opened Apr 27, 2025 by
yiz-liu
[Feature] Enable disaggregated prefill functionality for v0
module:core
module:tests
#658
opened Apr 25, 2025 by
jianzs
Adjust KV cache shape for compatibility with updated APIs for graph mode
ci/build
#657
opened Apr 25, 2025 by
linfeng-yuan
[Misc] format patch to make the code clear
documentation
module:core
module:quantization
module:tests
#613
opened Apr 22, 2025 by
wangxiyuan
[Bugfix] Fix the bug of torch_npu that raising segment fault when enable pin_memory while creating a tensor
module:core
#597
opened Apr 21, 2025 by
shen-shanshan