Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[BugFix] Illegal memory access for MoE On H20
#13693 opened Feb 22, 2025 by Abatom Loading…
[V1][Minor] Use FakeAttentionMetadata for dummy run ready ONLY add when PR is ready to merge/full CI is needed v1
#13689 opened Feb 22, 2025 by WoosukKwon Loading…
[Bugfix][Model] OLMo 2: split qkv correctly for GQA and MQA ready ONLY add when PR is ready to merge/full CI is needed
#13687 opened Feb 21, 2025 by 2015aroras Loading…
[Bugfix][API Server] Fix invalid usage of 'ge' and 'le' in port valid… frontend ready ONLY add when PR is ready to merge/full CI is needed
#13672 opened Feb 21, 2025 by WangErXiao Loading…
[Misc] Capture and log the time of loading weights ready ONLY add when PR is ready to merge/full CI is needed v1
#13666 opened Feb 21, 2025 by waltforme Loading…
Correction to TP logic for Mamba Mixer 2 when Num Groups not divisible by TP Size ready ONLY add when PR is ready to merge/full CI is needed
#13660 opened Feb 21, 2025 by fabianlim Loading…
[ROCM] fix native attention function call ready ONLY add when PR is ready to merge/full CI is needed
#13650 opened Feb 21, 2025 by gongdao123 Loading…
docs: Add a note on full CI run in contributing guide documentation Improvements or additions to documentation
#13646 opened Feb 21, 2025 by terrytangyuan Loading…
[v1] torchrun compatibility ci/build v1
#13642 opened Feb 21, 2025 by youkaichao Loading…
Fix some issues with benchmark data output ci/build
#13641 opened Feb 21, 2025 by huydhn Loading…
[V1][PP] Continue scheduling prefill chunks v1
#13637 opened Feb 21, 2025 by comaniac Loading…
use whl path to install torch ci/build
#13627 opened Feb 20, 2025 by Chenyaaang Loading…
[Kernel] Optimize moe intermediate_cache usage
#13625 opened Feb 20, 2025 by mgoin Loading…
[Misc] Bump compressed-tensors ci/build ready ONLY add when PR is ready to merge/full CI is needed
#13619 opened Feb 20, 2025 by dsikka Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.