Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add MSE vs MinMax observer comparison tests
#2110 opened Dec 11, 2025 by GOavi101 Loading…
[Bug fix] fix Qwen3VLMoe
#2104 opened Dec 9, 2025 by Wangzheee Loading…
add kv quant example autoround For any PR / issue related to autoround support
#2100 opened Dec 5, 2025 by mengniwang95 Loading…
[test] add e2e test for qwen3 moe w4a16 ready When a PR is ready for review
#2071 opened Nov 25, 2025 by HDCharles Draft
[Performance] Batched calibration ready When a PR is ready for review
#2054 opened Nov 20, 2025 by kylesayrs Loading…
[Misc] Remove is_moe_model ready When a PR is ready for review
#2053 opened Nov 20, 2025 by kylesayrs Loading…
Testing Clean-up
#2045 opened Nov 18, 2025 by dsikka Draft
Support wInt4aFp8 for moe
#2027 opened Nov 12, 2025 by Wangzheee Loading…
[TypeHint] Fix format_calibration_data type hint
#2012 opened Nov 10, 2025 by kylesayrs Loading…
Implement propagate_error argument ready When a PR is ready for review
#2008 opened Nov 10, 2025 by kylesayrs Loading…
Granite4 FP8 Block Quantization
#2001 opened Nov 6, 2025 by krishnateja95 Loading…
[Kimi Linear] FP8 Example
#1986 opened Oct 31, 2025 by dsikka Draft
[AWQ] Generalize AWQ quantization ready When a PR is ready for review
#1961 opened Oct 22, 2025 by kylesayrs Loading…
4 tasks done
[Attention] Support FP4 attention quantization nvfp4 For any PR / issue related to NVFP4 support
#1924 opened Oct 14, 2025 by kylesayrs Loading…
add gpt oss nvfp4 example
#1885 opened Sep 30, 2025 by shanjiaz Draft
ProTip! no:milestone will show everything without a milestone.