Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Optimize attention blocking nested loops
#957 opened Apr 30, 2026 by anujgupt-github Contributor Loading…
Layer wise changes for kimi model
#954 opened Apr 29, 2026 by abhishek-singh591 Contributor Loading…
[Nightly CI]: Creating separate Pipeline for Nightly Jobs
#953 opened Apr 29, 2026 by abukhoy Contributor Loading…
MLA Int4 changes
#949 opened Apr 28, 2026 by quic-mamta Contributor Loading…
Gemma4 alignment
#948 opened Apr 28, 2026 by tchawada Contributor Draft
First Block Caching Infra for diffusers Diffusers Use for PR related to diffusers in efficient-transformers.
#941 opened Apr 24, 2026 by quic-amitraj Contributor Loading…
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill enhancement New feature or request
#935 opened Apr 21, 2026 by vbaddi Contributor Loading…
Added MDP generation to QEff Compile
#930 opened Apr 21, 2026 by quic-mohmeh Loading…
Enabled Qwen3-VL embedding model
#923 opened Apr 20, 2026 by quic-amitraj Contributor Loading…
[Qwen3_Omni]_Onboarding
#922 opened Apr 20, 2026 by mohiso22 Contributor Draft
Enabling support of rerankers models 2B and 8B of qwen3vl
#921 opened Apr 18, 2026 by quic-amitraj Contributor Loading…
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures enhancement New feature or request
#906 opened Apr 3, 2026 by vbaddi Contributor Loading…
qwen3_5_linear_attn
#901 opened Apr 1, 2026 by mohiso22 Contributor Draft
ProTip! What’s not been updated in a month: updated:<2026-03-30.