Pull requests: quic/efficient-transformers
Optimize attention blocking nested loops
#957
opened Apr 30, 2026 by
anujgupt-github
Contributor
Layer wise changes for kimi model
#954
opened Apr 29, 2026 by
abhishek-singh591
Contributor
[Nightly CI]: Creating separate Pipeline for Nightly Jobs
#953
opened Apr 29, 2026 by
abukhoy
Contributor
fix: improve weight offloading to handle plain tensor attrs and use to_empty()
#952
opened Apr 28, 2026 by
quic-rishinr
Contributor
Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf export issue
#950
opened Apr 28, 2026 by
quic-rishinr
Contributor
First Block Caching Infra for diffusers
Diffusers
Use for PR related to diffusers in efficient-transformers.
#941
opened Apr 24, 2026 by
quic-amitraj
Contributor
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill
enhancement
New feature or request
#935
opened Apr 21, 2026 by
vbaddi
Contributor
updated blocking in diffusers with cross attention check instead of SL
#932
opened Apr 21, 2026 by
tv-karthikeya
Contributor
CB Bug fix for Qwen3VL Dense and basic cleaning of example script and Model File
#926
opened Apr 20, 2026 by
qcdipankar
Contributor
Enabling support of rerankers models 2B and 8B of qwen3vl
#921
opened Apr 18, 2026 by
quic-amitraj
Contributor
Removed redundancies from QEFFHybridCache and QEFFHybridChunkedCache
#914
opened Apr 13, 2026 by
quic-mamta
Contributor
Draft
revert(export): Revert proxy-only ONNX transform gating and restore default export behavior
1.21.0
#912
opened Apr 10, 2026 by
vbaddi
Contributor
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures
enhancement
New feature or request
#906
opened Apr 3, 2026 by
vbaddi
Contributor
Merge ft_experimental_v1 branch to main
fine-tuning
ready for review
#887
opened Mar 25, 2026 by
quic-akuruvil
Contributor
Undo deepstack_features based changes for Qwen3VL and Qwen3VL_MOE models
#869
opened Mar 18, 2026 by
quic-dhirajku
Contributor
Draft