Pull requests: quic/efficient-transformers
Pull requests list
Update Qeff Documentation to indicate vLLM Support in Validated Models Page
#588
opened Oct 14, 2025 by
quic-vargupt
[Upgradation]: onnx opset version updated from 13 to 17
#587
opened Oct 14, 2025 by
abukhoy
Enable CB for vlms with multiple images and multiple prompts
#583
opened Oct 6, 2025 by
quic-mamta
Draft
Example walk through on how to onboard a Causal LM on Qefficient Transformers.
#574
opened Sep 24, 2025 by
quic-dhirajku
Fix llama model o_proj lora_ids passing for finite lorax in release/v1.20.0
#573
opened Sep 23, 2025 by
quic-jouachen
[Qwen2_5_vl] - Onboarding Qwen2_5_vl model in QEfficient
#560
opened Sep 12, 2025 by
mohiso22
Logger Module For Efficient Transformers
1.21.0
wip
Work in progress
#555
opened Sep 10, 2025 by
quic-hemagnih
Draft
Extend On-Device Sampling Support to more Causal Language Models
#553
opened Sep 4, 2025 by
quic-sanising
[QEff]: Add OpenAI Oss Models (gpt_oss)
enhancement
New feature or request
#534
opened Aug 6, 2025 by
vbaddi
Add Support for Frequency Penalties in On Device Sampling
#523
opened Jul 24, 2025 by
quic-sanising
Draft
Reading mxfp6_matmul for QNN Compilation path from compile API arguments
1.21.0
#499
opened Jul 7, 2025 by
shubhagr-qc
Draft