You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enable the QNN Adreno GPU backend as a new execution provider in ModelKit, collecting op coverage data and implementing the device preset.
Context
QNN supports both the Hexagon HTP (NPU) and Adreno GPU backends. The Adreno backend targets mobile/edge GPU inference and extends ModelKit's Qualcomm coverage beyond NPU-only. This is a new EP target for the May 1 delivery.
From plans/release/0501_release_plan/P0_CHECKLIST.md (P1-EP-004).
Summary
Enable the QNN Adreno GPU backend as a new execution provider in ModelKit, collecting op coverage data and implementing the device preset.
Context
QNN supports both the Hexagon HTP (NPU) and Adreno GPU backends. The Adreno backend targets mobile/edge GPU inference and extends ModelKit's Qualcomm coverage beyond NPU-only. This is a new EP target for the May 1 delivery.
From
plans/release/0501_release_plan/P0_CHECKLIST.md(P1-EP-004).Current State
Desired State
wmk build --device adrenoAcceptance Criteria
--device adreno)Technical Notes
backend_pathto Adreno GPU library (QnnHtp.dll vs QnnAdreno.dll)Related Files
plans/release/0501_release_plan/ep-scale.md— QNN Adreno section (P0.4)plans/release/0501_release_plan/P0_CHECKLIST.md— P1-EP-004