Skip to content

Conversation

@CuriousPanCake
Copy link
Contributor

Details:

The SDPAToPA transformation can be executed only on a stateful model (to turn it into the stateless one). Previously, this check existed in the GenAI code, but since we've done transfer of some functionality to OV runtime, we need to add this check here.

Tickets:

Signed-off-by: Andrii Staikov [email protected]

@CuriousPanCake CuriousPanCake requested a review from a team as a code owner November 24, 2025 10:22
@CuriousPanCake CuriousPanCake requested review from Wovchena and mryzhov and removed request for a team November 24, 2025 10:22
@github-actions github-actions bot added the category: Core OpenVINO Core (aka ngraph) label Nov 24, 2025
@Wovchena Wovchena requested a review from Copilot November 24, 2025 10:24
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a validation check to ensure that the SDPAToPagedAttention transformation is only executed on stateful models. The check verifies that the model contains variables before proceeding with the transformation, which previously existed in GenAI code but has been moved to the OpenVINO runtime.

Key Changes:

  • Added an assertion to validate that the model is stateful (contains variables) before performing the SDPAToPagedAttention transformation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@CuriousPanCake CuriousPanCake force-pushed the add_var_check_to_sdpa_to_pa branch from 693f903 to 0e6ce2d Compare November 25, 2025 13:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: Core OpenVINO Core (aka ngraph)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants