
Conversation

@k50112113 commented Sep 29, 2025

Previously, VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE would be disabled whenever VLLM_ROCM_USE_AITER_MHA was turned on.

This PR allows VLLM_ROCM_USE_AITER_MHA and VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE to be enabled at the same time.

This change affects Llama and GPT-OSS.
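
For context, here is a minimal sketch of how both flags might be enabled together when serving a model with vLLM. The model name, sampling settings, and the use of the umbrella VLLM_ROCM_USE_AITER flag are illustrative assumptions, not part of this PR:

```python
import os

# Umbrella ROCm AITER switch; typically needs to be on for the
# AITER-specific flags below to take effect (assumption, not from this PR).
os.environ["VLLM_ROCM_USE_AITER"] = "1"

# With this PR, both flags can be enabled simultaneously.
# Previously the fused-RoPE flag was forced off when AITER MHA was on.
os.environ["VLLM_ROCM_USE_AITER_MHA"] = "1"
os.environ["VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE"] = "1"

# Import vLLM after setting the environment so the flags are picked up.
from vllm import LLM, SamplingParams

# Hypothetical model choice; the PR notes Llama and GPT-OSS are affected.
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")
outputs = llm.generate(["Hello, world"], SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```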

@dllehr-amd (Collaborator) left a comment


Approved!

@dllehr-amd merged commit 2b4cb8a into 355_wip on Oct 22, 2025
3 of 5 checks passed
