
[INTEL HPU] add fused block atten #1706


Merged
Merged 1 commit into PaddlePaddle:develop on May 26, 2025

Conversation

yanfeich
Contributor

Optimize HPU fused block attention.

  1. Fuse the residual add into the RMSNorm + RoPE kernel.
  2. Fuse the residual add into the RMSNorm + MLP kernel.
  3. Fuse RoPE + SDPA into a single `fused_block_attention` op (see the sketch after this list).
  4. Optimize `prepare_block_metadata`.
  5. Add related unit tests.
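
For readers unfamiliar with why these fusions help, here is a minimal sketch of the pattern behind items 1 and 3. The `pre_attention_*` function names are hypothetical stand-ins, not the actual PaddleCustomDevice HPU kernel names; the point is that the unfused path launches the residual add as a separate elementwise kernel, costing an extra round trip of the full hidden tensor through memory, while the fused kernel reads the inputs once and produces both outputs:

```python
import paddle

# Illustrative sketch only: the function names below are hypothetical
# stand-ins for the fused HPU kernels this PR adds, not the real
# PaddleCustomDevice API.

def rms_norm(x, weight, eps=1e-6):
    # Plain RMSNorm: x / sqrt(mean(x^2) + eps) * weight
    variance = paddle.mean(x * x, axis=-1, keepdim=True)
    return x * paddle.rsqrt(variance + eps) * weight

def pre_attention_unfused(hidden, residual, norm_weight):
    # Unfused path: the residual add is its own elementwise kernel,
    # so the full hidden tensor is written out and read back before RMSNorm.
    hidden = hidden + residual               # kernel 1: elementwise add
    normed = rms_norm(hidden, norm_weight)   # kernel 2: RMSNorm
    return normed, hidden                    # updated residual carried forward

def pre_attention_fused(hidden, residual, norm_weight):
    # Fused path: conceptually one kernel that reads hidden and residual
    # once and emits both the normalized activations and the new residual.
    # (On HPU, RoPE would also run inside the same fused op.)
    hidden = hidden + residual
    return rms_norm(hidden, norm_weight), hidden

if __name__ == "__main__":
    h = paddle.randn([2, 8, 64])
    r = paddle.randn([2, 8, 64])
    w = paddle.ones([64])
    a, _ = pre_attention_unfused(h, r, w)
    b, _ = pre_attention_fused(h, r, w)
    # Both paths are numerically identical; the fused one saves kernel
    # launches and memory traffic on the device.
    assert paddle.allclose(a, b).item()
```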


paddle-bot bot commented May 23, 2025

Thanks for your contribution!

@yanfeich
Contributor Author

Adding @LeoZhao-Intel @JianyuLi01 @zongwave @fmiao2372 @feiwan1 to review.
Also adding @xiaoguoguo626807 to review.

@xiaoguoguo626807 xiaoguoguo626807 merged commit d8e6b25 into PaddlePaddle:develop May 26, 2025
10 of 11 checks passed