[SYCLTLA] Fix FlashAttention FWD performance on PVC #2415

LuFinch · 2025-11-27T06:29:05Z

I missed an `else` when launching the kernel, so the kernel was launched twice on PVC.

Copilot

Pull request overview

Fixes a performance regression on PVC by correcting a missing else statement that caused a kernel to be launched twice. The PR also adds missing CUTLASS_DEVICE annotations to device functions.

Fixed conditional branching for kernel launch based on subgroup size
Added CUTLASS_DEVICE annotations to device-only functions
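
For readers unfamiliar with this class of bug, here is a minimal sketch of how a missing `else` can turn one dispatch into two launches. It is not the code from `mha_fwd.cpp`: the `Arch` enum, `launch_kernel`, the config strings, and both `dispatch_*` functions are hypothetical names chosen for illustration, and the real dispatch condition (described above as based on subgroup size) may look different.

```cpp
// Hypothetical architecture tags; the real dispatch may key on something else.
enum class Arch { PVC, BMG, Other };

// Stand-in for a real kernel submission; it only counts calls so that a
// duplicate launch becomes observable.
static int g_launches = 0;
void launch_kernel(const char* /*config*/) { ++g_launches; }

// Buggy shape: the second `if` starts a new chain, so on PVC the trailing
// `else` branch also runs and a second kernel is launched.
void dispatch_buggy(Arch arch) {
  if (arch == Arch::PVC) {
    launch_kernel("pvc");
  }
  if (arch == Arch::BMG) {
    launch_kernel("bmg");
  } else {
    launch_kernel("default");
  }
}

// Fixed shape: one `if / else if / else` chain, so exactly one branch runs.
void dispatch_fixed(Arch arch) {
  if (arch == Arch::PVC) {
    launch_kernel("pvc");
  } else if (arch == Arch::BMG) {
    launch_kernel("bmg");
  } else {
    launch_kernel("default");
  }
}

// Returns how many launches a single dispatch call performs.
int count_launches(void (*dispatch)(Arch), Arch arch) {
  g_launches = 0;
  dispatch(arch);
  return g_launches;
}
```

In this sketch, `count_launches(dispatch_buggy, Arch::PVC)` reports two launches while `count_launches(dispatch_fixed, Arch::PVC)` reports one. The results stay correct in the buggy form only if the second launch overwrites the same output, which is why this kind of bug tends to show up as a slowdown and not as a failing test.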

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
src/ATen/native/transformers/xpu/flash_attn/sycltla/mha_fwd.cpp	Fixed missing `else` statement that was causing duplicate kernel launches
src/ATen/native/transformers/xpu/flash_attn/sycltla/kernel/xe_sdpa_fwd_bshd.h	Added `CUTLASS_DEVICE` annotations to device functions

LuFinch · 2025-11-28T02:20:57Z

Can we merge this PR?

EikanWang · 2025-11-28T05:54:11Z

Sure.

fix perf

4477262

Copilot AI review requested due to automatic review settings November 27, 2025 06:29

Copilot AI reviewed Nov 27, 2025

LuFinch changed the title ~~[SYCLTLA] Fix performance on PVC~~ [SYCLTLA] Fix FlashAttention FWD performance on PVC Nov 27, 2025

LuFinch requested a review from EikanWang November 27, 2025 06:29

EikanWang approved these changes Nov 28, 2025

remove intel_gpu_bmg_g31 due to stock pytorch CI is 2025.1

4c8e6b0

EikanWang approved these changes Nov 28, 2025

refine

57c6114

EikanWang approved these changes Nov 30, 2025
