[TORCH] Added flex_attention hop function #4366
Open
keshavvinayak01 wants to merge 24 commits into llvm:main from keshavvinayak01:keshavvinayak01/torch-aten-flex_attention
Conversation
Change 1: Converts builtin tensors → Torch tensors when entering the loop body.
Change 2: Ensures Torch tensors → builtin tensors when yielding back to the loop condition.
Without these fixes, the conversion would fail when while loops carry tensor values. Also modified basic_test.py FILECHECK statements.
Signed-off-by: Keshav Vinayak Jha <[email protected]>
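To make the fix concrete, here is a minimal sketch (shapes, values, and function names are assumptions, not taken from the PR) of the kind of program this conversion must handle: a while_loop HOP whose carried values include a tensor, so tensor values cross the loop-body boundary in both directions.

```python
import torch
from torch._higher_order_ops.while_loop import while_loop

# Hypothetical repro: a while_loop HOP that carries a tensor value.
# Importing this requires builtin -> Torch tensor conversion on entry to
# the body region, and Torch -> builtin conversion when yielding back to
# the condition, as described in the commit above.
def cond_fn(i, x):
    return i < 3  # 0-dim bool tensor

def body_fn(i, x):
    return i + 1, x * 2.0  # same types/shapes as the carried inputs

def f(x):
    i = torch.zeros((), dtype=torch.int64)
    _, out = while_loop(cond_fn, body_fn, (i, x))
    return out

print(f(torch.ones(4)))  # three doublings: tensor([8., 8., 8., 8.])
```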
1. Better documentation for AtenFlexAttentionOp.
2. Function references added as attributes to aten.flex_attention.
3. Updates to _import_hop_flex_attention reflecting the latest changes to module import.
4. Removed discardable attributes; score_mod_fn and mask_mod_fn added as OptionalAttr.
Signed-off-by: Keshav Vinayak Jha <[email protected]>
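For reference, this is the eager API whose function-valued arguments those attributes capture; a small sketch with assumed shapes, and an arbitrary example score_mod:

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

# Example score modification function. Exported graphs reference such
# functions via get_attr nodes, which the importer records in the
# score_mod_fn / mask_mod_fn attributes of aten.flex_attention.
def score_mod(score, b, h, q_idx, kv_idx):
    return score + (q_idx - kv_idx)  # e.g. a relative-position bias

q = torch.randn(1, 2, 16, 8)  # (batch, heads, seq, head_dim); assumed shapes
k = torch.randn(1, 2, 16, 8)
v = torch.randn(1, 2, 16, 8)
out, lse = flex_attention(q, k, v, score_mod=score_mod, return_lse=True)
```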
Remove note about method usage for HOPs.
Removed TODO note for grouped query attention support in the docstring and comments.
keshavvinayak01 force-pushed from 095cb61 to 5e024f6
Description
- Added Torch_AtenFlexAttentionOp with 6 operands (query, key, value, scale, enable_gqa, return_lse) and 2 optional attributes (score_mod_fn, mask_mod_fn) for function references.
- The importer (_import_hop_flex_attention) correctly extracts score/mask modification functions from get_attr nodes using module IDs, following the while_loop HOP pattern.
- kernel_options performance tuning parameters are not yet supported.
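A minimal sketch of the import path this description refers to (the module, shapes, and score_mod body here are assumptions, not from the PR): exporting a flex_attention call yields an FX graph containing the flex_attention higher-order op, with the score/mask modification functions captured as submodules referenced by get_attr nodes; those are the references that _import_hop_flex_attention resolves via module IDs.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

class Model(torch.nn.Module):
    def forward(self, q, k, v):
        def score_mod(score, b, h, q_idx, kv_idx):
            return score * 0.5  # arbitrary example modification
        return flex_attention(q, k, v, score_mod=score_mod)

q = k = v = torch.randn(1, 2, 16, 8)  # assumed shapes
ep = torch.export.export(Model(), (q, k, v))
# The exported graph contains a flex_attention HOP node whose score_mod
# argument is a get_attr reference to a traced submodule; the importer
# turns that reference into the score_mod_fn attribute on the op.
ep.graph_module.print_readable()
```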