[FlexAttention] fixing learnable bias assertion error in inductor #161170

liangel-02 · 2025-08-21T16:02:15Z

Users encountered unexpected behaviour when using FlexAttention with learnable biases, including assertion errors (#157677)

We traced the root cause to the registration of subgraph buffers—this caused inconsistencies in the naming and ultimately incorrect retrieval later on. This problem only arose if the model was compiled as a whole (ie using @torch.compile) since only then would there be naming conflicts.

In this PR, we register the buffers with the base graph to solve this issue.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Chillee @drisspg @yanboliang @BoyuanFeng

pytorch-bot · 2025-08-21T16:02:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161170

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 11ec4fd with merge base e20f6d7 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

liangel-02 · 2025-08-23T03:18:17Z

@pytorchbot merge

pytorchmergebot · 2025-08-23T03:20:07Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…torch#161170) Users encountered unexpected behaviour when using FlexAttention with learnable biases, including assertion errors (pytorch#157677) We traced the root cause to the registration of subgraph buffers—this caused inconsistencies in the naming and ultimately incorrect retrieval later on. This problem only arose if the model was compiled as a whole (ie using @torch.compile) since only then would there be naming conflicts. In this PR, we register the buffers with the base graph to solve this issue. Pull Request resolved: pytorch#161170 Approved by: https://github.com/drisspg

pytorch-bot bot added ciflow/inductor module: inductor labels Aug 21, 2025

liangel-02 changed the title ~~[FlexAttention] fixing learnable biases in inductor~~ [FlexAttention] fixing learnable bias assertion error in inductor Aug 21, 2025

drisspg added topic: not user facing topic category module: flex attention labels Aug 21, 2025

drisspg approved these changes Aug 21, 2025

View reviewed changes

drisspg marked this pull request as ready for review August 22, 2025 01:22

liangel-02 added 3 commits August 22, 2025 10:49

fixing strides mismatch

c280d15

assert grad flow for bias

9306041

make sizes smaller for test

11ec4fd

liangel-02 force-pushed the learnablebias branch from 36fb03a to 11ec4fd Compare August 22, 2025 17:50

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 23, 2025

pytorchmergebot added the merging label Aug 23, 2025

pytorchmergebot added the Merged label Aug 23, 2025

pytorchmergebot closed this in 3a4140b Aug 23, 2025

pytorchmergebot removed the merging label Aug 23, 2025

github-actions bot deleted the learnablebias branch September 23, 2025 02:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FlexAttention] fixing learnable bias assertion error in inductor #161170

[FlexAttention] fixing learnable bias assertion error in inductor #161170

liangel-02 commented Aug 21, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Aug 21, 2025 •

edited

Loading

Uh oh!

liangel-02 commented Aug 23, 2025

Uh oh!

pytorchmergebot commented Aug 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[FlexAttention] fixing learnable bias assertion error in inductor #161170

[FlexAttention] fixing learnable bias assertion error in inductor #161170

Conversation

liangel-02 commented Aug 21, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161170

✅ No Failures

Uh oh!

liangel-02 commented Aug 23, 2025

Uh oh!

pytorchmergebot commented Aug 23, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

liangel-02 commented Aug 21, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Aug 21, 2025 •

edited

Loading