[ROCm] Prevent accidental enablement of efficient attention. #134531

pytorchbot · 2024-08-27T00:08:41Z

Currently, efficient attention and Flash attention share the same set of GPU
kernels on ROCm and have common limitations on head sizes.
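
The diff itself is not shown in this conversation, so the following is only an illustrative sketch of the idea described above, not PyTorch's actual dispatch code. It assumes a hypothetical selector in which, on ROCm, efficient attention is subject to the same head-size ceiling as Flash attention, because both run on the same kernels. All names (`Backend`, `Platform`, `supports`, `select_backend`) and both numeric limits are assumptions made up for this example.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Backend(Enum):
    FLASH = auto()
    EFFICIENT = auto()
    MATH = auto()


@dataclass(frozen=True)
class Platform:
    is_rocm: bool
    # Hypothetical head-size ceilings, chosen only for illustration.
    flash_max_head_dim: int = 128
    efficient_max_head_dim: int = 512


def supports(backend: Backend, head_dim: int, platform: Platform) -> bool:
    """Return True if `backend` can handle `head_dim` on `platform`."""
    if backend is Backend.MATH:
        # The reference implementation has no head-size restriction here.
        return True
    if backend is Backend.FLASH:
        return head_dim <= platform.flash_max_head_dim
    # Efficient attention: on ROCm it reuses the Flash kernels, so it
    # inherits the Flash limit instead of having its own.
    if platform.is_rocm:
        return head_dim <= platform.flash_max_head_dim
    return head_dim <= platform.efficient_max_head_dim


def select_backend(head_dim: int, platform: Platform) -> Backend:
    """Pick the first backend, in priority order, that supports `head_dim`."""
    for backend in (Backend.FLASH, Backend.EFFICIENT, Backend.MATH):
        if supports(backend, head_dim, platform):
            return backend
    raise RuntimeError("unreachable: MATH accepts every head size")
```

Under these assumed limits, a head size that Flash attention rejects on ROCm is also rejected by efficient attention, so the selector falls through to the math backend rather than enabling efficient attention by accident.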

Fixes #132004

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang

Currently Efficient attention and Flash attention share the same set of GPU kernels on ROCM and have common limitations on head sizes.

Fixes #132004

Pull Request resolved: #133331
Approved by: https://github.com/malfet, https://github.com/jithunnair-amd

(cherry picked from commit 46ecc67)

pytorch-bot · 2024-08-27T00:08:44Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134531

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 1 Unrelated Failure

As of commit ae4d2fd with merge base b66e3f0:

NEW FAILURES - The following jobs have failed:

pull / linux-focal-py3_8-clang9-xla / test (xla, 1, 1, am2.linux.12xlarge) (gh)
test_repeat_truncated
pull / linux-focal-py3.12-clang10 / test (default, 2, 3, am2.linux.2xlarge) (gh)
dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_outside_linear_module_free_dynamic_shapes

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-jammy-py3.8-gcc11 / test (distributed, 1, 2, am2.linux.2xlarge) (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pruthvistony · 2024-08-27T00:11:43Z

@atalman, @malfet
Could you please help with this cherry-pick PR?

The code changes are ROCm-related and fix #132004.

…#134531)

[ROCm] Prevent accidental enablement of efficient attention. (pytorch#133331)

Currently Efficient attention and Flash attention share the same set of GPU kernels on ROCM and have common limitations on head sizes.

Fixes pytorch#132004

Pull Request resolved: pytorch#133331
Approved by: https://github.com/malfet, https://github.com/jithunnair-amd

(cherry picked from commit 46ecc67)

Co-authored-by: Xinya Zhang <Xinya.Zhang@amd.com>

…#134531) (#1565)

[ROCm] Prevent accidental enablement of efficient attention. (pytorch#133331)

Currently Efficient attention and Flash attention share the same set of GPU kernels on ROCM and have common limitations on head sizes.

Pull Request resolved: pytorch#133331
Approved by: https://github.com/malfet, https://github.com/jithunnair-amd

(cherry picked from commit 46ecc67)

Fixes pytorch#132004

Co-authored-by: pytorchbot <soumith+bot@pytorch.org>

This was referenced Aug 27, 2024

[v2.4.1] Release Tracker #132400

Closed

[ROCm] Prevent accidental enablement of efficient attention. #133331

Closed

pytorch-bot bot added ciflow/rocm Trigger "default" config CI on ROCm module: rocm AMD GPU support for Pytorch labels Aug 27, 2024

pruthvistony requested a review from atalman August 27, 2024 00:10

pruthvistony requested a review from malfet August 27, 2024 00:11

pruthvistony approved these changes Aug 27, 2024

pytorchbot added the open source label Aug 27, 2024

atalman approved these changes Aug 27, 2024

atalman merged commit 6a79d4a into release/2.4 Aug 27, 2024

atalman deleted the cherry-pick-133331-by-pytorch_bot_bot_ branch August 27, 2024 11:36
