[ROCm] Use ieee precision for fp32 in flex attention #135702
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/135702
Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 Cancelled Jobs, 5 Unrelated Failures — as of commit eddebed with merge base 6700175.

CANCELLED JOBS - The following jobs were cancelled. Please retry:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
BROKEN TRUNK - The following jobs failed but were already present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Hmm, failures are probably not related. I'll rebase and see if they are green.
@pytorchbot rebase
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here.
Successfully rebased 748e495 to eddebed.
@pytorchbot merge -f "Fix ROCm CI failures in inductor/test_flex_encoding.py"

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
pytorch@3bebc09 brought in a change to flex_attention that allows TF32 precision; TF32 largely lacks support on the ROCm side, so we should use ieee.

Pull Request resolved: pytorch#135702
Approved by: https://github.com/jeffdaily, https://github.com/drisspg
The cherry-pick PR is at #136557
* [ROCm] skip test_fp8_cast_and_t on non-MI300 machines (#135917)
  Fixes #ISSUE_NUMBER
  Pull Request resolved: #135917
  Approved by: https://github.com/malfet
  (cherry picked from commit 6cdc70b)
* Skip pointwise associative scan tests due to regression (changes based on PR #135995)
* Cherry-pick fix from #135702

Co-authored-by: Prachi Gupta <prachi.gupta@amd.com>
Co-authored-by: Jithun Nair <jithun.nair@amd.com>
3bebc09
Brought in a change to flex_attention that allows TF32 precision; TF32 largely lacks support on the ROCm side, so we should use ieee.
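For illustration, here is a minimal sketch of the kind of backend-dependent precision selection this PR is about; it is not the actual diff, and `fp32_dot_precision` is a hypothetical helper name. `torch.version.hip` (set only on ROCm builds) and `torch.backends.cuda.matmul.allow_tf32` are existing PyTorch attributes:

```python
import torch

def fp32_dot_precision() -> str:
    """Hypothetical helper: choose the dot-product input precision
    for fp32 in an attention kernel.

    TF32 largely lacks support on ROCm, so fall back to IEEE there;
    on CUDA, honor the global TF32 switch for float32 matmuls.
    """
    if torch.version.hip is not None:  # set only on ROCm builds
        return "ieee"
    return "tf32" if torch.backends.cuda.matmul.allow_tf32 else "ieee"
```

In recent Triton versions, a string like this can be passed to `tl.dot` via its `input_precision` argument, which is how a generated kernel would pin fp32 matmuls to full IEEE precision.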
cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @hongxiayang @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang