[inductor] Fix ReinterpretView call in TMADescriptor IR by aakhundov · Pull Request #138759 · pytorch/pytorch · GitHub

Conversation

@aakhundov (Contributor) commented Oct 23, 2024

Stack from ghstack (oldest at bottom):

As a result of #137768, the `ReinterpretView` call in the `TMADescriptor` IR node has become invalid, which breaks several TMA tests in `test_triton_kernels.py`. This PR fixes the call.
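
For illustration, here is a minimal, self-contained sketch of the failure mode, not the PR's actual diff: it assumes (as hypothesized here) that the refactor in #137768 turned `ReinterpretView` into a keyword-only dataclass, so old positional call sites like the one in `TMADescriptor` raise a `TypeError` until updated. The stand-in class and field names below are hypothetical.

```python
# Illustrative stand-in, NOT PyTorch code (requires Python 3.10+ for kw_only):
# models ReinterpretView after a hypothetical kw-only dataclass conversion.
from dataclasses import dataclass


@dataclass(kw_only=True)  # fields can no longer be passed positionally
class ReinterpretViewLike:
    data: object    # hypothetical field: the underlying buffer
    layout: object  # hypothetical field: the layout to reinterpret it as


# Old-style positional call site: breaks after the conversion.
try:
    ReinterpretViewLike("buf0", "FixedLayout(...)")
except TypeError as err:
    print(f"positional call fails: {err}")

# Fixed call site: pass the fields by keyword.
view = ReinterpretViewLike(data="buf0", layout="FixedLayout(...)")
print(f"keyword call works: {view}")
```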

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

[ghstack-poisoned]
@pytorch-bot bot commented Oct 23, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138759

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 8e797e2 with merge base 72ea7ba:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

aakhundov added a commit that referenced this pull request Oct 23, 2024
As a result of #137768, the `ReinterpretView` call in the `TMADescriptor`
has become invalid. This leads to some TMA tests breaking in
test_triton_kernels.py. This PR fixes that.

ghstack-source-id: 0f88d4f
Pull Request resolved: #138759
@aakhundov aakhundov requested review from Chillee and eellison October 23, 2024 22:45
@aakhundov aakhundov added the `topic: not user facing` label Oct 23, 2024
@Chillee (Collaborator) left a comment

Wait, how come I didn't see this on my PR?

@aakhundov (Contributor, Author)

> Wait, how come I didn't see this on my PR?

You landed your PR on 10/14, before I landed mine on 10/17. And, apparently, my base rev was older than 10/14 when I landed. I guess it's a good habit to rebase onto the newest viable/strict before landing. Will keep that in mind.

@Chillee (Collaborator) commented Oct 23, 2024

Eh, I should probably have warned the chat that this PR had a high chance of land races.

@aakhundov (Contributor, Author)

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the `ciflow/trunk` label (trigger trunk jobs on your pull request) Oct 24, 2024
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.

@pytorchmergebot (Collaborator)

Merge failed

Reason: 1 job has failed: inductor-periodic / cuda12.1-py3.10-gcc9-sm80 / test (inductor_torchbench_smoketest_perf, 1, 1, linux.gcp.a100)


@aakhundov (Contributor, Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.

@pytorchmergebot (Collaborator)

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
For more information, see the pytorch-bot wiki.

@aakhundov (Contributor, Author)

@pytorchbot merge -f "unrelated failing and hanging CI jobs"

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as a last resort and instead consider -i/--ignore-current to continue the merge while ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status.

pytorchmergebot pushed a commit that referenced this pull request Oct 26, 2024
This fixes some leftover typos in `CreateTMADescriptorVariable.call_function` (and nearby code).

Pull Request resolved: #138877
Approved by: https://github.com/davidberard98, https://github.com/zou3519, https://github.com/Skylion007
ghstack dependencies: #138759
pytorchmergebot pushed a commit that referenced this pull request Oct 28, 2024
This adds host-side Triton TMA support to AOTInductor. Notes:

- Two helper functions, `init1DTMADescriptor` and `init2DTMADescriptor`, are added to the C++ wrapper codegen on GPU, conditioned on the model having user-defined Triton kernels with host-side TMA (CUDA-specific).
- The C++ wrapper codegen on GPU emits TMA descriptor initialization via the aforementioned helper functions.
- Special handling is added for the TMA descriptors (in the Python wrapper codegen) during compile-time autotuning, as the underlying tensor can't be passed directly to the user-defined Triton kernel. TMA descriptors are generated between the source tensor's buffer and the kernel call, as in the full Python wrapper codegen.
- This PR concludes the host-side Triton TMA support in PT2 (a usage sketch follows after this entry).

Pull Request resolved: #138878
Approved by: https://github.com/desertfire, https://github.com/chenyang78
ghstack dependencies: #138759, #138877
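
For context, here is a hedged sketch of the user-level pattern this stack has to support: a user-defined Triton kernel that consumes host-side TMA descriptors, which the wrapper codegen must recreate between the source buffer and the kernel call. It assumes Triton's experimental host-side TMA API (`triton.tools.experimental_descriptor`) and a Hopper-class (sm90) GPU; the kernel and helper below are illustrative, not code from this PR.

```python
# Hedged sketch: a user-defined Triton kernel using host-side TMA, the kind
# of kernel the Inductor/AOTInductor support described above must handle.
# Assumes Triton's experimental descriptor API on an H100-class GPU.
import torch
import triton
import triton.language as tl
from triton.tools.experimental_descriptor import create_1d_tma_descriptor


@triton.jit
def add_one_kernel(in_desc, out_desc, BLOCK: tl.constexpr):
    pid = tl.program_id(0)
    off = pid * BLOCK
    # Loads/stores go through the TMA descriptors rather than raw pointers.
    x = tl._experimental_descriptor_load(in_desc, [off], [BLOCK], tl.float32)
    tl._experimental_descriptor_store(out_desc, x + 1.0, [off])


def add_one(x: torch.Tensor, BLOCK: int = 256) -> torch.Tensor:
    out = torch.empty_like(x)
    # Host-side descriptor creation: the step the wrapper codegen emits
    # between the source buffer and the kernel call (in the C++ wrapper,
    # via the init1DTMADescriptor helper mentioned above).
    in_desc = create_1d_tma_descriptor(
        x.data_ptr(), x.numel(), BLOCK, x.element_size()
    )
    out_desc = create_1d_tma_descriptor(
        out.data_ptr(), out.numel(), BLOCK, out.element_size()
    )
    grid = (triton.cdiv(x.numel(), BLOCK),)
    add_one_kernel[grid](in_desc, out_desc, BLOCK=BLOCK)
    return out
```

Under torch.compile, Inductor's Python wrapper generates the equivalent descriptor-creation calls itself; this stack teaches the AOTInductor C++ wrapper to do the same.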
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Oct 29, 2024 (same commit message as above; pull request resolved: pytorch#138878)
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024 (same commit message as above; pull request resolved: pytorch#138878)
@github-actions github-actions bot deleted the gh/aakhundov/12/head branch November 25, 2024 02:11