Add Current Mask Var To CSE Cache Key #140838

eellison · 2024-11-15T18:24:11Z

Stack from ghstack (oldest at bottom):

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

[ghstack-poisoned]

pytorch-bot · 2024-11-15T18:24:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140838

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[DomainsOnly] Jobs fail with GLIBC version not found

❌ 1 New Failure

As of commit b9d5073 with merge base 98e441f ():

NEW FAILURE - The following job has failed:

pull / linux-focal-cuda11.8-py3.10-gcc9 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu) (gh)
RuntimeError: distributed/test_c10d_common 1/1 failed!

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: b6bd379 Pull Request resolved: #140838

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: b258212 Pull Request resolved: #140838

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: dbba5a5 Pull Request resolved: #140838

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang aakhundov [ghstack-poisoned]

eellison · 2024-11-19T19:13:23Z

@pytorchbot merge

pytorchmergebot · 2024-11-19T19:15:07Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: e0d94eb Pull Request resolved: #140838

eellison · 2024-11-19T19:24:20Z

@pytorchbot merge

pytorchmergebot · 2024-11-19T19:24:39Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

pytorchmergebot · 2024-11-19T19:26:26Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-11-19T20:19:43Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

pull / linux-focal-cuda11.8-py3.10-gcc9 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

eellison · 2024-11-19T22:00:12Z

@pytorchbot merge -i

pytorchmergebot · 2024-11-19T22:03:07Z

Merge started

Your change will be merged while ignoring the following 1 checks: pull / linux-focal-cuda11.8-py3.10-gcc9 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

This torch.cat kernel has multiple subblocks which load from the same input. We were incorrectly reusing the mask vars from the first load for the second load. Pull Request resolved: pytorch#140838 Approved by: https://github.com/jansel ghstack dependencies: pytorch#140841

Invalidate CSE for variables emitted during masked subblock

2173ad6

[ghstack-poisoned]

eellison added a commit that referenced this pull request Nov 15, 2024

Invalidate CSE for variables emitted during masked subblock

6ec032b

ghstack-source-id: b6bd379 Pull Request resolved: #140838

pytorch-bot bot added ciflow/inductor module: inductor labels Nov 15, 2024

eellison added the topic: not user facing topic category label Nov 15, 2024

eellison mentioned this pull request Nov 15, 2024

[inductor] add support for TRITON_INTERPRET #140841

Closed

eellison requested review from Chillee, jansel and shunting314 and removed request for Chillee, jansel and shunting314 November 18, 2024 18:25

eellison added a commit that referenced this pull request Nov 18, 2024

Invalidate CSE for variables emitted during masked subblock

f952c4b

ghstack-source-id: b258212 Pull Request resolved: #140838

jansel approved these changes Nov 19, 2024

View reviewed changes

eellison changed the title ~~Invalidate CSE for variables emitted during masked subblock~~ Add Current Load Mask To CSE Var Key Nov 19, 2024

eellison changed the title ~~Add Current Load Mask To CSE Var Key~~ Add Current Mask Var To CSE Cache Key Nov 19, 2024

eellison added a commit that referenced this pull request Nov 19, 2024

Invalidate CSE for variables emitted during masked subblock

6b84d91

ghstack-source-id: dbba5a5 Pull Request resolved: #140838

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 19, 2024

pytorchmergebot added the merging label Nov 19, 2024

eellison added a commit that referenced this pull request Nov 19, 2024

Invalidate CSE for variables emitted during masked subblock

e000842

ghstack-source-id: e0d94eb Pull Request resolved: #140838

pytorchmergebot removed the merging label Nov 19, 2024

pytorchmergebot added the merging label Nov 19, 2024

pytorchmergebot added the Merged label Nov 20, 2024

pytorchmergebot closed this in eff2217 Nov 20, 2024

pytorchmergebot removed the merging label Nov 20, 2024

github-actions bot deleted the gh/eellison/728/head branch December 20, 2024 02:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Current Mask Var To CSE Cache Key #140838

Add Current Mask Var To CSE Cache Key #140838

Uh oh!

eellison commented Nov 15, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 15, 2024 •

edited

Loading

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add Current Mask Var To CSE Cache Key #140838

Add Current Mask Var To CSE Cache Key #140838

Uh oh!

Conversation

eellison commented Nov 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140838

❗ 1 Active SEVs

❌ 1 New Failure

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Merge started

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Merge started

Uh oh!

pytorchmergebot commented Nov 19, 2024

Merge failed

Uh oh!

eellison commented Nov 19, 2024

Uh oh!

pytorchmergebot commented Nov 19, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eellison commented Nov 15, 2024 •

edited

Loading

pytorch-bot bot commented Nov 15, 2024 •

edited

Loading