correctly keep track of processed tensors for foreach reductions #140103

ngimel · 2024-11-08T04:20:34Z

Fixes #140066

pytorch-bot · 2024-11-08T04:20:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140103

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit cbbb78f with merge base 43f0fe6 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

aten/src/ATen/native/cuda/MultiTensorApply.cuh

test/test_foreach.py

ngimel · 2024-11-08T17:35:26Z

torch/testing/_internal/common_methods_invocations.py

I'm not sure inputs here are large enough to trigger the bug. We need to test 2 cases

Enough chunks across all the tensors to trigger multiple kernel launches. That requires tensors totalling
>65536 * 320 elements today, given that we launch at most 320 blocks

Enough tensors to trigger multiple kernel launches, don't know off the top of my head but should be at least 50-100
By looking at it, foreach inputs would generate at most 10 small-ish tensors, so might not trigger either case.
We might be better off just writing one off tests for this, wdyt?

Yea, that'd probably be easiest, the meta tests also just failed too so that makes the decision easier

Eventually we probably want both tests

torch/testing/_internal/opinfo/core.py

janeyx99 · 2024-11-08T20:40:49Z

@pytorchbot merge

pytorchmergebot · 2024-11-08T20:42:28Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…orch#140103) Fixes pytorch#140066 Pull Request resolved: pytorch#140103 Approved by: https://github.com/janeyx99 Co-authored-by: Jane Xu <janeyx@meta.com>

correctly keep track of processed tensors for foreach reductions

302424d

ngimel requested review from eqy and syed-ahmed as code owners November 8, 2024 04:20

pytorch-bot bot added the release notes: cuda release notes category label Nov 8, 2024

janeyx99 approved these changes Nov 8, 2024

View reviewed changes

aten/src/ATen/native/cuda/MultiTensorApply.cuh Show resolved Hide resolved

janeyx99 requested a review from mruberry as a code owner November 8, 2024 16:51

janeyx99 added ciflow/trunk Trigger trunk jobs on your pull request release notes: foreach_frontend release notes category and removed release notes: cuda release notes category labels Nov 8, 2024

ngimel commented Nov 8, 2024

View reviewed changes

Add empty tensor tests for our reduce ops

dacd07b

janeyx99 force-pushed the ngimel/foreach_norm branch from 544a37d to dacd07b Compare November 8, 2024 19:23

janeyx99 reviewed Nov 8, 2024

View reviewed changes

torch/testing/_internal/opinfo/core.py Show resolved Hide resolved

lint

cbbb78f

pytorchmergebot added the merging label Nov 8, 2024

pytorchmergebot added the Merged label Nov 8, 2024

pytorchmergebot closed this in 1cdaf1d Nov 8, 2024

pytorchmergebot removed the merging label Nov 8, 2024

github-actions bot deleted the ngimel/foreach_norm branch December 9, 2024 02:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

correctly keep track of processed tensors for foreach reductions #140103

correctly keep track of processed tensors for foreach reductions #140103

Uh oh!

ngimel commented Nov 8, 2024

Uh oh!

pytorch-bot bot commented Nov 8, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

ngimel Nov 8, 2024 •

edited

Loading

Uh oh!

janeyx99 Nov 8, 2024

Uh oh!

janeyx99 Nov 8, 2024

Uh oh!

Uh oh!

janeyx99 commented Nov 8, 2024

Uh oh!

pytorchmergebot commented Nov 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

correctly keep track of processed tensors for foreach reductions #140103

correctly keep track of processed tensors for foreach reductions #140103

Uh oh!

Conversation

ngimel commented Nov 8, 2024

Uh oh!

pytorch-bot bot commented Nov 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140103

✅ No Failures

Uh oh!

Uh oh!

Uh oh!

ngimel Nov 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

janeyx99 Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

janeyx99 Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

janeyx99 commented Nov 8, 2024

Uh oh!

pytorchmergebot commented Nov 8, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Nov 8, 2024 •

edited

Loading

ngimel Nov 8, 2024 •

edited

Loading