stop non-differentiable values from being materialized in aotautograd by Chillee · Pull Request #110721 · pytorch/pytorch

Conversation

@Chillee (Collaborator) commented Oct 6, 2023

@pytorch-bot bot commented Oct 6, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110721

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit a10ef1b with merge base e3bf500:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Chillee added a commit that referenced this pull request Oct 6, 2023
@Chillee (Collaborator, Author) commented Oct 7, 2023

Interestingly, this seems to have a significant impact on our benchmarks, I'm guessing primarily due to eliding a bunch of memory copies in the backward pass.

[benchmark results screenshot]

For an example model (levit_128), this is what the trace looks like prior to this change. This change results in the green area and second pink area disappearing.

[profiler trace screenshot]
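For context, a minimal sketch of the kind of pattern this affects (the function, names, and shapes here are illustrative, not taken from the benchmark suite): a compiled function that returns a non-differentiable (integer) output alongside a differentiable one, where the integer output should not get a tangent materialized for it in the backward.

```python
import torch

# Illustrative only: one differentiable output and one non-differentiable
# (integer) output. The point of this PR is that the integer output should
# not have a tangent materialized for it in the traced backward.
def f(x):
    y = x.sin()
    idx = y.argmax(dim=-1)  # integer tensor, not differentiable
    return y, idx

compiled_f = torch.compile(f)

x = torch.randn(8, 128, requires_grad=True)
y, idx = compiled_f(x)
y.sum().backward()  # only y contributes gradients; idx gets no tangent
```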

@ezyang (Contributor) commented Oct 7, 2023

This seems plausible, but you still have test errors.

@Chillee (Collaborator, Author) commented Oct 7, 2023

Where do you see test errors? 🤔

@ezyang removed their request for review October 9, 2023 16:28
@ezyang (Contributor) commented Oct 9, 2023

I'm letting Brian review this

@Chillee (Collaborator, Author) commented Oct 9, 2023

@pytorchbot merge

@pytorch-bot bot commented Oct 9, 2023

This PR needs to be approved by an authorized maintainer before merge.

inp_tangents_filtered = [
    x
    for x, info_idx in zip(inp_tangents, mutated_inp_indices)
    if input_info[info_idx].mutates_data and input_info[info_idx].requires_grad
]
A Contributor commented on this snippet:
Hmm, I think the bit of the filter on `input_info[info_idx].requires_grad` should be unnecessary now?

Previously, inp_tangents corresponded to "every forward input that had a (data or metadata) mutation". We had to filter this down to the actual inputs to our backward graph that correspond to user forward inputs, which are fw inputs that had a data mutation and require grad. (Actually - double checking, I think we're already correctly filtering out metadata mutations).

But now that you're filtering down to fw inputs that have a mutation and require grad even earlier (as part of constructing the joint fw/bw), we should only have to filter out inputs with metadata-only mutations in this check.

@Chillee (Collaborator, Author) replied:

For future discussion, this isn't true. In this case, inp_tangents represents all of the inputs to the backward pass, which will include values that correspond to tensors that we don't actually trace with (for example, outputs of the forward pass that are non-differentiable).

So we still need to filter them out in this place; the logic is pretty analogous to the existing filtering.
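As a rough illustration of what filtering out non-differentiable values looks like (a sketch with made-up names, not the actual AOTAutograd code):

```python
import torch

def filter_differentiable_tangents(fw_outs, out_tangents):
    # Sketch: only differentiable forward outputs keep a tangent input to
    # the backward graph; non-differentiable outputs (e.g. integer tensors,
    # or tensors that don't require grad) are skipped instead of having a
    # zero tangent materialized for them.
    return [
        tangent
        for out, tangent in zip(fw_outs, out_tangents)
        if isinstance(out, torch.Tensor)
        and out.requires_grad
        and (out.is_floating_point() or out.is_complex())
    ]
```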

@Chillee (Collaborator, Author) commented Oct 9, 2023

@pytorchbot merge

@pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Oct 9, 2023
@pytorchmergebot (Collaborator) commented:
Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.
