Improve mem efficiency of constant folding #108421

eellison · 2023-09-01T16:07:42Z

Stack from ghstack (oldest at bottom):

A couple of changes to make constant folding more memory efficient:

- Because we are replacing nodes that only have a single value, store only that single value instead of the whole tensor for node replacement.
- torch.fx.Interpreter will preserve a Tensor in the env as long as it has more uses. That applies even to output uses, but we are not going to constant fold those uses. Instead of using the last use for garbage collection, use the last non-output use.
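
A toy sketch of the two ideas above, for illustration only. It is not the code from this PR and does not use torch: `ToyNode`, `last_use`, and `compress_uniform` are hypothetical names, and plain Python lists stand in for tensors. It shows how ignoring reads by the output node moves a value's last use earlier, and how a value whose elements are all identical can be kept as one scalar.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToyNode:
    """Hypothetical stand-in for a graph node; this is not torch.fx.Node."""
    name: str
    op: str  # "placeholder", "call", or "output"
    inputs: List[str] = field(default_factory=list)


def last_use(nodes: List[ToyNode], skip_output: bool) -> Dict[str, str]:
    """Map each value name to the name of the last node that reads it.

    With skip_output=True, reads by the "output" node are ignored, so a value
    kept alive only by the graph output can be released earlier.
    """
    last: Dict[str, str] = {}
    for node in nodes:
        if skip_output and node.op == "output":
            continue
        for inp in node.inputs:
            last[inp] = node.name  # later readers overwrite earlier ones
    return last


def compress_uniform(values: List[float]):
    """Keep one scalar plus a length when every element is identical."""
    if values and all(v == values[0] for v in values):
        return ("uniform", values[0], len(values))
    return ("dense", list(values))


# "two" is read by "y" and again by the output node.
graph = [
    ToyNode("x", "placeholder"),
    ToyNode("ones", "call"),
    ToyNode("two", "call", ["ones"]),
    ToyNode("y", "call", ["x", "two"]),
    ToyNode("out", "output", ["y", "two"]),
]

with_output = last_use(graph, skip_output=False)
without_output = last_use(graph, skip_output=True)
```

Here `with_output["two"]` is `"out"`, while `without_output["two"]` is `"y"`, so under the second policy the value could be dropped as soon as `y` has been computed.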

If reviewers would prefer that I split this up with ghstack because of the code movement, let me know.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov

Differential Revision: D49020616

[ghstack-poisoned]

pytorch-bot · 2023-09-01T16:07:44Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/108421


❗ 1 Merge Blocking SEV

There is 1 active merge blocking SEV. Please view it below:

(merge blocking) GH API issues are preventing multiple CI jobs from starting.

If you must merge, use @pytorchbot merge -f.

✅ No Failures

As of commit fb0988f with merge base bae14b3:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/_inductor/constant_folding.py

Fix for #108388

ghstack-source-id: 2ecec4c Pull Request resolved: #108421

jansel · 2023-09-01T18:01:07Z

Lints/test errors

It would be easier to review if the code movement were in another PR.

ghstack-source-id: 9609414 Pull Request resolved: #108421

eellison · 2023-09-02T01:08:25Z

@pytorchbot merge

pytorchmergebot · 2023-09-02T01:10:01Z

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

eellison · 2023-09-02T01:12:52Z

@pytorchbot merge

pytorchmergebot · 2023-09-02T01:15:52Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2023-09-02T03:57:36Z

Merge failed

Reason: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x ecf525257d11ae4a5da4f1bebf948d72b58bcc74 returned non-zero exit code 1

Auto-merging test/inductor/test_torchinductor.py
CONFLICT (content): Merge conflict in test/inductor/test_torchinductor.py
CONFLICT (modify/delete): torch/_inductor/constant_folding.py deleted in HEAD and modified in ecf525257d1 (Improve mem efficiency of constant folding).  Version ecf525257d1 (Improve mem efficiency of constant folding) of torch/_inductor/constant_folding.py left in tree.
Auto-merging torch/_inductor/fx_passes/joint_graph.py
CONFLICT (content): Merge conflict in torch/_inductor/fx_passes/joint_graph.py
error: could not apply ecf525257d1... Improve mem efficiency of constant folding
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".

ghstack-source-id: cc1e242 Pull Request resolved: #108421

eellison · 2023-09-05T23:25:55Z

@pytorchbot merge

pytorchmergebot · 2023-09-05T23:27:33Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

eellison · 2023-09-06T17:59:35Z

@eellison has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Improve mem efficiency of constant folding

d337af6

[ghstack-poisoned]

github-actions bot added module: inductor ciflow/inductor labels Sep 1, 2023

Update on "Improve mem efficiency of constant folding"

055f431

Update on "Improve mem efficiency of constant folding"

1c202de

eellison requested review from jansel, voznesenskym and xw285cornell September 1, 2023 16:24

eellison added a commit that referenced this pull request Sep 1, 2023

Improve mem efficiency of constant folding

b45cb6a

ghstack-source-id: 2ecec4c Pull Request resolved: #108421

eellison mentioned this pull request Sep 1, 2023

Refactorings for constant folding #108450

Closed

eellison added a commit that referenced this pull request Sep 1, 2023

Improve mem efficiency of constant folding

ecf5252

ghstack-source-id: 9609414 Pull Request resolved: #108421

jansel approved these changes Sep 1, 2023

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 2, 2023

pytorchmergebot added the merging label Sep 2, 2023

pytorchmergebot removed the merging label Sep 2, 2023

eellison added the topic: not user facing topic category label Sep 2, 2023

pytorchmergebot added the merging label Sep 2, 2023

pytorchmergebot removed the merging label Sep 2, 2023

eellison added a commit that referenced this pull request Sep 5, 2023

Improve mem efficiency of constant folding

18bf94b

ghstack-source-id: cc1e242 Pull Request resolved: #108421

eellison mentioned this pull request Sep 5, 2023

turn back on constant folding in fbcode #108604

Closed

pytorchmergebot added the merging label Sep 5, 2023

pytorchmergebot added Merged and removed merging labels Sep 6, 2023

pytorchmergebot closed this in c8e72a4 Sep 6, 2023

facebook-github-bot deleted the gh/eellison/524/head branch September 9, 2023 14:21
