[AOTInductor] Enforce no_grad for Run entries #111613

SherlockNoMad · 2023-10-19T22:34:28Z

Summary:
Always enter no_grad mode in AOTInductor run entries.

// AOTInductor uses at::addmm_out, which doesn't supports
// arguments that requires gradient. For this reason, we
// enforce no_grad context for run APIs.

Test Plan:
buck2 test mode/dev-nosan caffe2/test/inductor:test_aot_inductor

and OSS CI

Differential Revision: D50432042

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler

pytorch-bot · 2023-10-19T22:34:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/111613

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fae89ab with merge base 6a99291 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2023-10-19T22:34:40Z

This pull request was exported from Phabricator. Differential Revision: D50432042

torch/_inductor/codegen/aoti_runtime/interface.cpp

chenyang78

I think Oleg made a good point regarding recovering from Exceptions. LGTM otherwise. Thanks!

desertfire · 2023-10-20T12:35:03Z

torch/_inductor/codegen/aoti_runtime/interface.cpp

+// enforce no_grad context for run APIs.
+#define WITH_NO_GRAD(...)                               \
+  do {                                                  \
+    bool prev_mode = aoti_torch_grad_mode_is_enabled(); \


Consider print a warning msg if prev_mode is true.

This will be invoked in critical path of run(), so I shouldn't print log at every run() call.

I can print warning at ContainerCreate(), but there is a chance that create() is called under no_grad, but run() is not.

merging now.
I can add logs latter if required.

Summary: Always enter no_grad mode in AOTInductor run entries. ``` // AOTInductor uses at::addmm_out, which doesn't supports // arguments that requires gradient. For this reason, we // enforce no_grad context for run APIs. ``` Test Plan: buck2 test mode/dev-nosan caffe2/test/inductor:test_aot_inductor and OSS CI Reviewed By: chenyang78 Differential Revision: D50432042

facebook-github-bot · 2023-10-23T16:42:30Z

This pull request was exported from Phabricator. Differential Revision: D50432042

SherlockNoMad · 2023-10-23T16:45:39Z

@pytorchbot merge

pytorchmergebot · 2023-10-23T16:47:22Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-10-23T17:40:01Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Lint / lintrunner / linux-job

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

facebook-github-bot · 2023-10-23T20:39:38Z

This pull request was exported from Phabricator. Differential Revision: D50432042

SherlockNoMad · 2023-10-23T20:39:59Z

@pytorchbot merge

pytorchmergebot · 2023-10-23T20:41:49Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-10-23T21:29:04Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 1, 5, linux.g5.4xlarge.nvidia.gpu)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

Summary: Always enter no_grad mode in AOTInductor run entries. ``` // AOTInductor uses at::addmm_out, which doesn't supports // arguments that requires gradient. For this reason, we // enforce no_grad context for run APIs. ``` Test Plan: buck2 test mode/dev-nosan caffe2/test/inductor:test_aot_inductor and OSS CI Reviewed By: khabinov, chenyang78 Differential Revision: D50432042

facebook-github-bot · 2023-10-23T22:59:37Z

This pull request was exported from Phabricator. Differential Revision: D50432042

facebook-github-bot · 2023-10-27T05:52:30Z

This pull request was exported from Phabricator. Differential Revision: D50432042

Summary: Always enter no_grad mode in AOTInductor run entries. ``` // AOTInductor uses at::addmm_out, which doesn't supports // arguments that requires gradient. For this reason, we // enforce no_grad context for run APIs. ``` Test Plan: buck2 test mode/dev-nosan caffe2/test/inductor:test_aot_inductor and OSS CI Reviewed By: khabinov, chenyang78 Differential Revision: D50432042

facebook-github-bot · 2023-10-27T05:53:59Z

This pull request was exported from Phabricator. Differential Revision: D50432042

SherlockNoMad · 2023-10-27T06:38:58Z

@pytorchbot merge

pytorchmergebot · 2023-10-27T06:40:53Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: Always enter no_grad mode in AOTInductor run entries. ``` // AOTInductor uses at::addmm_out, which doesn't supports // arguments that requires gradient. For this reason, we // enforce no_grad context for run APIs. ``` Test Plan: buck2 test mode/dev-nosan caffe2/test/inductor:test_aot_inductor and OSS CI Differential Revision: D50432042 Pull Request resolved: pytorch#111613 Approved by: https://github.com/chenyang78, https://github.com/khabinov

facebook-github-bot added the fb-exported label Oct 19, 2023

github-actions bot added module: inductor ciflow/inductor and removed fb-exported labels Oct 19, 2023

SherlockNoMad added the topic: not user facing topic category label Oct 19, 2023

SherlockNoMad requested review from chenyang78, desertfire and khabinov October 19, 2023 22:35

khabinov reviewed Oct 19, 2023

View reviewed changes

torch/_inductor/codegen/aoti_runtime/interface.cpp Outdated Show resolved Hide resolved

chenyang78 approved these changes Oct 20, 2023

View reviewed changes

desertfire reviewed Oct 20, 2023

View reviewed changes

SherlockNoMad force-pushed the export-D50432042 branch from cd061d9 to f08b46d Compare October 23, 2023 16:42

facebook-github-bot added the fb-exported label Oct 23, 2023

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 23, 2023

pytorchmergebot added the merging label Oct 23, 2023

pytorchmergebot removed the merging label Oct 23, 2023

khabinov approved these changes Oct 23, 2023

View reviewed changes

SherlockNoMad force-pushed the export-D50432042 branch from f08b46d to dd2d917 Compare October 23, 2023 20:39

pytorchmergebot added the merging label Oct 23, 2023

pytorchmergebot removed the merging label Oct 23, 2023

SherlockNoMad force-pushed the export-D50432042 branch from dd2d917 to ad7f569 Compare October 23, 2023 22:59

SherlockNoMad force-pushed the export-D50432042 branch from ad7f569 to b5fd0e3 Compare October 27, 2023 05:52

SherlockNoMad force-pushed the export-D50432042 branch from b5fd0e3 to fae89ab Compare October 27, 2023 05:53

pytorchmergebot added the merging label Oct 27, 2023

pytorchmergebot added Merged and removed merging labels Oct 27, 2023

pytorchmergebot closed this in 7265c22 Oct 27, 2023

[AOTInductor] Enforce no_grad for Run entries #111613

[AOTInductor] Enforce no_grad for Run entries #111613

Uh oh!

Conversation

SherlockNoMad commented Oct 19, 2023 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/111613

✅ No Failures

Uh oh!

facebook-github-bot commented Oct 19, 2023

Uh oh!

Uh oh!

chenyang78 left a comment

Choose a reason for hiding this comment

Uh oh!

desertfire Oct 20, 2023

Choose a reason for hiding this comment

Uh oh!

SherlockNoMad Oct 23, 2023

Choose a reason for hiding this comment

Uh oh!

SherlockNoMad Oct 23, 2023

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Oct 23, 2023

Uh oh!

SherlockNoMad commented Oct 23, 2023

Uh oh!

pytorchmergebot commented Oct 23, 2023

Merge started

Uh oh!

pytorchmergebot commented Oct 23, 2023

Merge failed

Uh oh!

facebook-github-bot commented Oct 23, 2023

Uh oh!

SherlockNoMad commented Oct 23, 2023

Uh oh!

pytorchmergebot commented Oct 23, 2023

Merge started

Uh oh!

pytorchmergebot commented Oct 23, 2023

Merge failed

Uh oh!

facebook-github-bot commented Oct 23, 2023

Uh oh!

facebook-github-bot commented Oct 27, 2023

Uh oh!

facebook-github-bot commented Oct 27, 2023

Uh oh!

SherlockNoMad commented Oct 27, 2023

Uh oh!

pytorchmergebot commented Oct 27, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

SherlockNoMad commented Oct 19, 2023 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Oct 19, 2023 •

edited

Loading