[inductor] Parallelize Max Autotune step 1: refactor autotune_process #109126
Conversation
Summary: Step 1 in revamping subprocess autotune to support multiple GPUs. This diff just does some refactoring to `autotune_process.py` in order to prepare for the next diff:

* Move all logic for managing the sub-process (like detecting sub-process crashes) into the `TuningProcess` class.
* Use `log.debug` statements instead of `print` statements.

Test Plan: `python test/inductor/test_max_autotune.py`
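The first bullet above can be sketched as follows. This is a hedged illustration of the described refactor, not the actual `torch/_inductor/autotune_process.py` code: the class name and the queue-based put/get shape follow the PR description, but every implementation detail below (method names, the shutdown sentinel, the polling interval) is an assumption.

```python
# Sketch: all sub-process lifecycle management (startup, crash detection,
# shutdown) is consolidated into one TuningProcess class, per the PR summary.
import logging
import multiprocessing
import queue
from typing import Any, Optional

log = logging.getLogger(__name__)


def _worker(requests: "multiprocessing.Queue", responses: "multiprocessing.Queue") -> None:
    # Child-process loop. The real worker would benchmark each kernel choice;
    # this illustrative stand-in simply echoes requests back.
    while True:
        req = requests.get()
        if req is None:  # shutdown sentinel (an assumed convention)
            break
        responses.put(req)


class TuningProcess:
    """Owns the autotuning sub-process and its communication queues."""

    def __init__(self) -> None:
        self.process: Optional[multiprocessing.process.BaseProcess] = None

    def initialize(self) -> None:
        # "fork" is used here so the example runs without a __main__ guard;
        # the real code may use a different start method.
        ctx = multiprocessing.get_context("fork")
        self.requests = ctx.Queue()
        self.responses = ctx.Queue()
        self.process = ctx.Process(target=_worker, args=(self.requests, self.responses))
        self.process.start()
        log.debug("Started autotune sub-process pid=%s", self.process.pid)

    def alive(self) -> bool:
        return self.process is not None and self.process.is_alive()

    def put(self, req: Any) -> None:
        if not self.alive():
            self.initialize()  # (re)start, e.g. after a detected crash
        self.requests.put(req)

    def get(self) -> Any:
        # Poll with a timeout so a crashed sub-process raises an error in the
        # parent instead of blocking forever on a queue that never fills.
        while True:
            try:
                return self.responses.get(timeout=1.0)
            except queue.Empty:
                if not self.alive():
                    raise RuntimeError(
                        f"autotune sub-process died, exitcode={self.process.exitcode}"
                    )

    def terminate(self) -> None:
        if self.alive():
            self.requests.put(None)
            self.process.join()
```

The point of the refactor is that callers only see `put`/`get`, while crash handling stays in one place.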
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/109126

Note: Links to docs will display an error until the docs builds have been completed. ✅ No failures as of commit 143ebfa with merge base 264f1e7. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
@shunting314 FYI this is a redo of #107982. We had to revert that change because it didn't play well in fbcode. In fbcode, everything is a .xar file and we got feedback that we can't necessarily guarantee the proper environment for a subprocess started via Popen. So this change goes back to using multiprocessing and multiprocessing queues. This change just does some reorg to make the next diff in the stack a little easier to review.
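The environment point above can be illustrated with a minimal sketch (not code from this PR): a `multiprocessing` child is forked from the running interpreter, so it inherits the parent's interpreter and import state, whereas `subprocess.Popen` must locate and launch a fresh Python executable, which is what cannot be guaranteed inside a packaged .xar binary.

```python
# Illustration only: the multiprocessing child shares the parent's
# interpreter image, while a Popen child would need to re-launch Python
# from an executable path that may not exist in a packaged environment.
import multiprocessing
import os
import sys


def child(q: "multiprocessing.Queue") -> None:
    # Runs in the forked worker: report which interpreter and pid we are.
    q.put((sys.executable, os.getpid()))


# "fork" keeps the example runnable as a plain script on POSIX systems.
ctx = multiprocessing.get_context("fork")
q = ctx.Queue()
p = ctx.Process(target=child, args=(q,))
p.start()
exe, pid = q.get()
p.join()
# Same interpreter as the parent, but a different process.
print(exe == sys.executable, pid != os.getpid())  # → True True
```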
Test Plan:

* `python test/inductor/test_max_autotune.py`
* `TORCHINDUCTOR_AUTOTUNE_IN_SUBPROC=1 TORCHINDUCTOR_MAX_AUTOTUNE=1 python benchmarks/dynamo/torchbench.py --device cuda --performance --backend inductor --inference --only hf_Bart`
* `TORCHINDUCTOR_AUTOTUNE_MULTI_DEVICE=1 TORCHINDUCTOR_AUTOTUNE_IN_SUBPROC=1 TORCHINDUCTOR_MAX_AUTOTUNE=1 python benchmarks/dynamo/torchbench.py --device cuda --performance --backend inductor --inference --only hf_Bart`

Pull Request resolved: #109127. Approved by: https://github.com/shunting314, https://github.com/eellison. ghstack dependencies: #109126.
Stack from ghstack (oldest at bottom):
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov