KEMBAR78
Profile guided optimization for automatic_dynamic by ezyang · Pull Request #139001 · pytorch/pytorch · GitHub
Skip to content

Conversation

ezyang
Copy link
Contributor

@ezyang ezyang commented Oct 27, 2024

Stack from ghstack (oldest at bottom):

Previously: #138052 but the implementation is done from scratch, so I open a new PR.

This implements the ability to save and load profiles of automatic dynamic decisions, so on subsequent runs we can directly make something automatically dynamic. Unlike the previous implementation, this cache is never enabled by default; instead, you have to specify a "job id" that says it's OK to share results. We will be able to automatically populate this id for internal MAST jobs but for generic OSS users you will have to explicitly opt into it.

Signed-off-by: Edward Z. Yang ezyang@meta.com

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov @rec

Differential Revision: D65065497

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 27, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139001

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 1ae1682 with merge base 92d7f29 (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ezyang added a commit that referenced this pull request Oct 27, 2024
Previously: #138052
but the implementation is done from scratch, so I open a new PR.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: e1f0761
Pull Request resolved: #139001
@ezyang ezyang added the topic: new features topic category label Oct 28, 2024
[ghstack-poisoned]
ezyang added a commit that referenced this pull request Oct 28, 2024
Previously: #138052
but the implementation is done from scratch, so I open a new PR.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: 6dfd835
Pull Request resolved: #139001
@ezyang
Copy link
Contributor Author

ezyang commented Oct 28, 2024

@ezyang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 28, 2024
[ghstack-poisoned]
ezyang added a commit that referenced this pull request Oct 28, 2024
Previously: #138052
but the implementation is done from scratch, so I open a new PR.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: 868296a
Pull Request resolved: #139001
[ghstack-poisoned]
@ezyang
Copy link
Contributor Author

ezyang commented Oct 29, 2024

@ezyang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

[ghstack-poisoned]
ezyang added a commit that referenced this pull request Oct 30, 2024
Previously: #138052
but the implementation is done from scratch, so I open a new PR.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: 79b1b5c
Pull Request resolved: #139001
@ezyang
Copy link
Contributor Author

ezyang commented Oct 30, 2024

@ezyang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

pytorchmergebot added a commit that referenced this pull request Nov 2, 2024
This reverts commit f6be44c.

Reverted #139001 on behalf of https://github.com/ezyang due to more fbcode errors ([comment](#139001 (comment)))
@pytorchmergebot
Copy link
Collaborator

@ezyang your PR has been successfully reverted.

[ghstack-poisoned]
ezyang added a commit that referenced this pull request Nov 2, 2024
Previously: #138052
but the implementation is done from scratch, so I open a new PR.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: 78f0975
Pull Request resolved: #139001
@ezyang
Copy link
Contributor Author

ezyang commented Nov 2, 2024

@ezyang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@ezyang
Copy link
Contributor Author

ezyang commented Nov 2, 2024

Landing via fbcode as direct fbcode change needed

@ezyang
Copy link
Contributor Author

ezyang commented Nov 2, 2024

@pytorchbot merge -f "already merged in fbcode"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: This PR has internal changes and must be landed via Phabricator! Please try reimporting/rexporting the PR!

Details for Dev Infra team Raised by workflow job

@ezyang
Copy link
Contributor Author

ezyang commented Nov 3, 2024

@pytorchbot merge -f "already merged in fbcode"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: This PR has internal changes and must be landed via Phabricator! Please try reimporting/rexporting the PR!

Details for Dev Infra team Raised by workflow job

@huydhn
Copy link
Contributor

huydhn commented Nov 3, 2024

@pytorchbot merge -f "already merged in fbcode"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
Previously: pytorch#138052 but the implementation is done from scratch, so I open a new PR.

This implements the ability to save and load profiles of automatic dynamic decisions, so on subsequent runs we can directly make something automatically dynamic. Unlike the previous implementation, this cache is never enabled by default; instead, you have to specify a "job id" that says it's OK to share results. We will be able to automatically populate this id for internal MAST jobs but for generic OSS users you will have to explicitly opt into it.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Differential Revision: [D65065497](https://our.internmc.facebook.com/intern/diff/D65065497)
Pull Request resolved: pytorch#139001
Approved by: https://github.com/oulgen
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
…9001)"

This reverts commit a6630bc.

Reverted pytorch#139001 on behalf of https://github.com/ezyang due to internal code triggers import cycle ([comment](pytorch#139001 (comment)))
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
Previously: pytorch#138052 but the implementation is done from scratch, so I open a new PR.

This implements the ability to save and load profiles of automatic dynamic decisions, so on subsequent runs we can directly make something automatically dynamic. Unlike the previous implementation, this cache is never enabled by default; instead, you have to specify a "job id" that says it's OK to share results. We will be able to automatically populate this id for internal MAST jobs but for generic OSS users you will have to explicitly opt into it.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Differential Revision: [D65065497](https://our.internmc.facebook.com/intern/diff/D65065497)
Pull Request resolved: pytorch#139001
Approved by: https://github.com/oulgen
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
Previously: pytorch#138052 but the implementation is done from scratch, so I open a new PR.

This implements the ability to save and load profiles of automatic dynamic decisions, so on subsequent runs we can directly make something automatically dynamic. Unlike the previous implementation, this cache is never enabled by default; instead, you have to specify a "job id" that says it's OK to share results. We will be able to automatically populate this id for internal MAST jobs but for generic OSS users you will have to explicitly opt into it.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Pull Request resolved: pytorch#139001
Approved by: https://github.com/oulgen
@github-actions github-actions bot deleted the gh/ezyang/2984/head branch December 5, 2024 02:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants