[pt2e][quant] Make move_exported_model_to_train/eval idempotent by andrewor14 · Pull Request #142239 · pytorch/pytorch

Conversation

@andrewor14 (Contributor) commented Dec 6, 2024

Stack from ghstack (oldest at bottom):

Summary: Previously, we would recompile the model unnecessarily even
if it was already in the desired mode. For training frameworks that
assume `model.train()` is idempotent and call it before every single
training step, this led to a bunch of tiny graphs and poor
performance. This commit makes these calls no-ops if the model is
already in the target train/eval mode.

Test Plan:
python test/test_quantization.py -k TestQuantizePT2E.test_allow_exported_model_train_eval_idempotent
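
A minimal sketch of the early-return guard being described, assuming the mode-moving pass sits behind a single helper; `_move_exported_model_to_mode` and its body are illustrative, not the actual implementation:

```python
import torch.fx

def _move_exported_model_to_mode(
    model: torch.fx.GraphModule, train: bool
) -> torch.fx.GraphModule:
    # Hypothetical guard: if the model is already in the target mode,
    # return immediately so repeated train()/eval() calls are no-ops
    # instead of triggering another graph rewrite and recompile.
    if model.training == train:
        return model
    # ... the existing pass that swaps train/eval variants of ops
    # such as dropout and batch norm would run here ...
    model.training = train  # record the new mode on the module
    return model
```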

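And a rough sketch of how the idempotence can be exercised through the public `torch.ao.quantization.move_exported_model_to_eval` entry point; the toy module and the graph-string comparison are assumptions, not the actual test body:

```python
import torch
import torch.ao.quantization

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.bn = torch.nn.BatchNorm2d(3)
        self.dropout = torch.nn.Dropout(0.5)

    def forward(self, x):
        return self.dropout(self.bn(x))

example_inputs = (torch.randn(1, 3, 8, 8),)
m = torch.export.export(M(), example_inputs).module()

torch.ao.quantization.move_exported_model_to_eval(m)
graph_before = str(m.graph)
# With this change, a second call is a no-op rather than a recompile,
# so the exported graph should be left untouched.
torch.ao.quantization.move_exported_model_to_eval(m)
assert str(m.graph) == graph_before
```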
pytorch-bot bot commented Dec 6, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142239

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b0d946e with merge base 34033cc:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: quantization release notes category label Dec 6, 2024
…otent"

Summary: Before we would recompile the model unnecessarily even
if the model is already in the desired mode. For training
frameworks that assume `model.train()` is idempotent and calls
this before every single training step, this led to a bunch of
tiny graphs and poor performance. This commit makes these calls
no-ops if we're already in the target train/eval mode.

Test Plan:
python test/test_quantization -k TestQuantizePT2E.test_allow_exported_model_train_eval_idempotent

[ghstack-poisoned]
…otent"

Summary: Before we would recompile the model unnecessarily even
if the model is already in the desired mode. For training
frameworks that assume `model.train()` is idempotent and calls
this before every single training step, this led to a bunch of
tiny graphs and poor performance. This commit makes these calls
no-ops if we're already in the target train/eval mode.

Test Plan:
python test/test_quantization -k TestQuantizePT2E.test_allow_exported_model_train_eval_idempotent

[ghstack-poisoned]
andrewor14 added a commit that referenced this pull request Dec 6, 2024
ghstack-source-id: f0e6c78
Pull Request resolved: #142239
…otent"

Summary: Before we would recompile the model unnecessarily even
if the model is already in the desired mode. For training
frameworks that assume `model.train()` is idempotent and calls
this before every single training step, this led to a bunch of
tiny graphs and poor performance. This commit makes these calls
no-ops if we're already in the target train/eval mode.

Test Plan:
python test/test_quantization -k TestQuantizePT2E.test_allow_exported_model_train_eval_idempotent

[ghstack-poisoned]
andrewor14 added a commit that referenced this pull request Dec 6, 2024
ghstack-source-id: 6848f32
Pull Request resolved: #142239
@andrewor14 (Contributor, Author) commented

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 9, 2024
@pytorchmergebot (Collaborator) commented

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@github-actions github-actions bot deleted the gh/andrewor14/51/head branch January 9, 2025 02:20

Labels: ciflow/trunk · Merged · release notes: quantization
