[AOTInductor] Option to not include weight in .so #141997

muchulee8 · 2024-12-03T21:27:14Z

Summary: Add an option in config to not include weights in .so

Test Plan: test/inductor:test_aot_inductor -- -r test_so_without_weight_cuda

Reviewed By: desertfire

Differential Revision: D65968885

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @ColinPeppler @amjames @desertfire @chauhang @aakhundov

pytorch-bot · 2024-12-03T21:27:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141997

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (3 Unrelated Failures)

As of commit 31e6b93 with merge base d648133 ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

inductor-rocm / rocm6.2-py3.10-inductor / test (inductor, 1, 2, linux.rocm.gpu.2) (gh) (similar failure)
##[error]Credentials could not be loaded, please check your action inputs: Could not load credentials from any providers
inductor-rocm / rocm6.2-py3.10-inductor / test (inductor, 2, 2, linux.rocm.gpu.2) (gh) (similar failure)
##[error]Credentials could not be loaded, please check your action inputs: Could not load credentials from any providers

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

inductor / cuda12.1-py3.10-gcc9-sm86 / test (inductor_timm, 1, 2, linux.g5.4xlarge.nvidia.gpu, unstable) (gh) (#141703)
convnext_base

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-12-03T21:27:24Z

This pull request was exported from Phabricator. Differential Revision: D65968885

facebook-github-bot · 2024-12-03T22:41:27Z

This pull request was exported from Phabricator. Differential Revision: D65968885

facebook-github-bot · 2024-12-03T22:50:03Z

This pull request was exported from Phabricator. Differential Revision: D65968885

Summary: Add an option in config to not include weights in .so Test Plan: `test/inductor:test_aot_inductor -- -r test_so_without_weight_cuda` Reviewed By: desertfire Differential Revision: D65968885

facebook-github-bot · 2024-12-04T05:49:38Z

This pull request was exported from Phabricator. Differential Revision: D65968885

facebook-github-bot · 2024-12-05T03:27:38Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2024-12-05T03:29:57Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: With the changes in #140755 and #141997, I added a load_constants function to the packaging API. Currently this doesn't work for cpu. The workflow is something like: ``` ep = torch.export.export(model, example_inputs) package = torch._inductor.aoti_compile_and_package(ep, inductor_configs=inductor_configs) compiled = torch._inductor.aoti_load_package(package) print(compiled.get_constant_fqns()) # see what are the fqns needed/available compiled.load_constants(new_state_dict, check_full_update=True) # update the constants in AOTI ``` You can also use the `aot_inductor.package_constants_in_so` config to stop including the constants in the so: ``` package = torch._inductor.aoti_compile_and_package(ep, inductor_configs={`aot_inductor.package_constants_in_so`: False) compiled = torch._inductor.aoti_load_package(package) compiled(*inputs) # segfaults because there are no constants --> we should probably have a better error msg compiled.load_constants(new_state_dict, check_full_update=True) compiled(*inputs) ``` Test Plan: `buck2 run @//mode/dev-nosan //caffe2/test/inductor:aot_inductor_package -- -r "test_so_without_weight" ` Differential Revision: D66796206

Summary: Pull Request resolved: pytorch#142246 With the changes in pytorch#140755 and pytorch#141997, I added a load_constants function to the packaging API. Currently this doesn't work for cpu. The workflow is something like: ``` ep = torch.export.export(model, example_inputs) package = torch._inductor.aoti_compile_and_package(ep, inductor_configs=inductor_configs) compiled = torch._inductor.aoti_load_package(package) print(compiled.get_constant_fqns()) # see what are the fqns needed/available compiled.load_constants(new_state_dict, check_full_update=True) # update the constants in AOTI ``` You can also use the `aot_inductor.package_constants_in_so` config to stop including the constants in the so: ``` package = torch._inductor.aoti_compile_and_package(ep, inductor_configs={`aot_inductor.package_constants_in_so`: False) compiled = torch._inductor.aoti_load_package(package) compiled(*inputs) # segfaults because there are no constants --> we should probably have a better error msg compiled.load_constants(new_state_dict, check_full_update=True) compiled(*inputs) ``` Test Plan: `buck2 run @//mode/dev-nosan //caffe2/test/inductor:aot_inductor_package -- -r "test_so_without_weight" ` Reviewed By: henrylhtsang Differential Revision: D66796206

Summary: With the changes in #140755 and #141997, I added a load_constants function to the packaging API. Currently this doesn't work for cpu. The workflow is something like: ``` ep = torch.export.export(model, example_inputs) package = torch._inductor.aoti_compile_and_package(ep, inductor_configs=inductor_configs) compiled = torch._inductor.aoti_load_package(package) print(compiled.get_constant_fqns()) # see what are the fqns needed/available compiled.load_constants(new_state_dict, check_full_update=True) # update the constants in AOTI ``` You can also use the `aot_inductor.package_constants_in_so` config to stop including the constants in the so: ``` package = torch._inductor.aoti_compile_and_package(ep, inductor_configs={`aot_inductor.package_constants_in_so`: False) compiled = torch._inductor.aoti_load_package(package) compiled(*inputs) # segfaults because there are no constants --> we should probably have a better error msg compiled.load_constants(new_state_dict, check_full_update=True) compiled(*inputs) ``` Test Plan: `buck2 run @//mode/dev-nosan //caffe2/test/inductor:aot_inductor_package -- -r "test_so_without_weight" ` Differential Revision: D66796206 Pull Request resolved: #142246 Approved by: https://github.com/henrylhtsang, https://github.com/desertfire

Summary: Add an option in config to not include weights in .so Test Plan: `test/inductor:test_aot_inductor -- -r test_so_without_weight_cuda` Reviewed By: desertfire Differential Revision: D65968885 Pull Request resolved: #141997 Approved by: https://github.com/desertfire

Summary: With the changes in #140755 and #141997, I added a load_constants function to the packaging API. Currently this doesn't work for cpu. The workflow is something like: ``` ep = torch.export.export(model, example_inputs) package = torch._inductor.aoti_compile_and_package(ep, inductor_configs=inductor_configs) compiled = torch._inductor.aoti_load_package(package) print(compiled.get_constant_fqns()) # see what are the fqns needed/available compiled.load_constants(new_state_dict, check_full_update=True) # update the constants in AOTI ``` You can also use the `aot_inductor.package_constants_in_so` config to stop including the constants in the so: ``` package = torch._inductor.aoti_compile_and_package(ep, inductor_configs={`aot_inductor.package_constants_in_so`: False) compiled = torch._inductor.aoti_load_package(package) compiled(*inputs) # segfaults because there are no constants --> we should probably have a better error msg compiled.load_constants(new_state_dict, check_full_update=True) compiled(*inputs) ``` Test Plan: `buck2 run @//mode/dev-nosan //caffe2/test/inductor:aot_inductor_package -- -r "test_so_without_weight" ` Differential Revision: D66796206 Pull Request resolved: #142246 Approved by: https://github.com/henrylhtsang, https://github.com/desertfire

pytorch-bot bot added ciflow/inductor module: inductor labels Dec 3, 2024

facebook-github-bot added the fb-exported label Dec 3, 2024

muchulee8 added the release notes: inductor label Dec 3, 2024

muchulee8 force-pushed the export-D65968885 branch from e3542c1 to 9dec2a8 Compare December 3, 2024 22:41

muchulee8 force-pushed the export-D65968885 branch from 9dec2a8 to eb78637 Compare December 3, 2024 22:50

[AOTInductor] Option to not include weight in .so (pytorch#141997)

31e6b93

Summary: Add an option in config to not include weights in .so Test Plan: `test/inductor:test_aot_inductor -- -r test_so_without_weight_cuda` Reviewed By: desertfire Differential Revision: D65968885

muchulee8 force-pushed the export-D65968885 branch from eb78637 to 31e6b93 Compare December 4, 2024 05:49

desertfire approved these changes Dec 4, 2024

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 4, 2024

pytorchmergebot added the merging label Dec 5, 2024

pytorchmergebot added the Merged label Dec 5, 2024

pytorchmergebot closed this in b08bc07 Dec 5, 2024

pytorchmergebot removed the merging label Dec 5, 2024

angelayi mentioned this pull request Dec 6, 2024

[aoti] Add load_constants to package api #142246

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AOTInductor] Option to not include weight in .so #141997

[AOTInductor] Option to not include weight in .so #141997

Uh oh!

muchulee8 commented Dec 3, 2024 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Dec 3, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 4, 2024

Uh oh!

facebook-github-bot commented Dec 5, 2024

Uh oh!

pytorchmergebot commented Dec 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[AOTInductor] Option to not include weight in .so #141997

[AOTInductor] Option to not include weight in .so #141997

Uh oh!

Conversation

muchulee8 commented Dec 3, 2024 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Dec 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141997

✅ You can merge normally! (3 Unrelated Failures)

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 3, 2024

Uh oh!

facebook-github-bot commented Dec 4, 2024

Uh oh!

facebook-github-bot commented Dec 5, 2024

Uh oh!

pytorchmergebot commented Dec 5, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

muchulee8 commented Dec 3, 2024 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Dec 3, 2024 •

edited

Loading