Store user model to simplify ONNXProgram.{adapt_torch_*,__call__} APIs #115281
Conversation
Currently (after #114407), the user must pass the original ``model`` to APIs such as ``ONNXProgram.__call__``, ``ONNXProgram.adapt_torch_inputs_to_onnx``, and ``ONNXProgram.adapt_torch_outputs_to_onnx``. This is needed because when the model is fakefied, a non-fakefied version of the model is required so that the initializers, buffers, and constants can be extracted (and used as input to the ONNX model). That approach places an unnecessary burden on the user when the model is not fakefied, because the model already passed to ``torch.onnx.dynamo_export`` could be used to extract the ``state_dict``. This PR adds ``ONNXProgram._model_torch`` to store the user model and demotes the ``model`` argument of the aforementioned APIs to optional. As a result, in the fakefied scenario the user can still pass the required real model, while for non-fakefied models the persisted model is used implicitly to extract the model ``state_dict``. [ghstack-poisoned]
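A minimal sketch of the usability difference this PR targets. The export calls are shown commented out because they require the ONNX exporter dependencies (e.g. ``onnxscript``) to be installed; the ``TinyModel`` class and the keyword name ``model_with_state_dict`` are illustrative, with the latter taken from the diff below.

```python
import torch

class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(4, 2)

    def forward(self, x):
        return self.fc(x)

model = TinyModel()
x = torch.randn(1, 4)

# onnx_program = torch.onnx.dynamo_export(model, x)
#
# Before this PR: the original model had to be passed back explicitly,
# even though it was already given to dynamo_export:
#   onnx_inputs = onnx_program.adapt_torch_inputs_to_onnx(x, model_with_state_dict=model)
#
# After this PR: for a non-fakefied model, the stored model is used implicitly:
#   onnx_inputs = onnx_program.adapt_torch_inputs_to_onnx(x)
#   outputs = onnx_program(x)

print(model(x).shape)  # the eager model still runs normally
```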
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/115281
Note: links to docs will display an error until the docs builds have completed. ✅ No failures as of commit 3e8612b with merge base 441ecf0. This comment was automatically generated by Dr. CI and updates every 15 minutes.
self,
*model_args,
model: Optional[
model_with_state_dict: Optional[
nit: for my understanding, do we need only the state_dict, or the whole model?
We need the model, because the lifted "constant tensors" are not part of ``ExportedProgram.state_dict`` (yet?).
I will investigate further, discuss with Meta, and check whether I can make that change to ``ExportedProgram.state_dict``, simplifying the requirement to just the ``state_dict`` (which would be more ideal).
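To illustrate why the full model (not just the ``state_dict``) can be needed: in plain ``torch``, a tensor stored as an ordinary attribute, analogous to a lifted constant tensor, does not appear in ``state_dict()``, while parameters and registered buffers do. This is a standalone illustration, not the exporter's actual extraction code.

```python
import torch

class WithConstant(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(3, 3)               # parameters: tracked in state_dict
        self.register_buffer("scale", torch.ones(3))  # buffer: tracked in state_dict
        self.const = torch.tensor([1.0, 2.0, 3.0])    # plain attribute: NOT tracked

    def forward(self, x):
        return self.fc(x) * self.scale + self.const

m = WithConstant()
keys = set(m.state_dict().keys())
print(sorted(keys))
# Parameters and the buffer appear, but the plain tensor attribute does not,
# so recovering it requires access to the module object itself.
assert "scale" in keys and "const" not in keys
```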
_fake_context: Final[Optional[ONNXFakeContext]]
_export_exception: Final[Optional[Exception]]
_model_signature: Final[Optional[torch.export.ExportGraphSignature]]
_model_torch: Final[
mark as experimental?
Tracked by #115461
🚢 to unblock CI and experiments. Let's keep exploring and revisit #115461
@pytorchbot merge
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…orch#114762) Fixed by pytorch#113982 Pull Request resolved: pytorch#114762 Approved by: https://github.com/BowenBao ghstack dependencies: pytorch#114407, pytorch#115281
…ytorch#115353) Pull Request resolved: pytorch#115353 Approved by: https://github.com/BowenBao ghstack dependencies: pytorch#114407, pytorch#115281, pytorch#114762
(#115281) (#115583) Pull Request resolved: #115281 Approved by: https://github.com/BowenBao ghstack dependencies: #114407
Stack from ghstack (oldest at bottom):
Currently (after #114407), the user must pass the original user ``model`` to APIs such as ``ONNXProgram.__call__``, ``ONNXProgram.adapt_torch_inputs_to_onnx``, and ``ONNXProgram.adapt_torch_outputs_to_onnx``.
This was needed because when the model is fakefied, a non-fakefied version of the model is required so that the initializers, buffers, and constants can be extracted from a real model (and used as input to the ONNX model).
That approach places an unnecessary usability burden on the user when the model is not fakefied, because the model already passed to ``torch.onnx.dynamo_export`` could be used to extract the ``state_dict``.
This PR adds the ``ONNXProgram._model_torch`` attribute to store the user model and demotes the ``model`` argument of the aforementioned APIs to optional (as opposed to required).
As a result, in the fakefied scenario the user still needs to pass the real model, but for non-fakefied models the persisted model is used implicitly to extract the model ``state_dict``, making the APIs easier to use.
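For the fakefied path described above, export happens under fake mode, so the exported program holds no real weights and a real model must still be supplied when adapting inputs. A hedged sketch of that flow, with the export calls commented out since they require the exporter dependencies (e.g. ``onnxscript``); ``TinyModel`` is a hypothetical stand-in:

```python
import torch

class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(4, 2)

    def forward(self, x):
        return self.fc(x)

# Sketch of the fake-mode flow (commented out; requires the ONNX exporter deps):
# with torch.onnx.enable_fake_mode() as fake_context:
#     fake_model = TinyModel()          # weights are fake tensors, no real storage
#     fake_x = torch.randn(1, 4)
#     export_options = torch.onnx.ExportOptions(fake_context=fake_context)
#     onnx_program = torch.onnx.dynamo_export(fake_model, fake_x,
#                                             export_options=export_options)
#
# real_model = TinyModel()              # a real, materialized model
# real_x = torch.randn(1, 4)
# Even after this PR, the fakefied case still needs the real model explicitly:
#   onnx_inputs = onnx_program.adapt_torch_inputs_to_onnx(
#       real_x, model_with_state_dict=real_model)

print(TinyModel()(torch.randn(1, 4)).shape)  # the real model runs eagerly
```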