[ONNX] Support None in fx.args as torchlib inputs by titaiwangms · Pull Request #108708 · pytorch/pytorch · GitHub

Conversation

@titaiwangms (Collaborator) commented Sep 6, 2023

Stack from ghstack (oldest at bottom):

Prior to this PR, if None is returned from an intermediate node, it crashes the export: None is not expected to be passed into `_fill_tensor_shape_type`, so beartype raises an error. That function fills in the shape and type of a TorchScriptTensor according to the node's info from the FX graph.

This was discovered after microsoft/onnxscript#1043 was supported. The op specifically generates None in one of its inputs, but the only output from it that is consumed is the first one (not None).
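In effect, the fix is to skip None entries when filling in shape/type info. A minimal sketch of the idea (the name, signature, and loop here are illustrative only, not the exporter's exact code):

```python
from typing import Optional, Sequence

import torch


def _fill_tensor_shape_type_sketch(
    onnx_values: Sequence[object],
    expected_values: Sequence[Optional[torch.Tensor]],
) -> None:
    # Illustrative only: propagate shape/dtype from FX node info onto the
    # corresponding ONNX (TorchScriptTensor) values.
    for onnx_value, expected_value in zip(onnx_values, expected_values):
        if expected_value is None:
            # There is no shape/type to propagate from None (e.g. an unused
            # output of a multi-output op), so skip it instead of letting
            # beartype reject the unexpected None.
            continue
        # ... set onnx_value's shape and dtype from expected_value as before ...
```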

Reference test from a TorchBench model:

```python
def test_nanogpt(self):
    import sys

    sys.path.append("/home/titaiwang")

    from nanoGPT.model import GPT, GPTConfig

    # Load the model
    kwargs = {
        "block_size": 256,
        "vocab_size": 8096,  # GPT-2 vocab_size of 50257, padded up to nearest multiple of 64 for efficiency
        "n_layer": 2,
        "n_head": 2,
        "n_embd": 128,
        "dropout": 0.0,
        "bias": False,  # True: bias in Linears and LayerNorms, like GPT-2. False: a bit better and faster
    }
    config = GPTConfig(**kwargs)
    with torch.backends.cuda.sdp_kernel(
        enable_flash=True, enable_mem_efficient=True
    ):
        model = GPT(config)
    print("Done loading model")
    inputs = torch.arange(128).view(2, 64)
    targets = torch.arange(128).view(2, 64)

    self.run_test_with_fx_to_onnx_exporter_and_onnx_runtime(
        model,
        (inputs,),
        input_kwargs={
            "targets": targets,
        },
        verbose=True,
    )
```

@pytorch-bot added the `release notes: onnx` label Sep 6, 2023
@pytorch-bot commented Sep 6, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/108708

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit c961589 with merge base bde75eb:

UNSTABLE - The following jobs failed, but the failures were likely due to flakiness on trunk and have been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

titaiwangms added a commit that referenced this pull request Sep 6, 2023
ghstack-source-id: efca75f
Pull Request resolved: #108708
@titaiwangms marked this pull request as draft September 6, 2023 22:05
@titaiwangms added the `module: onnx` and `topic: improvements` labels Sep 11, 2023
titaiwangms added a commit that referenced this pull request Sep 12, 2023
ghstack-source-id: 60961c6
Pull Request resolved: #108708
@titaiwangms marked this pull request as ready for review September 12, 2023 20:33
@titaiwangms added the `ciflow/trunk` label Sep 12, 2023
@justinchuby self-assigned this Sep 12, 2023
@justinchuby (Collaborator):

> The op specifically generates None in one of its inputs, but the only output from it that is consumed is the first one (not None).

Do you mean an input or an output is None? If an output, which one?

```python
# TODO(titaiwang): set shape?
if isinstance(expected_value, (torch.SymInt, torch.SymFloat, torch.SymBool)):
    ...
if expected_value is None:
    # There is no shape/type from None.
    ...
```
Collaborator:

I would link to the example you showed, so readers know when this will happen.

Collaborator:

Would it be possible for us to assume it is always a scalar? Or could there be other cases?

Collaborator Author:

Hmm, it all depends on what fx.node gives us; this is purely a product of the FX graph. So I feel like what we should do is just take None into consideration in our code.
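For example, a defensive pattern along these lines (just a sketch, not the exporter's actual code; FX tracing commonly records the example value under `node.meta["val"]`, and for multi-output ops elements of it can be None):

```python
import torch


def shape_dtype_or_none(node: torch.fx.Node):
    # Sketch: look up the example value FX tracing attached to the node.
    val = node.meta.get("val")
    if val is None:
        # Nothing to propagate; the caller should skip this entry instead of
        # passing None into shape/type filling (which beartype would reject).
        return None
    if isinstance(val, torch.Tensor):
        return tuple(val.shape), val.dtype
    # SymInt/SymFloat/SymBool and other scalars carry no tensor shape here.
    return None
```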

Collaborator Author:

My guess is that on CPU those outputs are useless or not generated, so it returns None.

titaiwangms added a commit that referenced this pull request Sep 12, 2023
ghstack-source-id: 35b1cb7
Pull Request resolved: #108708
@titaiwangms (Collaborator Author):

@pytorchbot merge

@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@facebook-github-bot deleted the gh/titaiwangms/46/head branch September 16, 2023 14:23