Add support for tracing vmap in pre-dispatch export #154650

tugsbayasgalan · 2025-05-29T19:11:54Z

Summary: ONNX team and recent transformer upgrade ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops.

The implementation strategy is:

We add python wrappers around vmap APIs so that we attach custom torch function handler that is only on during non-strict export. The reason is we don't want to add this to default torch_function handler because it will break BC.
Some dynamo changes to make sure it picks up new python wrapper APIs. The reason is when we do strict export, we need to re-materialize these APIs in pre-dispatch IR from torch IR. We can avoid this by special casing in dynamo for export to proxy different API calls but i feel that is too much chaos because you need to be able to proxy 2 different variants of same vmap API.

Test Plan: CI

Differential Revision: D75623875

cc @ezyang @SherlockNoMad @EikanWang @jgong5 @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @jiayisunx @chenyang78 @kadeng @chauhang @amjames @Lucaskabela

pytorch-bot · 2025-05-29T19:11:59Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154650

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8459179 with merge base 5ee464d ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-05-29T19:12:02Z

This pull request was exported from Phabricator. Differential Revision: D75623875

torch/fx/experimental/proxy_tensor.py

torch/fx/passes/shape_prop.py

torch/fx/experimental/symbolic_shapes.py

torch/export/_trace.py

torch/_functorch/apis.py

torch/_dynamo/variables/builder.py

zou3519

Our conclusion from the meeting on Tuesday was:

Yes, we're going to put all the API calls into the graph
We should only interpose on these API calls when non-strict export is on. These shouldn't go through regular torch_function, because they are private APIs.
It might be easier to do this by creating a python function wrapper around e.g. add_batch_dim, and then putting the torch_function handler and export checks into said function. This may require you to add these new functions to Dynamo skiplists to not break the Dynamo side of things

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-07-23T06:14:08Z

This pull request was exported from Phabricator. Differential Revision: D75623875

facebook-github-bot · 2025-07-23T08:19:54Z

This pull request was exported from Phabricator. Differential Revision: D75623875

Summary: Pull Request resolved: pytorch#154650 ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-08-20T01:56:56Z

This pull request was exported from Phabricator. Differential Revision: D75623875

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-08-20T02:34:50Z

This pull request was exported from Phabricator. Differential Revision: D75623875

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-08-20T02:35:14Z

This pull request was exported from Phabricator. Differential Revision: D75623875

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-08-20T03:06:56Z

This pull request was exported from Phabricator. Differential Revision: D75623875

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Differential Revision: D75623875

facebook-github-bot · 2025-08-20T14:22:47Z

This pull request was exported from Phabricator. Differential Revision: D75623875

torch/_export/verifier.py

zou3519

give me some docs about what is going on in proxy_tensor.py

Summary: ONNX team ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. Test Plan: CI Reviewed By: zou3519 Differential Revision: D75623875

facebook-github-bot · 2025-08-20T14:58:43Z

This pull request was exported from Phabricator. Differential Revision: D75623875

facebook-github-bot · 2025-08-20T19:23:40Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2025-08-20T19:25:28Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: ONNX team and recent transformer upgrade ran into this error and we also ran into during our export benchmarking. This diff makes it possible to trace through vmap implementation in pre-dispatch IR. Note that we don't support serializing functorch ops in pre-dispatch IR and in the future, we should desugar them to post-grad ops. The implementation strategy is: 1. We add python wrappers around vmap APIs so that we attach custom torch function handler that is only on during non-strict export. The reason is we don't want to add this to default torch_function handler because it will break BC. 2. Some dynamo changes to make sure it picks up new python wrapper APIs. The reason is when we do strict export, we need to re-materialize these APIs in pre-dispatch IR from torch IR. We can avoid this by special casing in dynamo for export to proxy different API calls but i feel that is too much chaos because you need to be able to proxy 2 different variants of same vmap API. Test Plan: CI Differential Revision: D75623875 Pull Request resolved: pytorch#154650 Approved by: https://github.com/ezyang, https://github.com/zou3519

tugsbayasgalan requested review from angelayi, avikchaudhuri, bobrenjc93, laithsakka, ydwu4 and zhxchen17 as code owners May 29, 2025 19:11

pytorch-bot bot added ciflow/inductor module: dynamo release notes: export labels May 29, 2025

facebook-github-bot added the fx label May 29, 2025

facebook-github-bot added the fb-exported label May 29, 2025

tugsbayasgalan requested review from bdhirsh and zou3519 May 29, 2025 19:14

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/fx/experimental/proxy_tensor.py Outdated Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/fx/experimental/proxy_tensor.py Outdated Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/fx/passes/shape_prop.py Outdated Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/fx/experimental/symbolic_shapes.py Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/export/_trace.py Outdated Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/_functorch/apis.py Outdated Show resolved Hide resolved

bdhirsh reviewed May 29, 2025

View reviewed changes

torch/_dynamo/variables/builder.py Outdated Show resolved Hide resolved

zou3519 reviewed Jun 5, 2025

View reviewed changes

angelayi mentioned this pull request Jul 9, 2025

Regression in llama2 model export #157323

Open

tugsbayasgalan force-pushed the export-D75623875 branch from 7472d71 to bd6401b Compare July 23, 2025 06:13

tugsbayasgalan force-pushed the export-D75623875 branch from bd6401b to c0a8978 Compare July 23, 2025 08:20

tugsbayasgalan requested a review from zou3519 August 19, 2025 19:04

tugsbayasgalan force-pushed the export-D75623875 branch from 71f718a to 11f079b Compare August 20, 2025 01:56

tugsbayasgalan force-pushed the export-D75623875 branch from 11f079b to b327868 Compare August 20, 2025 02:33

tugsbayasgalan force-pushed the export-D75623875 branch from b327868 to 6592b95 Compare August 20, 2025 02:35

tugsbayasgalan force-pushed the export-D75623875 branch from 6592b95 to d4b6021 Compare August 20, 2025 03:06

tugsbayasgalan force-pushed the export-D75623875 branch from d4b6021 to 65fb23a Compare August 20, 2025 14:21

zou3519 reviewed Aug 20, 2025

View reviewed changes

torch/_export/verifier.py Outdated Show resolved Hide resolved

zou3519 approved these changes Aug 20, 2025

View reviewed changes

tugsbayasgalan force-pushed the export-D75623875 branch from 65fb23a to 8459179 Compare August 20, 2025 14:57

pytorchmergebot added the merging label Aug 20, 2025

pytorchmergebot closed this in dbef606 Aug 20, 2025

pytorchmergebot added Merged and removed merging labels Aug 20, 2025

pianpwk mentioned this pull request Sep 19, 2025

[torch.export] AssertionError: assert isinstance(a, FakeTensor) in _free_unbacked_symbols_with_path when exporting model that uses torch.func.jvp + functional_call + dict params #163051

Open

Add support for tracing vmap in pre-dispatch export #154650

Add support for tracing vmap in pre-dispatch export #154650

Uh oh!

Conversation

tugsbayasgalan commented May 29, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154650

✅ No Failures

Uh oh!

facebook-github-bot commented May 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zou3519 left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jul 23, 2025

Uh oh!

facebook-github-bot commented Jul 23, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

Uh oh!

zou3519 left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

facebook-github-bot commented Aug 20, 2025

Uh oh!

pytorchmergebot commented Aug 20, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

tugsbayasgalan commented May 29, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented May 29, 2025 •

edited

Loading