Add aot_export_joint_with_descriptors and aot_compile_joint_with_descriptors #158715
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158715
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 Unrelated Failure) As of commit 345d04e with merge base 85ee2fb.
UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
| Some descriptors can be quite exotic, so we recommend thinking carefully
| if there is a safe fallback you can apply to descriptors you don't understand.
| For example, you should have some way to handle not finding a particular
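A minimal sketch of the "safe fallback" pattern this snippet recommends. The descriptor class names appear elsewhere in this PR's diff, but the import path and attribute names below are assumptions for illustration, not the PR's confirmed API surface:

```python
# Sketch of the "safe fallback" guidance quoted above. Import path and
# attribute names are assumptions; only the class names come from the diff.
from torch._functorch._aot_autograd.descriptors import (  # path assumed
    BufferAOTInput,
    ParamAOTInput,
    PlainAOTInput,
)

def classify(desc):
    if isinstance(desc, ParamAOTInput):
        return ("param", desc.target)   # attribute name assumed
    if isinstance(desc, BufferAOTInput):
        return ("buffer", desc.target)  # attribute name assumed
    if isinstance(desc, PlainAOTInput):
        return ("plain", desc.idx)      # attribute name assumed
    # Safe fallback: don't try to interpret exotic descriptors you don't
    # understand; treat them opaquely instead of guessing.
    return ("unknown", desc)
```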
this would be for cases like desugaring a tensor subclass input into the user fn, and potentially flattening it into one or more tensor and POD inputs into the final graph?
Actually, tensor subclasses are extra super special, and probably will not work without more use-case understanding! See #159005
Approving to unblock; the overall structure LGTM.
It would be good to address @wconstab's comment on the missing comment before merging, though.
Comment problem was addressed!
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
This is looking good. I had one question about the tree_spec issue on PlainAOTInput, but I'm not sure if it's worth delaying the PR for.
| args,
| fw_compiler: AOTDispatchCompiler,
| bw_compiler: AOTDispatchCompiler,
| kwargs,
seems like it would be better to name this 'flatten_kwargs' and put it next to the flatten bool?
It's not a "flat" kwargs though, it's a dict. IMO, it should go with args because you pass the args along with the kwargs.
Sorry, I totally misinterpreted this as being kwargs that somehow influence the tree_flatten; I didn't realize they were user kwargs. It makes sense that you would only support user kwargs if you are flattening, and otherwise assert that the inputs are args-only.
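To make that rule concrete, here is a small sketch using torch.utils._pytree; the helper name normalize_inputs is hypothetical, not part of this PR:

```python
import torch.utils._pytree as pytree

def normalize_inputs(args, kwargs, flatten: bool):
    # Hypothetical helper illustrating the rule discussed above: kwargs are
    # only meaningful when the inputs get pytree-flattened; otherwise the
    # caller is expected to pass positional args only.
    if flatten:
        flat_args, in_spec = pytree.tree_flatten((args, kwargs))
        return flat_args, in_spec
    assert not kwargs, "kwargs are only supported with flatten=True"
    return list(args), None
```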
torch/_functorch/aot_autograd.py
Outdated
| full_args = [*params_flat, *buffers_flat, *args]
| in_spec, out_spec = None, None
| if flatten:
|     functional_call, out_spec = create_tree_flattened_fn(functional_call, full_args, kwargs)
Hmm, does the new functional_call expect the flattened full_args? If so, I wonder if create_tree_flattened_fn should return full_args and in_spec as well.
Yes. I think that would not be unreasonable, but this is a preexisting function and the convention seems to be to just handle the flattening manually outside.
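For context, a self-contained sketch of that "handle the flattening manually outside" convention; wrap_flat below is a stand-in for create_tree_flattened_fn (an assumption about its contract, not its real signature):

```python
import torch
import torch.utils._pytree as pytree

def wrap_flat(fn, args, kwargs):
    # Stand-in for create_tree_flattened_fn: returns a wrapper that takes the
    # flattened arguments and unflattens them before calling the original fn.
    _, in_spec = pytree.tree_flatten((args, kwargs))

    def flat_fn(*flat_args):
        a, kw = pytree.tree_unflatten(list(flat_args), in_spec)
        return fn(*a, **kw)

    return flat_fn

def f(x, y, scale=1.0):
    return (x + y) * scale

full_args = [torch.randn(3), torch.randn(3)]
kwargs = {"scale": 2.0}

flat_fn = wrap_flat(f, full_args, kwargs)
# The flattening is handled manually outside: the caller flattens again to
# produce the flat inputs that the wrapped function now expects.
flat_args, in_spec = pytree.tree_flatten((full_args, kwargs))
out = flat_fn(*flat_args)
```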
torch/_functorch/aot_autograd.py
Outdated
| full_args_descs.extend(ParamAOTInput(fqn) for fqn in params_spec)
| full_args_descs.extend(BufferAOTInput(fqn) for fqn in buffers_spec)
| # TODO: it would be better to put pytree information in here
| full_args_descs.extend(PlainAOTInput(i) for i in range(len(full_args) - len(full_args_descs)))
Hmm, do I understand this?

1. 'len(full_args) - len(full_args_descs)': full_args includes params, buffers, and the flattened args, while full_args_descs only includes params and buffers, so this is just 'len(flattened_args)', which you couldn't compute directly because we flattened the whole full_args.
2. We make one PlainAOTInput for each flattened arg.

Help me understand what the point of returning these descriptors is for the flattened case. If I passed a weird input object to my forward and you pytree-flattened it and returned N descriptors, I can't figure out which one corresponds to which of my inputs, right? So is this TODO load-bearing for the flatten case? Maybe it is fine to fix later; I just wanted to be clear.
Yes and yes.
In an ideal world, we would also report pytree paths (the TODO) on the PlainAOTInput as well. That would make it easier to tell how the flattened arguments correspond to the original (non-flattened) arguments. But you can also figure this out manually by flattening your arguments yourself and seeing where they turn up.
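A sketch of that manual workaround using torch.utils._pytree keypaths: flatten your own inputs and record where each leaf lands, under the assumption that the export flattens the same structure in the same order.

```python
import torch
import torch.utils._pytree as pytree

# Flatten our own inputs and print each leaf's keypath next to its flat
# position; assuming the export flattened the same structure in the same
# order, position i lines up with PlainAOTInput(i) in the descriptors.
args = ({"img": torch.randn(2, 3)}, [torch.randn(4), torch.randn(5)])
leaves_with_paths, _spec = pytree.tree_flatten_with_path(args)
for i, (keypath, leaf) in enumerate(leaves_with_paths):
    print(i, pytree.keystr(keypath), tuple(leaf.shape))
```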
| not the intermediate export result.
| TODO: talk carefully about how parameters/buffers work here
| NB: If the passed nn.Module has parameters and buffers on it, we will
So, IIUC, this just means that once we start using the torch.export frontend with this, we'll have to fix this gap, but it will be straightforward to do so.
torch.export puts the parameters/buffers on the module, actually, so it's fine.
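A quick sketch of why that is fine: the module recovered from an ExportedProgram still exposes its parameters and buffers, so a path that lifts params/buffers off the passed nn.Module keeps working. This is illustrative only, not verified against this PR.

```python
import torch
import torch.nn as nn

class M(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(3, 3)
        self.register_buffer("b", torch.zeros(3))

    def forward(self, x):
        return self.linear(x) + self.b

# The module produced by torch.export still carries its parameters and
# buffers as attributes, so lifting them from the module is unaffected.
ep = torch.export.export(M(), (torch.randn(2, 3),))
gm = ep.module()
print(sorted(name for name, _ in gm.named_parameters()))
print(sorted(name for name, _ in gm.named_buffers()))
```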
Merge failed. Reason: 1 mandatory check(s) failed. Dig deeper by viewing the failures on hud.
With some Claude Code assistance I added some UTs too!
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Starting merge as part of PR stack under #159005
…riptors (#158715)

Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: #158715
Approved by: https://github.com/fmassa, https://github.com/wconstab, https://github.com/xmfan
ghstack dependencies: #158624, #158708, #158734
Stack from ghstack (oldest at bottom):
Signed-off-by: Edward Z. Yang ezyang@meta.com