[export] runtime asserts for while HOP subgraphs #158467
Conversation
This pull request was exported from Phabricator. Differential Revision: D78431075
Summary: For #158366, disables unbacked memoization across HOP subgraphs and adds runtime assertions, so asserts stay in their respective graphs.
Differential Revision: D78431075
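For context, here is a minimal sketch (not code from this PR) of the kind of program the change targets: a while_loop whose body calls .item(), allocating an unbacked SymInt whose guarding asserts should stay inside the subgraph. The module, shapes, and values are illustrative assumptions.

```python
# Illustrative sketch only; the module, shapes, and values are assumptions,
# not code from this PR. Uses torch._higher_order_ops.while_loop's
# (cond_fn, body_fn, carried_inputs) calling convention.
import torch
from torch._higher_order_ops import while_loop


class M(torch.nn.Module):
    def forward(self, x, n):
        def cond_fn(x, n):
            return n > 0  # 0-dim bool tensor

        def body_fn(x, n):
            # n.item() allocates an unbacked SymInt during export; the runtime
            # asserts guarding it should live in this subgraph, not leak into
            # (or be dropped from) the parent graph.
            return x + n.item(), n - 1

        return while_loop(cond_fn, body_fn, (x, n))


ep = torch.export.export(M(), (torch.randn(3), torch.tensor(4)))
ep.graph_module.print_readable()  # asserts should appear inside each subgraph
```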
Force-pushed ffd32f5 to 9fc62c3
Summary: For #158366
- Calls runtime asserts pass for HOP subgraphs (in reenter_make_fx)
- For while_loop only (can be expanded), disables unbacked memoization (item, nonzero, unique), to keep runtime asserts in their respective graphs
Test Plan: test_export
Differential Revision: D78431075
Force-pushed 9fc62c3 to 3a691ce
Force-pushed 3a691ce to e0af870
Force-pushed e0af870 to e621d3b
torch/_dynamo/config.py
Outdated
      _custom_ops_profile: Optional[Any] = None

    + # Disable memoization for data-dependent ops like item(), nonzero(), unique()
    + disable_unbacked_memo = False
this probably shouldn't be a dynamo config? Since what we're trying to do is more dynamic shape/hop related.
torch/_subclasses/fake_impls.py
Outdated
      @register_op_impl(torch.ops.aten._local_scalar_dense.default)
      def local_scalar_dense(fake_mode, func, arg):
    -     if (r := arg.item_memo) is not None:
    +     if (
is the item memo associated with the fake_tensor?
Disabling item_memo may not be what we want, e.g. users might be doing something like:

    def body_fn(c):
        for _ in range(10):
            c_val = c.item()
            ...

In this case, we don't want to create an unbacked symint every iteration since they're essentially the same .item().
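The memo being discussed can be observed directly under a ShapeEnv-backed fake mode; a hedged sketch follows (item_memo is an internal FakeTensor detail, so exact behavior may vary across versions):

```python
# Hedged sketch of .item() memoization under fake tensors; relies on internal
# behavior (FakeTensor's item_memo) that may differ across PyTorch versions.
import torch
from torch._subclasses.fake_tensor import FakeTensorMode
from torch.fx.experimental.symbolic_shapes import ShapeEnv

with FakeTensorMode(shape_env=ShapeEnv()) as mode:
    t = mode.from_tensor(torch.tensor(3))
    u0 = t.item()  # allocates a fresh unbacked SymInt, memoized on t
    u1 = t.item()  # memo hit: the same symbol comes back, no new unbacked SymInt
    print(u0, u1)  # both print the same symbol
```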
Summary: For #158366
- Calls runtime asserts pass for HOP subgraphs (in reenter_make_fx)
- For while_loop only (can be expanded), clones input tensors for subgraph tracing, so unbacked memos (item, nonzero, etc.) aren't reused
Test Plan: test_export
Differential Revision: D78431075
Force-pushed e621d3b to c3e82fc
Force-pushed c3e82fc to 351e725
Force-pushed 351e725 to 180ae10
Force-pushed 180ae10 to 6094c4c
A follow-up is to make all control flow operators (cond, scan, associative_scan, map) clone the inputs before speculate_subgraph.
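A hedged sketch of what that follow-up could look like; clone_carries is a hypothetical helper, not code from this PR:

```python
import torch


# Hypothetical helper, not from this PR: give each subgraph its own tensor
# clones so unbacked memos (item_memo, nonzero_memo, ...) recorded while
# tracing one subgraph aren't reused when tracing another.
def clone_carries(carries):
    return tuple(c.clone() if isinstance(c, torch.Tensor) else c for c in carries)


operands = (torch.randn(3), torch.tensor(4))
cond_operands = clone_carries(operands)  # fresh tensors, so fresh (empty) memos
body_operands = clone_carries(operands)
```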
    -    new_operands_seq = [
    -        unspecialize_carried_inputs(tx, carry) for carry in operands_seq
    +    # clone inputs across subgraphs, to avoid unbacked memoization in fake prop
    +    cond_operands_seq = [
this should be put under with discard_graph_changes(tx):
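Concretely, the suggestion reads as something like the fragment below, reusing the local names from the diff above (tx, operands_seq, unspecialize_carried_inputs); this is a placement sketch, not tested code:

```python
# Placement sketch only, using the diff's local names; assumes
# discard_graph_changes(tx) is the dynamo context manager that keeps this
# setup code from leaving side effects in the traced-out graph.
with discard_graph_changes(tx):
    cond_operands_seq = [
        unspecialize_carried_inputs(tx, carry) for carry in operands_seq
    ]
```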
        )
        return reenter_make_fx(fn)(*cloned_carried_inputs, *additional_inputs)

    cond_graph = produce_graph(cond_fn)
put under with disable_proxy_modes_tracing():
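For reference, here is a self-contained demo of what disable_proxy_modes_tracing does under make_fx: ops executed inside the context run on the underlying tensors but are not recorded as graph nodes, which is why the isolation clones (or the graph production, per the literal suggestion) can be placed under it. The function f is an illustrative assumption, not PR code.

```python
import torch
from torch.fx.experimental.proxy_tensor import disable_proxy_modes_tracing, make_fx


def f(x):
    y = x + 1  # recorded in the traced graph
    with disable_proxy_modes_tracing():
        # runs on the underlying tensor, but is NOT recorded as a node
        _scratch = x.clone()
    return y * 2


gm = make_fx(f)(torch.randn(3))
print(gm.graph)  # contains add and mul, but no clone node
```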
Summary: For #158366
- Calls runtime asserts pass for HOP subgraphs (in reenter_make_fx)
- For while_loop only (can be expanded), clones input tensors for subgraph tracing, so unbacked memos (item, nonzero, etc.) aren't reused
Test Plan: test_export
Reviewed By: ydwu4
Differential Revision: D78431075
Force-pushed 6094c4c to 88751f4
@pytorchbot merge (initiating merge automatically since the Phabricator diff has merged)
Merge started: your change will be merged once all checks pass (ETA 0-4 hours).
Differential Revision: D78431075
For #158366
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames @Lucaskabela @zou3519 @ydwu4