[Inductor][CPP] Fix layout for local buf in outer loop fusion #160857

CaoE · 2025-08-18T03:10:54Z

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

pytorch-bot · 2025-08-18T03:10:58Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160857

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit d62fb9f with merge base a4fc051 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

leslie-fang-intel · 2025-08-19T00:18:37Z

torch/_inductor/codegen/cpp.py

                        # Local Buffer is a view of global buffer
+                        local_buffer_stride: list[int] = []
+                        stride = global_buffer_layout.stride[-1]
+                        local_buffer_size = get_call_ranges(scheduler_node)[


is this case scheduler_node also a view of global_buffer?

In this case global_buffer is scheduler_node.node.

then why can't use global_buffer_layout.size[size_offset:] directly?

Because global_buffer_layout is the tensor layout while size_offset is the loop depth. The dimension of global_buffer_layout may not be the same as loop number, e.g, there are merged dims.
In this case, global_buffer_layout size is [5, 1, 32, 32] but the callrange is [5, 1024]. If use global_buffer_layout.size[size_offset:] to create local_buffer_layout we get the size [32] but we need [1024].

leslie-fang-intel

LGTM

jansel · 2025-08-20T00:25:57Z

torch/_inductor/codegen/cpp.py

                            continue
                        # Local Buffer is a view of global buffer
+                        local_buffer_stride: list[int] = []
+                        stride = global_buffer_layout.stride[-1]


Will this work for a size = [] tensor?

global_buffer is an instance of ir.ComputedBuffer. We haven't encountered a size = [] tensor yet. Is there a possible case for this? @leslie-fang-intel Could you please help on this question ?

CaoE · 2025-08-21T05:52:44Z

@pytorchbot merge

pytorchmergebot · 2025-08-21T05:54:27Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…h#160857) Fixes pytorch#159154 Pull Request resolved: pytorch#160857 Approved by: https://github.com/leslie-fang-intel, https://github.com/jansel

pytorch-bot bot added ciflow/inductor module: inductor labels Aug 18, 2025

CaoE added topic: not user facing topic category ciflow/trunk Trigger trunk jobs on your pull request labels Aug 18, 2025

CaoE changed the title ~~[Inductor][CPP] Fix layout for local buf of outer loop fusion~~ [Inductor][CPP] Fix layout for local buf in outer loop fusion Aug 18, 2025

pytorchbot added the open source label Aug 18, 2025

fix layout for local buf of outer loop fusion

d62fb9f

CaoE force-pushed the fix_local branch from 47ad284 to d62fb9f Compare August 18, 2025 05:26

CaoE requested review from jgong5 and leslie-fang-intel and removed request for jgong5 August 18, 2025 14:03

leslie-fang-intel reviewed Aug 19, 2025

View reviewed changes

leslie-fang-intel approved these changes Aug 19, 2025

View reviewed changes

CaoE marked this pull request as ready for review August 19, 2025 02:48

CaoE requested a review from jansel August 19, 2025 02:48

jansel approved these changes Aug 20, 2025

View reviewed changes

pytorchmergebot added the merging label Aug 21, 2025

pytorchmergebot closed this in 23b0334 Aug 21, 2025

pytorchmergebot added Merged and removed merging labels Aug 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Inductor][CPP] Fix layout for local buf in outer loop fusion #160857

[Inductor][CPP] Fix layout for local buf in outer loop fusion #160857

CaoE commented Aug 18, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Aug 18, 2025 •

edited

Loading

Uh oh!

leslie-fang-intel Aug 19, 2025

Uh oh!

CaoE Aug 19, 2025

Uh oh!

leslie-fang-intel Aug 19, 2025

Uh oh!

CaoE Aug 20, 2025

Uh oh!

leslie-fang-intel left a comment

Uh oh!

jansel Aug 20, 2025

Uh oh!

CaoE Aug 20, 2025

Uh oh!

CaoE commented Aug 21, 2025

Uh oh!

pytorchmergebot commented Aug 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Inductor][CPP] Fix layout for local buf in outer loop fusion #160857

[Inductor][CPP] Fix layout for local buf in outer loop fusion #160857

Conversation

CaoE commented Aug 18, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160857

✅ No Failures

Uh oh!

leslie-fang-intel Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

leslie-fang-intel Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

leslie-fang-intel left a comment

Choose a reason for hiding this comment

Uh oh!

jansel Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE commented Aug 21, 2025

Uh oh!

pytorchmergebot commented Aug 21, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

CaoE commented Aug 18, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Aug 18, 2025 •

edited

Loading