fix layer_norm decomp precision for cpu #140557

bdhirsh · 2024-11-13T15:41:38Z

xref: https://fb.workplace.com/groups/1075192433118967/posts/1540519826586223/?comment_id=1543752356262970&reply_comment_id=1544425069529032

the issue is that our decomp needs to branch on device (it only upcasts for cpu), but the device shows up as "meta" because it is registered as a meta tensor rule.

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2024-11-13T15:41:43Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140557

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[DomainsOnly] Jobs fail with GLIBC version not found

✅ You can merge normally! (2 Unrelated Failures)

As of commit f17cb34 with merge base e6c5a77 ():

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 4, 5, lf.linux.g5.4xlarge.nvidia.gpu) (gh) (trunk failure)
functorch/test_control_flow.py::AssociativeScanTests::test_associative_scan_binary_operator_compile_mode_compile_dynamic_shape_combine_mode_pointwise_reverse_False_cuda
trunk / linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 5, 5, lf.linux.g5.4xlarge.nvidia.gpu) (gh) (trunk failure)
functorch/test_control_flow.py::AssociativeScanTests::test_associative_scan_binary_operator_compile_mode_compile_dynamic_shape_combine_mode_pointwise_reverse_False_cuda

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 7911d85 Pull Request resolved: #140557

arui-meta · 2024-11-18T21:39:39Z

Gentle ping: could we land this by itself? I have a usecase that depends on it. Thanks!

bdhirsh · 2024-11-18T21:48:19Z

@pytorchbot merge

pytorchmergebot · 2024-11-18T21:50:10Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

xref: https://fb.workplace.com/groups/1075192433118967/posts/1540519826586223/?comment_id=1543752356262970&reply_comment_id=1544425069529032 the issue is that our decomp needs to branch on device (it only upcasts for cpu), but the device shows up as "meta" because it is registered as a meta tensor rule. Pull Request resolved: pytorch#140557 Approved by: https://github.com/ezyang

fix layer_norm decomp precision for cpu

f17cb34

[ghstack-poisoned]

bdhirsh added a commit that referenced this pull request Nov 13, 2024

fix layer_norm decomp precision for cpu

201927d

ghstack-source-id: 7911d85 Pull Request resolved: #140557

github-actions bot requested review from SherlockNoMad, albanD, antoniojkim, ezyang and miladm November 13, 2024 15:41

bdhirsh added the release notes: composability release notes category label Nov 13, 2024

ezyang approved these changes Nov 14, 2024

View reviewed changes

bdhirsh mentioned this pull request Nov 14, 2024

avoid specializing strides with DDPOptimizer + inductor #140751

Closed

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 18, 2024

pytorchmergebot added the merging label Nov 18, 2024

pytorchmergebot added the Merged label Nov 19, 2024

pytorchmergebot closed this in 9ae19ff Nov 19, 2024

pytorchmergebot removed the merging label Nov 19, 2024

github-actions bot deleted the gh/bdhirsh/624/head branch December 20, 2024 02:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix layer_norm decomp precision for cpu #140557

fix layer_norm decomp precision for cpu #140557

Uh oh!

bdhirsh commented Nov 13, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 13, 2024 •

edited

Loading

Uh oh!

arui-meta commented Nov 18, 2024

Uh oh!

bdhirsh commented Nov 18, 2024

Uh oh!

pytorchmergebot commented Nov 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix layer_norm decomp precision for cpu #140557

fix layer_norm decomp precision for cpu #140557

Uh oh!

Conversation

bdhirsh commented Nov 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140557

❗ 1 Active SEVs

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

arui-meta commented Nov 18, 2024

Uh oh!

bdhirsh commented Nov 18, 2024

Uh oh!

pytorchmergebot commented Nov 18, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bdhirsh commented Nov 13, 2024 •

edited

Loading

pytorch-bot bot commented Nov 13, 2024 •

edited

Loading