Implement unfold_backward on MPS by malfet · Pull Request #135411 · pytorch/pytorch · GitHub

Conversation

@malfet
Contributor

@malfet malfet commented Sep 7, 2024

This PR adds a native implementation of `unfold_backward` as a Metal shader, mostly a copy-and-paste of the algorithm used in the CUDA and CPU implementations. That is, considering `out = in.unfold(dim, size, step)`, the following holds true (a short illustrative example follows these lists):

  • `out.shape[dim] == (in.shape[dim] - size) / step + 1`
  • `out.shape[-1] == size`
  • `out.ndim == in.ndim + 1`

The `unfold_backward` Metal kernel receives `grad_in` and returns `grad_out` such that:

  • `grad_in.shape == out.shape`
  • `grad_out.shape == in.shape`
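As a quick concrete check of these shape relationships (a minimal sketch; the tensor sizes and the `dim`, `size`, `step` values below are arbitrary examples, not taken from the PR or its tests):

```python
import torch

# Fall back to CPU when MPS is unavailable so the snippet stays runnable anywhere.
device = "mps" if torch.backends.mps.is_available() else "cpu"
dim, size, step = 1, 3, 2
inp = torch.arange(24.0, device=device).reshape(2, 12)

out = inp.unfold(dim, size, step)  # shape: (2, 5, 3)
assert out.shape[dim] == (inp.shape[dim] - size) // step + 1
assert out.shape[-1] == size
assert out.ndim == inp.ndim + 1
```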

For each index in `grad_out`, find the elements of `grad_in` that contribute to it and sum them up. Such an algorithm requires no synchronization between threads.
That is, `grad_out[..., out_dim_idx, ...]` accumulates all values `grad_in[..., in_dim_idx, ..., in_last_idx]`, where `in_dim_idx` is in the range [`(out_dim_idx - size) / step`, `out_dim_idx / step`] clamped to (0, `in_dim_size`), and `in_last_idx` equals `out_dim_idx - in_dim_idx * step`. The accumulation step is skipped if `in_last_idx` is outside of the [0, size] range.
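A rough pure-Python sketch of that accumulation may help make the indexing concrete. This is only a readability-oriented reference for the semantics described above, not the Metal kernel itself; the function name `unfold_backward_ref`, the `movedim` reshuffling, and the assumption of a non-negative `dim` are illustrative choices:

```python
import torch

def unfold_backward_ref(grad_in, input_shape, dim, size, step):
    # grad_in.shape == input.unfold(dim, size, step).shape; result has shape input_shape.
    # Assumes a non-negative `dim`.
    grad_out = torch.zeros(input_shape, dtype=grad_in.dtype, device=grad_in.device)
    in_dim_size = grad_in.shape[dim]  # number of unfolded windows
    # Move `dim` to the front purely to keep the indexing below readable.
    g_in = grad_in.movedim(dim, 0)    # (in_dim_size, ..., size)
    g_out = grad_out.movedim(dim, 0)  # (input_shape[dim], ...), a view into grad_out
    for out_dim_idx in range(g_out.shape[0]):
        # Candidate windows that might cover this position, clamped to valid window indices.
        lo = max((out_dim_idx - size) // step, 0)
        hi = min(out_dim_idx // step, in_dim_size - 1)
        for in_dim_idx in range(lo, hi + 1):
            in_last_idx = out_dim_idx - in_dim_idx * step
            if 0 <= in_last_idx < size:  # skip candidates that do not actually cover it
                g_out[out_dim_idx] += g_in[in_dim_idx, ..., in_last_idx]
    return grad_out
```

Comparing the output of such a sketch against the gradient autograd produces through `Tensor.unfold` on CPU is a simple way to sanity-check the index arithmetic.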

This operator has been requested 16 times on #77764

@malfet malfet added the ciflow/mps Run MPS tests (subset of trunk) label Sep 7, 2024
@pytorch-bot

pytorch-bot bot commented Sep 7, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/135411

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

❌ 1 New Failure

As of commit 15a2972 with merge base 7578a0b:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@github-actions
Contributor

github-actions bot commented Sep 7, 2024

Attention! native_functions.yaml was changed

If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs, one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.



@malfet malfet added topic: improvements topic category release notes: mps Release notes category labels Sep 7, 2024
@malfet malfet marked this pull request as draft September 8, 2024 15:24
@github-actions
Contributor

github-actions bot commented Nov 7, 2024

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label Nov 7, 2024
@malfet malfet changed the title Enable unfold_backward on MPS Implement unfold_backward on MPS Nov 12, 2024
@malfet malfet removed the Stale label Nov 12, 2024
@malfet malfet marked this pull request as ready for review November 12, 2024 22:35
Co-authored-by: Manuel Candales <42380156+manuelcandales@users.noreply.github.com>
@malfet
Contributor Author

malfet commented Nov 13, 2024

@pytorchbot merge -f "This was mostly green in the past"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as a last resort and instead consider -i/--ignore-current to continue the merge while ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024

Pull Request resolved: pytorch#135411
Approved by: https://github.com/manuelcandales

Co-authored-by: Manuel Candales <42380156+manuelcandales@users.noreply.github.com>
@malfet malfet deleted the malfet-patch-14 branch December 12, 2024 22:31
