[MPS] Add upsample_bicubic2d as Metal op by malfet · Pull Request #136123 · pytorch/pytorch · GitHub

Conversation

@malfet
Contributor

@malfet malfet commented Sep 16, 2024

More or less literal copy-n-paste of

`__global__ void upsample_bicubic2d_out_frame` (https://github.com/pytorch/pytorch/blob/c33b0580e6a702be0cd5be691b3b465da012aa34/aten/src/ATen/native/cuda/UpSampleBicubic2d.cu#L24)

and

`__global__ void upsample_bicubic2d_backward_out_frame` (https://github.com/pytorch/pytorch/blob/c33b0580e6a702be0cd5be691b3b465da012aa34/aten/src/ATen/native/cuda/UpSampleBicubic2d.cu#L99)

The missing `uint8` implementation mimics CUDA behavior.
Initial version coded live in https://www.youtube.com/watch?v=shi6Kb5xxvk
Later refinements:

  • Switched from 2D dispatch to 1D (to match CUDA behavior)
  • Added batch + channel loops
  • Fixed scale computation to match align corners behavior
  • Added backward implementation

The backward implementation again mimics CUDA, so it has a precision issue for `torch.half`, as well as a somewhat slow simulation of atomic adds using an atomic compare-and-exchange over the pair of adjacent values, i.e.

```metal
template <typename T>
static inline void atomic_add_helper(
    device atomic<int>* data,
    long offset,
    float value) {
  auto ptr = data + (offset >> 1);
  auto old = atomic_load_explicit(ptr, memory_order_relaxed);
  union {
    int i;
    T t[2];
  } val;
  do {
    val.i = old;
    val.t[offset & 1] += static_cast<T>(value);
  } while (!atomic_compare_exchange_weak_explicit(
      ptr, &old, val.i, memory_order_relaxed, memory_order_relaxed));
}
```

Bump the base Metal language version to 3.0, as it is supported on macOS 13 and is the first version that has `atomic_float`.

@malfet malfet requested a review from kulinseth as a code owner September 16, 2024 00:27
@pytorch-bot

pytorch-bot bot commented Sep 16, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/136123

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 13b2f66 with merge base 08dba25:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/mps Run MPS tests (subset of trunk) release notes: mps Release notes category labels Sep 16, 2024
@github-actions
Contributor

Attention! native_functions.yaml was changed

If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs, one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.


Caused by:

@malfet
Contributor Author

malfet commented Sep 16, 2024

It now has unexpected successes, which is a good sign; the next step is to fix it for the `uint8` dtype.

Collaborator

@albanD albanD left a comment


Integration sounds good, only small nit on edge case.
I didn't double check the convolution algorithm but I guess that CI is enough to validate it?

@malfet malfet force-pushed the malfet/mps-add-bicubic-sample-2d branch from 530eefe to 9f0cd01 Compare September 24, 2024 15:59
@malfet
Contributor Author

malfet commented Sep 24, 2024

@pytorchbot merge -f "Lint + MPS tests are green"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (`-f`) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use `-f` as a last resort and instead consider `-i`/`--ignore-current` to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

BoyuanFeng pushed a commit to BoyuanFeng/pytorch that referenced this pull request Sep 25, 2024
Pull Request resolved: pytorch#136123
Approved by: https://github.com/albanD
@github-actions github-actions bot deleted the malfet/mps-add-bicubic-sample-2d branch October 25, 2024 02:10

Labels

ciflow/mps (Run MPS tests, subset of trunk) · Merged · release notes: mps (Release notes category) · topic: improvements (topic category)


3 participants