[MPS] Add shifted_chebyshev_polynomial_[tuvw]
#157488
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157488
Note: links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 5 Pending as of commit 41a44e7 with merge base 0f9c1b3.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Attention! native_functions.yaml was changed.
If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs: one that adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.
@pytorchbot merge -f "Lint + MPS are green" |
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
This reverts commit 9620994. Reverted #157488 on behalf of https://github.com/clee2000 because it caused the slow test config to time out. [GH job link](https://github.com/pytorch/pytorch/actions/runs/16037776972/job/45254574100) [HUD commit link](https://hud.pytorch.org/pytorch/pytorch/commit/e124a0d88ca2aa04bfaca2dcabf5de6244048e45) ([comment](#157464 (comment)))
@pytorchbot merge -f "Let's land this one first than" |
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
They might have been slow on CUDA-11.3, but that version of CUDA is long gone. The more fundamental underlying issue is the linear complexity of the recursive polynomial definitions for higher-order polynomials; see, for example, this loop from the implementation of the Chebyshev polynomial of the first kind:

https://github.com/pytorch/pytorch/blob/7081b8233a64c350c64e9f00c9b9d00e52020241/aten/src/ATen/native/Math.h#L2969-L2973

which was exercised by `test_compare_cpu` using the following values (as sample index 16):

https://github.com/pytorch/pytorch/blob/7081b8233a64c350c64e9f00c9b9d00e52020241/torch/testing/_internal/opinfo/core.py#L2079

Luckily, Chebyshev polynomials evaluated at inputs with absolute value greater than 1 reach infinity pretty quickly, see below:

```
python3 -c "import torch;print(torch.special.chebyshev_polynomial_v(torch.nextafter(torch.tensor(1.0), torch.tensor(2.0)), torch.tensor(1e6)))"
tensor(nan)
```

This is not the case for Laguerre polynomials, but it's probably fine to just limit the order there to 1e7.

Before:

```
$ PYTORCH_TEST_WITH_SLOW=1 python test_ops.py -k chebyshev_polynomial_
ssssssss..ssssss..ssssss..ssssssssssssssssssssss..ssssss/home/ubuntu/py3.10-nightly/lib/python3.10/site-packages/torch/backends/cuda/__init__.py:131: UserWarning: This API is going to be deprecated, please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /pytorch/aten/src/ATen/Context.cpp:78.)
  return torch._C._get_cublas_allow_tf32()
....ssssssssssss..ssssss..ssssss............ssssssssssssssssssssssssssssssssssss..ssssssssssssss..ssssss..ssssssssssssssssssssssssssssss..ssssss....ssssssssssss..ssssss..ssssss............ssssssssssssssssssssssssssssssssssss..ssssss..ssssssssssssss..ssssss..ssssss..ssssssssssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssss..ssssssssssssss
----------------------------------------------------------------------
Ran 432 tests in 8.575s

OK (skipped=344)
```

After:

```
$ PYTORCH_TEST_WITH_SLOW=1 python test_ops.py -k chebyshev_polynomial_
ssssssss........................ssssssssssssssss......../home/ubuntu/pytorch/torch/backends/cuda/__init__.py:131: UserWarning: This API is going to be deprecated, please see https://pytorch.org/docs/main/notes/cuda.html#tensorfloat-32-tf32-on-ampere-and-later-devices (Triggered internally at /home/ubuntu/pytorch/aten/src/ATen/Context.cpp:78.)
  return torch._C._get_cublas_allow_tf32()
........................................................................................xxxxxxxx................ssssssssssssssssssssssss........................................................................................................ssssssss........................ssssssss........................................................................................ssssssss
----------------------------------------------------------------------
Ran 432 tests in 45.580s

OK (skipped=72, expected failures=8)
```

Fixes #79528

Pull Request resolved: #157464
Approved by: https://github.com/Skylion007, https://github.com/dcci

ghstack dependencies: #157488
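To make the complexity point above concrete, here is a minimal Python sketch (a hypothetical re-implementation for illustration, not the actual kernel) of the three-term recurrence linked above; the loop body runs n-1 times, so evaluation cost grows linearly with the order n, which is what made sample orders around 1e6 so slow:

```python
import torch

def chebyshev_t_recurrence(x: torch.Tensor, n: int) -> torch.Tensor:
    # Three-term recurrence, mirroring the loop in aten/src/ATen/native/Math.h:
    #   T_0(x) = 1, T_1(x) = x, T_{k+1}(x) = 2*x*T_k(x) - T_{k-1}(x)
    # Cost is O(n): the loop below executes n - 1 times.
    if n == 0:
        return torch.ones_like(x)
    p, q = torch.ones_like(x), x
    for _ in range(n - 1):
        p, q = q, 2 * x * q - p
    return q

# Sanity check against the shipped op for a small order:
xs = torch.linspace(-1, 1, 5)
assert torch.allclose(chebyshev_t_recurrence(xs, 7),
                      torch.special.chebyshev_polynomial_t(xs, 7))
```

For |x| > 1, T_n(x) = cosh(n * arccosh(x)) grows exponentially in n, so intermediate terms of the recurrence overflow to inf and a subsequent step produces inf - inf = nan, which is the `tensor(nan)` shown above.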
Stack from ghstack (oldest at bottom):
- [MPS] Add shifted_chebyshev_polynomial_[tuvw] #157488 (this PR)

For eager and inductor.
As for all other Chebyshev ops, the logic is simply compiled from `aten/src/ATen/native/cuda/Math.cuh` (line 2821 at commit 94716db).
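For background (not stated in the PR text itself): the shifted Chebyshev polynomials are the ordinary ones under the affine change of variable x -> 2x - 1, which maps [0, 1] onto [-1, 1], i.e. T*_n(x) = T_n(2x - 1), and likewise for the U, V, and W kinds. A quick illustrative check against the existing ops, as a sketch rather than part of this PR:

```python
import torch

x = torch.linspace(0, 1, 9)
n = 5

# T*_n(x) = T_n(2x - 1): shifted vs. plain Chebyshev polynomial of the first kind.
shifted = torch.special.shifted_chebyshev_polynomial_t(x, n)
plain = torch.special.chebyshev_polynomial_t(2 * x - 1, n)
assert torch.allclose(shifted, plain)

# With this PR, the same call is expected to also run on the MPS backend, e.g.:
# torch.special.shifted_chebyshev_polynomial_t(x.to("mps"), n)
```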
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov