[MPS] Speedup torch.full for 1-byte types #158874
Conversation
CI: artifacts and rendered test results at hud.pytorch.org/pr/158874. As of commit ce64e3f with merge base ddd74d1: mergeable, 1 unrelated failure (one job marked unstable, possibly due to flakiness on trunk).
By using [`fillBuffer:range:value:`](https://developer.apple.com/documentation/metal/mtlblitcommandencoder/fillbuffer:range:value:?language=objc) rather than an MPSGraph op, which should be faster and also does not have the INT_MAX limit.
ghstack-source-id: dd69275
Pull Request resolved: #158874
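To illustrate the operation this PR accelerates: `torch.full` with a 1-byte dtype (`torch.uint8`, `torch.int8`, `torch.bool`) is the case that can be lowered to a single blit fill, since `fillBuffer:range:value:` writes one repeated byte. A minimal sketch, shown on CPU so it runs anywhere; on Apple silicon you could pass `device="mps"` instead:

```python
import torch

# torch.full with a 1-byte dtype -- the case sped up on MPS by filling
# the buffer with a Metal blit instead of an MPSGraph op.
t = torch.full((4, 4), 7, dtype=torch.uint8)
print(t.dtype, t.element_size())  # torch.uint8 1
print(bool((t == 7).all()))       # True
```

Because every element is a single byte, the fill value can be replicated byte-by-byte across the whole buffer, which is exactly the contract of the blit encoder's fill.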
test/test_indexing.py (outdated)

```python
def test_index_put_accumulate_large_tensor(self, device):
    if device.startswith("mps"):
        raise unittest.SkipTest("Crash with max number of dimentions")
    # if device.startswith("mps"):
```
can we just remove it instead of commenting?
That's the plan, but I strongly suspect I'll have to leave the skip in place for macOS 13, where 4 GB tensors are a big taboo
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Though testing is a lie and dependent on #153835. Fixes #153789.
Pull Request resolved: #158888
Approved by: https://github.com/albanD
ghstack dependencies: #158874
Stack from ghstack (oldest at bottom):

By using `fillBuffer:range:value:` rather than an MPSGraph op, which should be faster and also does not have the INT_MAX limit. Which in turn fixes the `test_index_put_accumulate_large_tensor_mps` test.
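For context on the test being un-skipped: `index_put_` with `accumulate=True` sums contributions at duplicate indices. The real test uses a tensor large enough to exceed the old INT_MAX fill limit; a tiny tensor demonstrates the same semantics:

```python
import torch

# Minimal illustration of the accumulate semantics exercised by
# test_index_put_accumulate_large_tensor (the actual test uses a tensor
# whose element count exceeds INT_MAX; the behavior is identical).
t = torch.zeros(5)
idx = torch.tensor([1, 1, 3])  # index 1 appears twice
t.index_put_((idx,), torch.ones(3), accumulate=True)
print(t.tolist())  # [0.0, 2.0, 0.0, 1.0, 0.0] -- duplicates accumulate
```

With `accumulate=False`, the duplicate writes would instead race and only one value would land at index 1, which is why the accumulate path needs its own test coverage.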