[MPS] Add API to query GPU core count #160414

malfet · 2025-08-12T13:39:27Z

Stack from ghstack (oldest at bottom):

-> [MPS] Add API to query GPU core count #160414

Use good old IOKit to read the gpu-core-count property from the device implementing the AGXAccelerator service.
Expose it as torch.backends.mps.get_core_count() and make it accessible to the inductor via MpsInterface.
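A minimal sketch of calling the new API defensively from user code. The helper name `mps_core_count` is hypothetical, and the guards reflect assumptions: the PR does not state what `get_core_count()` does when MPS is unavailable, and older PyTorch builds will not have the function at all.

```python
from typing import Optional


def mps_core_count() -> Optional[int]:
    """Return the GPU core count reported by the MPS backend, or None.

    Hypothetical helper: returns None when PyTorch is missing, MPS is
    unavailable, or the installed build predates get_core_count().
    """
    try:
        import torch
    except ImportError:
        return None

    mps = torch.backends.mps
    if not mps.is_available():
        return None

    # get_core_count() is the API added by this PR; older builds lack it.
    getter = getattr(mps, "get_core_count", None)
    if getter is None:
        return None
    return int(getter())


if __name__ == "__main__":
    print(mps_core_count())
```

On a machine without an MPS device the helper returns None rather than raising, which keeps the call safe in cross-platform code.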

Test Plan: Run python3 -c "import torch;print(torch.backends.mps.get_name(), torch.backends.mps.get_core_count())" and compare the reported core count with the output of system_profiler SPDisplaysDataType|head -n10

% python3 -c "import torch;print(torch.backends.mps.get_name(), torch.backends.mps.get_core_count())"
Apple M1 Pro 16
% system_profiler SPDisplaysDataType|head -n10
Graphics/Displays:

    Apple M1 Pro:

      Chipset Model: Apple M1 Pro
      Type: GPU
      Bus: Built-In
      Total Number of Cores: 16
      Vendor: Apple (0x106b)
      Metal Support: Metal 3
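The manual comparison above could be automated along these lines. This is a sketch, not part of the PR: the function names are hypothetical, and it assumes the "Total Number of Cores" line appears in the format shown in the sample output.

```python
import re
import shutil
import subprocess
from typing import Optional

# Matches the "Total Number of Cores: 16" line from system_profiler.
_CORES_RE = re.compile(r"Total Number of Cores:\s*(\d+)")


def parse_core_count(profiler_output: str) -> Optional[int]:
    """Extract the first core count from SPDisplaysDataType output."""
    match = _CORES_RE.search(profiler_output)
    return int(match.group(1)) if match else None


def system_profiler_core_count() -> Optional[int]:
    """Run system_profiler (macOS only); return None if it is missing."""
    if shutil.which("system_profiler") is None:
        return None
    result = subprocess.run(
        ["system_profiler", "SPDisplaysDataType"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_core_count(result.stdout)


# Sample taken from the test plan output in this PR.
SAMPLE = """Graphics/Displays:

    Apple M1 Pro:

      Chipset Model: Apple M1 Pro
      Type: GPU
      Bus: Built-In
      Total Number of Cores: 16
      Vendor: Apple (0x106b)
      Metal Support: Metal 3
"""

print(parse_core_count(SAMPLE))  # prints 16
```

On a Mac, the value returned by `system_profiler_core_count()` can then be compared with `torch.backends.mps.get_core_count()`.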

Knowing the core count would let the inductor significantly improve occupancy for torch.compile-generated kernels.
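The PR does not describe how the inductor will use the core count. Purely as an illustration of why the number matters for occupancy, the sketch below caps the number of threadgroups at a small multiple of the core count. The function name, the `groups_per_core` parameter, and the heuristic itself are assumptions, not the inductor's actual logic.

```python
def pick_threadgroup_count(
    num_items: int,
    threads_per_group: int,
    core_count: int,
    groups_per_core: int = 4,
) -> int:
    """Hypothetical launch-size heuristic based on the GPU core count.

    Launch enough threadgroups to cover the work, but no more than
    groups_per_core per GPU core, so each group loops over several items
    instead of oversubscribing the device.
    """
    if min(num_items, threads_per_group, core_count, groups_per_core) <= 0:
        raise ValueError("all arguments must be positive")

    # Ceiling division: threadgroups needed if each thread handles one item.
    needed = (num_items + threads_per_group - 1) // threads_per_group
    return min(needed, core_count * groups_per_core)


# Large problem on a 16-core GPU: capped at 16 * 4 threadgroups.
print(pick_threadgroup_count(1_000_000, 256, 16))  # prints 64
# Small problem: only as many threadgroups as the work requires.
print(pick_threadgroup_count(1_000, 256, 16))  # prints 4
```

Without the core count, a code generator has to guess the cap, which tends to either underfill large GPUs or oversubscribe small ones.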

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela

pytorch-bot · 2025-08-12T13:39:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160414

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 15 Pending

As of commit 518bda6 with merge base fc80f68:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

malfet · 2025-08-14T00:03:30Z

@pytorchbot merge -f "Lint + MPS are green"

pytorchmergebot · 2025-08-14T00:05:03Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

Using good old IOKit to get `gpu-core-count` property from device implementing `AGXAccelerator` service
Expose this one as `torch.backends.mps.get_core_count()` and make it accessible via `MpsInterface` to the inductor

Test Plan: Run `python3 -c "import torch;print(torch.backends.mps.get_name(), torch.backends.mps.get_core_count())"` and compare it to `system_profiler SPDisplaysDataType|head -n10`
```
% python3 -c "import torch;print(torch.backends.mps.get_name(), torch.backends.mps.get_core_count())"
Apple M1 Pro 16
% system_profiler SPDisplaysDataType|head -n10
Graphics/Displays:

    Apple M1 Pro:

      Chipset Model: Apple M1 Pro
      Type: GPU
      Bus: Built-In
      Total Number of Cores: 16
      Vendor: Apple (0x106b)
      Metal Support: Metal 3
```

This would significantly improve occupancy for torch.compile generated kernels

Pull Request resolved: #160414
Approved by: https://github.com/dcci

Update

967268a

[ghstack-poisoned]

malfet requested a review from kulinseth as a code owner August 12, 2025 13:39

pytorch-bot bot added the ciflow/mps and release notes: mps labels Aug 12, 2025

Update

3083b03

[ghstack-poisoned]

malfet added a commit that referenced this pull request Aug 12, 2025

[MPS] Add avility to query GPU count

9e96f67

ghstack-source-id: a76b9db Pull Request resolved: #160414

Update

96d560b

[ghstack-poisoned]

malfet added a commit that referenced this pull request Aug 12, 2025

[MPS] Add avility to query GPU count

74456ad

ghstack-source-id: 54b8aec Pull Request resolved: #160414

pytorch-bot bot added the ciflow/inductor, module: dynamo, and module: inductor labels Aug 12, 2025

malfet changed the title ~~[MPS] Add avility to query GPU count~~ [MPS] Add ability to query GPU count Aug 12, 2025

malfet added the topic: improvements label Aug 12, 2025

malfet requested a review from dcci August 12, 2025 22:21

dcci approved these changes Aug 12, 2025

malfet added 3 commits August 13, 2025 14:04

Update

89bb488

[ghstack-poisoned]

Update

4ff1fa9

[ghstack-poisoned]

Update

518bda6

[ghstack-poisoned]

malfet changed the title ~~[MPS] Add ability to query GPU count~~ [MPS] Add API to query GPU core count Aug 14, 2025

pytorchmergebot added the merging label Aug 14, 2025

pytorchmergebot closed this in a06ec54 Aug 14, 2025

pytorchmergebot added Merged and removed merging labels Aug 14, 2025

github-actions bot deleted the gh/malfet/483/head branch September 13, 2025 02:05
