[AOTI][refactor] Rename embed_cubin to embed_kernel_binary #154412

desertfire · 2025-05-27T13:30:16Z

Stack from ghstack (oldest at bottom):

Summary: Rename as it is not CUDA specific.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

Differential Revision: D75452095

pytorch-bot · 2025-05-27T13:30:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154412

❌ 1 Cancelled Job

As of commit e64fdad with merge base ef6306e:

CANCELLED JOB - The following job was cancelled. Please retry:

inductor-rocm / rocm-py3.10-inductor / test (inductor, 2, 2, linux.rocm.gpu.2) (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

desertfire · 2025-05-27T14:38:04Z

@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

desertfire · 2025-05-27T17:36:37Z

@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

pytorchmergebot · 2025-05-27T18:18:57Z

Starting merge as part of PR stack under #154414

pytorchmergebot · 2025-05-28T00:38:23Z

Starting merge as part of PR stack under #154414

pytorchmergebot · 2025-05-28T00:41:50Z

Starting merge as part of PR stack under #154414

pytorchmergebot · 2025-05-28T01:20:01Z

Starting merge as part of PR stack under #154414

Summary: CUDA can support multi-arch with the fatbin format. Add this multi_arch_kernel_binary option, so the compiled model binary can run across different GPU archs.

Differential Revision: [D75452094](https://our.internmc.facebook.com/intern/diff/D75452094)
Pull Request resolved: #154413
Approved by: https://github.com/angelayi
ghstack dependencies: #154412

Summary: Add support of multi_arch_kernel_binary in the package_cpp_only mode. More specifically, generate specific cmake targets to compile .ptx to .fatbin and embed them in the final shared library or binary.

Differential Revision: [D75452096](https://our.internmc.facebook.com/intern/diff/D75452096)
Pull Request resolved: #154414
Approved by: https://github.com/angelayi
ghstack dependencies: #154412, #154413
ghstack-source-id: 55d13e3
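To make the .ptx to .fatbin step above more concrete, here is a minimal sketch of how such a compile command could be assembled. It is not the cmake logic from #154414: the helper name `fatbin_command`, the file names, and the architecture list are hypothetical, and it only assumes the standard `nvcc` options `-fatbin` and `-gencode`. The function builds the argument list and does not invoke `nvcc`.

```python
from typing import List, Sequence


def fatbin_command(ptx_path: str, fatbin_path: str, archs: Sequence[int]) -> List[str]:
    """Build an nvcc argument list that packs one PTX file into a fatbin.

    Hypothetical helper for illustration only; not taken from the PR.
    """
    if not archs:
        raise ValueError("at least one GPU architecture is required")
    cmd = ["nvcc", "-fatbin", ptx_path, "-o", fatbin_path]
    for arch in sorted(set(archs)):
        # One -gencode entry per target architecture (assumed list, e.g. 80, 86, 90).
        cmd += ["-gencode", f"arch=compute_{arch},code=sm_{arch}"]
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so no CUDA toolkit is needed.
    print(" ".join(fatbin_command("kernel.ptx", "kernel.fatbin", [80, 86, 90])))
```

Whether each architecture is usable depends on the target the PTX was generated for, so the list passed in would likely need to be filtered in a real build.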

[AOTI][refactor] Rename embed_cubin to embed_kernel_binary

89a0b62

Summary: Rename as it is not CUDA specific. [ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor release notes: inductor (aoti) labels May 27, 2025

This was referenced May 27, 2025

[AOTI] Add a multi_arch_kernel_binary option #154413

Closed

[AOTI] Support multi-arch when using package_cpp_only #154414

Closed

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 27, 2025

angelayi approved these changes May 27, 2025

pytorchmergebot closed this in 4d8f3d5 May 28, 2025

pytorchmergebot added the Merged label May 28, 2025

github-actions bot deleted the gh/desertfire/577/head branch June 27, 2025 02:22
