-
Notifications
You must be signed in to change notification settings - Fork 25.7k
[AOTI][refactor] Rename embed_cubin to embed_kernel_binary #154412
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Summary: Rename as it is not CUDA specific. [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154412
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 Cancelled JobAs of commit e64fdad with merge base ef6306e ( CANCELLED JOB - The following job was cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Summary: Rename as it is not CUDA specific. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Starting merge as part of PR stack under #154414 |
3 similar comments
Starting merge as part of PR stack under #154414 |
Starting merge as part of PR stack under #154414 |
Starting merge as part of PR stack under #154414 |
Summary: CUDA can support multi-arch with the fatbin format. Add this multi_arch_kernel_binary option, so the compiled model binary can run across different GPU archs. Differential Revision: [D75452094](https://our.internmc.facebook.com/intern/diff/D75452094) Pull Request resolved: #154413 Approved by: https://github.com/angelayi ghstack dependencies: #154412
Summary: Add support of multi_arch_kernel_binary in the package_cpp_only mode. More specifically, generate specific cmake targets to compile .ptx to .fatbin and embed them in the final shared library or binary. Differential Revision: [D75452096](https://our.internmc.facebook.com/intern/diff/D75452096) Pull Request resolved: #154414 Approved by: https://github.com/angelayi ghstack dependencies: #154412, #154413
Summary: Add support of multi_arch_kernel_binary in the package_cpp_only mode. More specifically, generate specific cmake targets to compile .ptx to .fatbin and embed them in the final shared library or binary. Differential Revision: [D75452096](https://our.internmc.facebook.com/intern/diff/D75452096) Pull Request resolved: #154414 Approved by: https://github.com/angelayi ghstack dependencies: #154412, #154413 ghstack-source-id: 55d13e3
Stack from ghstack (oldest at bottom):
Summary: Rename as it is not CUDA specific.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov
Differential Revision: D75452095