-
Notifications
You must be signed in to change notification settings - Fork 25.7k
Turn on compile with NVSHMEM #154538
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Turn on compile with NVSHMEM #154538
Conversation
[ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154538
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit a9095a4 with merge base 241f8dc ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
We'll need https://pypi.nvidia.com/nvidia-nvshmem-cu12/ added to the release builds at some point if they aren't already |
|
Thanks @Skylion007 . Thinking the same. Do you know how I can do that? |
|
At the very least, you need to add it to here:
|
|
I added the extra dependency in #154568. |
Before: `USE_NVSHMEM=1` need to be explicit set in build environment. After: `USE_NVSHMEM=1` is the default for CUDA/Rocm on Linux. [ghstack-poisoned]
|
If we want binaries to have nvshmem, we need to make sure it's installed and discoverable for binary builds? |
Before: `USE_NVSHMEM=1` need to be explicit set in build environment. After: `USE_NVSHMEM=1` is the default for CUDA/Rocm on Linux. [ghstack-poisoned]
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Before: `USE_NVSHMEM=1` need to be explicit set in build environment. After: `USE_NVSHMEM=1` is the default for CUDA/Rocm on Linux. Pull Request resolved: pytorch#154538 Approved by: https://github.com/ngimel
NVSHMEM 3.2.5 (released Mar 2025) have both cu11 and cu12 builds. See: https://pypi.nvidia.com/nvidia-nvshmem-cu12/ https://pypi.nvidia.com/nvidia-nvshmem-cu11/ Pull Request resolved: #154568 Approved by: https://github.com/atalman ghstack dependencies: #154538
Before: `USE_NVSHMEM=1` need to be explicit set in build environment. After: `USE_NVSHMEM=1` is the default for CUDA/Rocm on Linux. Pull Request resolved: pytorch#154538 Approved by: https://github.com/ngimel
NVSHMEM 3.2.5 (released Mar 2025) have both cu11 and cu12 builds. See: https://pypi.nvidia.com/nvidia-nvshmem-cu12/ https://pypi.nvidia.com/nvidia-nvshmem-cu11/ Pull Request resolved: pytorch#154568 Approved by: https://github.com/atalman ghstack dependencies: pytorch#154538
This reverts commit 3685b10.
Stack from ghstack (oldest at bottom):
Before:
USE_NVSHMEM=1need to be explicit set in build environment.After:
USE_NVSHMEM=1is the default for CUDA/Rocm on Linux.