Add almalinux docker for magma and CUDA installation for CUDA 13 by tinglvv · Pull Request #160201 · pytorch/pytorch · GitHub

@tinglvv
Collaborator

@tinglvv tinglvv commented Aug 8, 2025

#159779

CUDA 13.0.0
NVSHMEM 3.3.20
cuSPARSELt 0.8.0.4
cuDNN 9.12.0.46

  1. Add the AlmaLinux Docker image for building magma-cuda 13.0
  2. Add the install_cuda.sh script for CUDA 13.0 (for x86 and sbsa)
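
As a rough illustration of the x86/sbsa split mentioned in item 2, an install script could pick its target from the machine architecture. The sketch below is not taken from this PR's install_cuda.sh; the function name cuda_arch_tag and the tag strings it prints are hypothetical and only show the branching pattern.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the PR): map a machine name, as printed by
# `uname -m`, to an architecture tag. The tag strings are assumptions.
cuda_arch_tag() {
  case "$1" in
    x86_64)  echo "x86_64" ;;
    aarch64) echo "sbsa" ;;
    *)       echo "unsupported architecture: $1" >&2; return 1 ;;
  esac
}

# Usage: report which variant would be selected on the current machine.
if arch_tag="$(cuda_arch_tag "$(uname -m)")"; then
  echo "Would install CUDA 13.0 components for: ${arch_tag}"
fi
```

Keeping the mapping in one small function means an unsupported machine fails early with a clear message, rather than partway through a download.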

cc @atalman @malfet @ptrblck @nWEIdia

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Aug 8, 2025
@pytorch-bot

pytorch-bot bot commented Aug 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160201

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 97 Pending

As of commit 6da4fb1 with merge base 6b994c4 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@johnnynunez
Contributor

johnnynunez commented Aug 9, 2025

cuDNN 9.12, TensorRT 10.13.2, and NCCL 2.27.7 are the minimum for CUDA 13, right?

https://pypi.org/project/nvidia-cudnn-cu13/

@tinglvv tinglvv marked this pull request as ready for review August 11, 2025 22:39
@tinglvv tinglvv requested review from a team and jeffdaily as code owners August 11, 2025 22:39
@tinglvv tinglvv mentioned this pull request Aug 8, 2025
15 tasks
@jerryzh168 jerryzh168 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Aug 13, 2025
@atalman atalman added the ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR label Aug 13, 2025
@atalman
Contributor

atalman commented Aug 13, 2025

@tinglvv we still have a 12.4 build here: https://github.com/pytorch/pytorch/actions/runs/16915397606/job/47927902158. If we are removing the 12.4 artifacts, we must make sure to run the periodic tests so that this job does not regress.

@johnnynunez
Contributor

@tinglvv cuSPARSELt is now available for CUDA 13. You can upgrade it.

@tinglvv
Collaborator Author

tinglvv commented Aug 14, 2025

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased cu13-docker onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout cu13-docker && git pull --rebase)

@atalman atalman self-requested a review August 15, 2025 15:49
@atalman
Contributor

atalman commented Aug 15, 2025

Looks good

@tinglvv
Collaborator Author

tinglvv commented Aug 15, 2025

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 15, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@pytorchmergebot
Collaborator

Merge failed

Reason: New commits were pushed while merging. Please rerun the merge command.

Details for Dev Infra team Raised by workflow job

@JaydenChao101

LGTM 👍

@atalman
Contributor

atalman commented Aug 18, 2025

@pytorchmergebot merge -f "lint and almalinux builds look good"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

@tinglvv tinglvv changed the title from "Add CUDA installation script for CUDA 13" to "Add almalinux docker and CUDA installation script for CUDA 13" Aug 19, 2025
@tinglvv tinglvv changed the title from "Add almalinux docker and CUDA installation script for CUDA 13" to "Add almalinux docker for magma and CUDA installation for CUDA 13" Aug 19, 2025
can-gaa-hou pushed a commit to can-gaa-hou/pytorch that referenced this pull request Aug 22, 2025
Add the almalinux docker for building magma-cuda 13.0
pytorch#159779

Also fixed the NVSHMEM download link

Pull Request resolved: pytorch#160201
Approved by: https://github.com/atalman

Co-authored-by: Andrey Talman <atalman@fb.com>
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
Labels

ciflow/inductor, ciflow/periodic, ciflow/trunk, Merged, open source, topic: not user facing, triaged

7 participants