KEMBAR78
[CD] Add CUDA 13.0 Windows build by tinglvv · Pull Request #161663 · pytorch/pytorch · GitHub
Skip to content

Conversation

@tinglvv
Copy link
Collaborator

@tinglvv tinglvv commented Aug 27, 2025

#159779

CUDA 13.0.0 for Windows Build
CUDA 12.9 still needs the WAR for OOM #156181

cc @ptrblck @nWEIdia @atalman @malfet

@tinglvv tinglvv requested review from a team, eqy and syed-ahmed as code owners August 27, 2025 21:58
@pytorch-bot pytorch-bot bot added the release notes: releng release notes category label Aug 27, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Aug 27, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161663

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 6 New Failures, 1 Cancelled Job, 2 Unrelated Failures

As of commit 16f0823 with merge base 0e45023 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@tinglvv tinglvv changed the title Add CUDA 13.0 Windows build [CD] Add CUDA 13.0 Windows build Aug 27, 2025
@tinglvv tinglvv added the ciflow/binaries Trigger all binary build and upload jobs on the PR label Aug 27, 2025
@tinglvv
Copy link
Collaborator Author

tinglvv commented Aug 27, 2025

Build failure https://github.com/pytorch/pytorch/actions/runs/17279798745/job/49045542640?pr=161663, could be because AMI is still failing.

2025-08-27T22:19:25.1085821Z  if not exist "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v13.0\bin\nvcc.exe" (
2025-08-27T22:19:25.1086268Z echo CUDA 13.0 installed failed.  

@tinglvv tinglvv mentioned this pull request Aug 29, 2025
15 tasks
@tinglvv
Copy link
Collaborator Author

tinglvv commented Aug 29, 2025

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased cu130-win-build onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout cu130-win-build && git pull --rebase)

@tinglvv
Copy link
Collaborator Author

tinglvv commented Aug 29, 2025

Build failure with driver installation https://github.com/pytorch/pytorch/actions/runs/17330479439/job/49205072771

Installing GPU driver DLLs

7-Zip 25.01 (x64) : Copyright (c) 1999-2025 Igor Pavlov : 2025-08-03

Scanning the drive for archives:
1 file, 5127591 bytes (5008 KiB)

Extracting archive: C:\actions-runner\_work\pytorch\pytorch\pytorch\.ci\pytorch\windows\internal\\..\temp_build\gpu_driver_dlls.zip
--
Path = C:\actions-runner\_work\pytorch\pytorch\pytorch\.ci\pytorch\windows\internal\\..\temp_build\gpu_driver_dlls.zip
Type = zip
Physical Size = 5127591


Would you like to replace the existing file:
  Path:     C:\Windows\System32\nvcuda.dll
  Size:     3482736 bytes (3402 KiB)
  Modified: 2025-06-12 19:30:46
with the file from archive:
  Path:     nvcuda.dll
  Size:     17459864 bytes (17 MiB)
  Modified: 2019-11-14 22:47:12


Break signaled
? (Y)es / (N)o / (A)lways / (S)kip all / A(u)to rename all / (Q)uit? 
Cleaning temp files

Do we need to upload the Windows driver for CUDA 13.0, since CTK doesn't come with driver now.

** Starting with CUDA 13.0, the Windows display driver is no longer bundled with the CUDA Toolkit package. Users must download and install the appropriate NVIDIA driver separately from the official driver download page.
from https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html

Can use 580.88 https://www.nvidia.com/en-us/drivers/details/251576/?

@tinglvv
Copy link
Collaborator Author

tinglvv commented Aug 29, 2025

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased cu130-win-build onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout cu130-win-build && git pull --rebase)

@atalman
Copy link
Contributor

atalman commented Sep 1, 2025

@pytorchmergebot merge -f "all required workflows are passing"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@tinglvv tinglvv self-assigned this Sep 2, 2025
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/binaries Trigger all binary build and upload jobs on the PR Merged open source release notes: releng release notes category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants