Add warning about removed sm50 and sm60 arches by atalman · Pull Request #158301 · pytorch/pytorch · GitHub

Conversation

@atalman
Contributor

@atalman atalman commented Jul 15, 2025

Related to #157517

Detect when users are running a torch build compiled with CUDA 12.8/12.9 on Maxwell or Pascal architectures. The warning references issue #157517 and asks people to install CUDA 12.6 builds if they are running on sm50 or sm60 architectures.

Test:

```
>>> torch.cuda.get_arch_list()
['sm_70', 'sm_75', 'sm_80', 'sm_86', 'sm_90', 'sm_100', 'sm_120', 'compute_120']
>>> torch.cuda.init()
/home/atalman/.conda/envs/py312/lib/python3.12/site-packages/torch/cuda/__init__.py:263: UserWarning:
    Found <GPU Name> which is of cuda capability 5.0.
    PyTorch no longer supports this GPU because it is too old.
    The minimum cuda capability supported by this library is 7.0.

  warnings.warn(
/home/atalman/.conda/envs/py312/lib/python3.12/site-packages/torch/cuda/__init__.py:268: UserWarning:
                        Support for Maxwell and Pascal architectures is removed for CUDA 12.8+ builds.
                        Please see https://github.com/pytorch/pytorch/issues/157517
                        Please install CUDA 12.6 builds if you require Maxwell or Pascal support.
```
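The shape of the check behind these warnings can be sketched without torch: derive the lowest supported compute capability from the `sm_XY` entries of the compiled arch list and compare it against the device's capability. This is an illustrative sketch, not the actual `torch/cuda/__init__.py` code; `min_supported_capability` is a hypothetical helper name.

```python
def min_supported_capability(arch_list):
    """Return the lowest (major, minor) capability among 'sm_XY' entries.

    'sm_70' -> (7, 0); 'sm_120' -> (12, 0); 'compute_*' (PTX) entries
    are ignored here because they describe JIT targets, not SASS.
    """
    caps = []
    for arch in arch_list:
        if arch.startswith("sm_"):
            num = arch.split("_")[1]
            caps.append((int(num[:-1]), int(num[-1])))
    return min(caps)


arch_list = ['sm_70', 'sm_75', 'sm_80', 'sm_86', 'sm_90',
             'sm_100', 'sm_120', 'compute_120']
device_cap = (5, 0)  # e.g. a Maxwell GPU reporting capability 5.0

min_cap = min_supported_capability(arch_list)
if device_cap < min_cap:  # tuple comparison: (5, 0) < (7, 0)
    print(f"capability {device_cap} below minimum {min_cap}: warn the user")
```

For the arch list in the test output above, the derived minimum is (7, 0), matching the "minimum cuda capability supported by this library is 7.0" line in the warning.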

cc @ptrblck @msaroufim @eqy @jerryzh168 @albanD @malfet

@pytorch-bot

pytorch-bot bot commented Jul 15, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158301

Note: Links to docs will display an error until the docs builds have been completed.

❌ 61 Cancelled Jobs, 1 Unrelated Failure

As of commit 100002a with merge base 0879921:

CANCELLED JOBS - The following jobs were cancelled. Please retry:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@atalman atalman added topic: not user facing topic category module: cuda Related to torch.cuda, and CUDA support in general labels Jul 15, 2025
@atalman atalman requested review from eqy and syed-ahmed as code owners July 15, 2025 13:43
```python
if current_arch < min_arch:
    warnings.warn(
        old_gpu_warn
```
Contributor Author

Looks like incorrect_binary_warn is never used. However, it's probably the more accurate warning.

Collaborator

@nWEIdia nWEIdia left a comment


LGTM. Just had one small suggestion.

@atalman atalman requested review from albanD and malfet July 16, 2025 10:36
@albanD
Collaborator

albanD commented Jul 16, 2025

Wait, these warnings are saying two opposite things lol.
One says the GPU is not supported at all, and the other says you just need to install a different binary.

Can we rationalize these messages to be more aligned with the state of the world:

  • What is the real lowest version supported by ANY binary (this needs to be hardcoded)? Warn based on that.
  • What is the lowest/newest version supported by THIS binary? Warn and suggest an appropriate binary, both down (for an old arch) and up (for a newer arch).
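The two-tier scheme suggested above can be sketched as follows. This is illustrative only, not PyTorch source: `LOWEST_ANY_BINARY` and `warnings_for` are hypothetical names, and the hardcoded floor of (5, 0) is an assumed placeholder.

```python
# Hypothetical hardcoded floor: the lowest capability ANY published
# binary supports (assumption for illustration).
LOWEST_ANY_BINARY = (5, 0)


def warnings_for(device_cap, binary_caps):
    """Return warning messages for a device capability, given the
    (major, minor) capabilities compiled into THIS binary."""
    msgs = []
    if device_cap < LOWEST_ANY_BINARY:
        # No binary anywhere supports this GPU.
        msgs.append("GPU too old for any PyTorch binary")
    elif device_cap < min(binary_caps):
        # Too old for this build, but an older-CUDA build may work.
        msgs.append("GPU too old for this binary; install an older-CUDA build")
    elif device_cap > max(binary_caps):
        # Newer than this build targets; suggest a newer-CUDA build.
        msgs.append("GPU newer than this binary; install a newer-CUDA build")
    return msgs


binary_caps = [(7, 0), (7, 5), (8, 0), (9, 0)]
print(warnings_for((5, 0), binary_caps))
```

With this split, the "too old for any binary" and "just install a different binary" cases can no longer contradict each other for the same device.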

Collaborator

@albanD albanD left a comment


Looks great, thanks!

@atalman
Contributor Author

atalman commented Jul 16, 2025

@pytorchmergebot merge -f "lint is green"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@nWEIdia
Collaborator

nWEIdia commented Jul 16, 2025

Just adding a note that in the future we might want to re-evaluate the "cur_arch > max_arch" case, as there could be scenarios the binary may still support. But "cur_arch < min_arch" is definitely not supported.

E.g., suppose we build sm up to sm80; running on sm86 would still work. The same reasoning applies to sm120.
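The point above can be sketched as a rough compatibility predicate. This is an illustrative sketch of the idea, not PyTorch source; `can_likely_run` is a hypothetical name, and it relies on two CUDA properties: embedded PTX (`compute_XY`) can be JIT-compiled forward for newer devices, and SASS is minor-version compatible within a major (e.g. sm_86 can run code built for sm_80).

```python
def can_likely_run(device_cap, arch_list):
    """Rough check: can a device of (major, minor) capability run a
    binary with this arch list ('sm_XY' = SASS, 'compute_XY' = PTX)?"""
    sm_caps, ptx_caps = [], []
    for arch in arch_list:
        kind, num = arch.split("_")
        cap = (int(num[:-1]), int(num[-1]))
        (sm_caps if kind == "sm" else ptx_caps).append(cap)
    if device_cap in sm_caps:
        return True  # exact SASS match
    if any(device_cap >= p for p in ptx_caps):
        return True  # newer device can JIT-compile the embedded PTX
    # SASS minor-version compatibility within the same major:
    # e.g. (8, 6) can run code compiled for (8, 0)
    return any(s[0] == device_cap[0] and s <= device_cap for s in sm_caps)
```

So a binary built only up to sm80 (with compute_80 PTX) would still pass for an sm86 or newer device, while a capability below min_arch fails outright, which is why "cur_arch > max_arch" deserves softer handling than "cur_arch < min_arch".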

@atalman
Contributor Author

atalman commented Jul 16, 2025

@pytorchbot cherry-pick --onto release/2.8 -c critical

pytorchbot pushed a commit that referenced this pull request Jul 16, 2025

Pull Request resolved: #158301
Approved by: https://github.com/nWEIdia, https://github.com/albanD

(cherry picked from commit fb731fe)
@pytorchbot
Collaborator

Cherry picking #158301

The cherry pick PR is at #158478 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:


atalman added a commit that referenced this pull request Jul 16, 2025
Add warning about removed sm50 and sm60 arches (#158301)


Pull Request resolved: #158301
Approved by: https://github.com/nWEIdia, https://github.com/albanD

(cherry picked from commit fb731fe)

Co-authored-by: atalman <atalman@fb.com>
@facebook-github-bot
Contributor

@pytorchbot revert -m="Diff reverted internally" -c="ghfirst"

This Pull Request has been reverted by a revert inside Meta. To re-land this change, please open another pull request, assign the same reviewers, fix the CI failures that caused the revert, and make sure that the failing CI runs on the PR by applying the proper ciflow label (e.g., ciflow/trunk).

@pytorchmergebot
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Jul 19, 2025
This reverts commit fb731fe.

Reverted #158301 on behalf of https://github.com/facebook-github-bot due to "Diff reverted internally"
@pytorchmergebot
Collaborator

@atalman your PR has been successfully reverted.

@pytorchmergebot pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Jul 19, 2025
@atalman atalman force-pushed the add_warning_about_old_sm branch from e17ea11 to 7706737 Compare July 19, 2025 00:39
Move code

fixes

Revert "conda"

This reverts commit 2853662.

Revert "use tos accept"

This reverts commit 8b34264.

Revert "conda"

This reverts commit 2853662.

Revert "Revert "conda""

This reverts commit e732654.

Revert "Revert "use tos accept""

This reverts commit c456c54.

Revert "Revert "conda""

This reverts commit bb4fa09.

fix

fix_arch_list

fix

fixes
@atalman atalman force-pushed the add_warning_about_old_sm branch from e8cd442 to 100002a Compare July 19, 2025 00:49
```python
if torch.version.cuda is not None:  # on ROCm we don't want this check
    CUDA_VERSION = torch._C._cuda_getCompiledVersion()  # noqa: F841
if (
    torch.version.cuda is not None and torch.cuda.get_arch_list()
```
Contributor Author

The new version adds a check for torch.cuda.get_arch_list().
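The extra guard can be sketched as a small predicate. This is illustrative, not the exact `torch/cuda/__init__.py` code; `should_check_arch` is a hypothetical name. The point is that the warning logic should be skipped both on ROCm (where `torch.version.cuda` is None) and on builds that report an empty arch list, which is what broke internal users in the first landing.

```python
def should_check_arch(cuda_version, arch_list):
    """Only run the old-arch warning when this is a CUDA build AND the
    binary actually reports a non-empty compiled arch list."""
    return cuda_version is not None and bool(arch_list)


print(should_check_arch("12.8", ["sm_70"]))  # CUDA build with arches: check
print(should_check_arch("12.8", []))         # empty arch list: skip
print(should_check_arch(None, ["sm_70"]))    # ROCm build: skip
```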

@atalman atalman closed this Jul 19, 2025
pytorchmergebot pushed a commit that referenced this pull request Jul 20, 2025

Please note I reverted the original PR #158301 because it broke internal users. This is a reland, with an added check for a non-empty torch.cuda.get_arch_list().
Pull Request resolved: #158700
Approved by: https://github.com/huydhn, https://github.com/Skylion007, https://github.com/eqy
tvukovic-amd pushed a commit to ROCm/pytorch that referenced this pull request Aug 20, 2025
Add warning about removed sm50 and sm60 arches (pytorch#158301)


Pull Request resolved: pytorch#158301
Approved by: https://github.com/nWEIdia, https://github.com/albanD

(cherry picked from commit fb731fe)

Co-authored-by: atalman <atalman@fb.com>