[Build] Make PyTorch compilable with gcc-14 on ARM #157867

malfet · 2025-07-08T23:20:01Z

Stack from ghstack (oldest at bottom):

-> [Build] Make PyTorch compilable with gcc-14 on ARM #157867

Fixes numerous ICEs in vreg allocations for SVE+BF16

/pytorch/aten/src/ATen/ParallelOpenMP.h:25:9: error: unrecognizable insn:
   25 | #pragma omp parallel
      |         ^~~
(insn 257 256 258 30 (set (reg:VNx8BF 449 [ bf16_vec1_217 ])
        (unspec:VNx8BF [
                (reg:VNx8BF 455)
                (reg:VNx8BF 456)
            ] UNSPEC_IORF)) "/pytorch/aten/src/ATen/cpu/vec/sve/vec_bfloat16.h":228:31 discrim 1 -1
     (nil))
during RTL pass: vregs
/pytorch/aten/src/ATen/ParallelOpenMP.h:25:9: internal compiler error: in extract_insn, at recog.cc:2812
0xd73c33 internal_error(char const*, ...)
	???:0
0xd73d1f fancy_abort(char const*, int, char const*)
	???:0
0x890053 _fatal_insn(char const*, rtx_def const*, char const*, int, char const*)
	???:0
0x890087 _fatal_insn_not_found(rtx_def const*, char const*, int, char const*)
	???:0
0x1379093 extract_insn(rtx_insn*)
	???:0

And one in RTL-expand pass while compiling Activation.cpp

during RTL pass: expand
In file included from /pytorch/aten/src/ATen/native/cpu/Activation.cpp:12,
                 from /pytorch/build/aten/src/ATen/native/cpu/Activation.cpp.DEFAULT.cpp:1:
/pytorch/aten/src/ATen/native/cpu/Activation.cpp: In lambda function:
/pytorch/aten/src/ATen/native/cpu/Activation.cpp:94:7: internal compiler error: Segmentation fault
   94 |       });
      |       ^
/pytorch/aten/src/ATen/Dispatch.h:201:7: note: in definition of macro 'AT_DISPATCH_SWITCH'
  201 |       __VA_ARGS__                                                           \
      |       ^~~~~~~~~~~
/pytorch/aten/src/ATen/Dispatch.h:72:3: note: in expansion of macro 'AT_PRIVATE_CASE_TYPE_USING_HINT'
   72 |   AT_PRIVATE_CASE_TYPE_USING_HINT(enum_type, scalar_t, __VA_ARGS__)
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/pytorch/aten/src/ATen/Dispatch.h:214:3: note: in expansion of macro 'AT_DISPATCH_CASE'
  214 |   AT_DISPATCH_CASE(at::ScalarType::Double, __VA_ARGS__) \
      |   ^~~~~~~~~~~~~~~~
/pytorch/aten/src/ATen/Dispatch.h:218:34: note: in expansion of macro 'AT_DISPATCH_CASE_FLOATING_TYPES'
  218 |   AT_DISPATCH_SWITCH(TYPE, NAME, AT_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
      |                                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/pytorch/aten/src/ATen/native/cpu/Activation.cpp:70:5: note: in expansion of macro 'AT_DISPATCH_FLOATING_TYPES'
   70 |     AT_DISPATCH_FLOATING_TYPES(input.scalar_type(), "log_sigmoid_cpu", [&] {
      |     ^~~~~~~~~~~~~~~~~~~~~~~~~~
0xd73c33 internal_error(char const*, ...)
	???:0
0x134f987 rebuild_jump_labels(rtx_insn*)
	???:0

Interestingly enough, attempt to compile Unfold2d.cpp for -march=armv8-a+sve (i.e. without sve+bf16) support also causes ICE

/pytorch/aten/src/ATen/native/cpu/Unfold2d.cpp:221:1: error: unrecognizable insn:
  221 | }
      | ^
(insn 2918 2917 2919 296 (set (reg:VNx8BI 5917)
        (unspec:VNx16BI [
                (reg:VNx8BI 5920)
                (reg:VNx8BI 5922)
                (const_vector:VNx4BI [
                        (const_int 0 [0]) repeated x8
                    ])
            ] UNSPEC_TRN1_CONV)) "/usr/include/aarch64-linux-gnu/bits/string_fortified.h":29:33 discrim 1 -1
     (expr_list:REG_EQUAL (const_vector:VNx8BI [
                (const_int 1 [0x1]) repeated x9
                (const_int 0 [0])
                (const_int 1 [0x1]) repeated x2
                (const_int 0 [0]) repeated x4
            ])
        (nil)))
during RTL pass: vregs

Which could be worked around by adding

diff --git a/aten/src/ATen/native/cpu/Unfold2d.cpp b/aten/src/ATen/native/cpu/Unfold2d.cpp
index 8ef0741e77af0a..59c76505dd6246 100644
--- a/aten/src/ATen/native/cpu/Unfold2d.cpp
+++ b/aten/src/ATen/native/cpu/Unfold2d.cpp
@@ -169,6 +169,10 @@ static void unfolded2d_acc_channels_last(
 
 /* note: due to write issues, this one cannot be parallelized as well as
  * unfolded2d_copy */
+#if defined(__GNUC__) && __GNUC__ == 14 && defined(__ARM_FEATURE_SVE)
+// Workaround for gcc-14.2.0 ICE during RTL pass: vregs when compiling for SVE
+__attribute__((optimize("no-tree-vectorize")))
+#endif
 void unfolded2d_acc_kernel(
     ScalarType dtype,
     void *finput_data,

Fixes #157842

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168

[ghstack-poisoned]

pytorch-bot · 2025-07-08T23:20:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157867

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 46 Pending

As of commit 0d24f16 with merge base dea4864 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

atalman

lgtm, how do we validate it currently ? As far as I know there is no CI with gcc-14 on arm in OSS

[ghstack-poisoned]

malfet · 2025-07-08T23:48:36Z

lgtm, how do we validate it currently ? As far as I know there is no CI with gcc-14 on arm in OSS

Alas, manually, I've just docker run ubuntu:24.04 on my local Mac and install g++-14 there

malfet · 2025-07-08T23:59:00Z

@pytorchbot merge

pytorchmergebot · 2025-07-09T00:00:48Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-07-09T00:11:58Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Lint / lintrunner-clang / linux-job

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

[ghstack-poisoned]

Fixes #157842 ghstack-source-id: 52e3e69 Pull Request resolved: #157867

malfet · 2025-07-09T02:57:24Z

@pytorchbot merge -f "Lint is green"

pytorchmergebot · 2025-07-09T02:58:54Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

robert-hardwick · 2025-07-10T20:44:44Z

Just adding that this ICE issue has been identified as fixed in GCC15 but requires backporting to GCC14 - bug tracker here https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121027

linking issue with some relevant context #157626

Update

a15b612

[ghstack-poisoned]

pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Jul 8, 2025

malfet added topic: bug fixes topic category release notes: cpu (aarch64) release notes category for aarch64, arm, etc. labels Jul 8, 2025

malfet requested review from Skylion007 and swolchok July 8, 2025 23:24

atalman approved these changes Jul 8, 2025

View reviewed changes

Skylion007 approved these changes Jul 8, 2025

View reviewed changes

Update

d28e32d

[ghstack-poisoned]

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 8, 2025

pytorchmergebot added the merging label Jul 9, 2025

pytorchmergebot removed the merging label Jul 9, 2025

Update

0d24f16

[ghstack-poisoned]

malfet added a commit that referenced this pull request Jul 9, 2025

[Build] Make PyTorch compilable with gcc-14 on ARM

45f2f2e

Fixes #157842 ghstack-source-id: 52e3e69 Pull Request resolved: #157867

pytorchmergebot added the merging label Jul 9, 2025

pytorchmergebot added the Merged label Jul 9, 2025

pytorchmergebot closed this in d623772 Jul 9, 2025

pytorchmergebot removed the merging label Jul 9, 2025

github-actions bot deleted the gh/malfet/433/head branch August 10, 2025 02:20

GaetanLepage mentioned this pull request Sep 2, 2025

python3Packages.torch: fix on aarch64-linux NixOS/nixpkgs#439489

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Build] Make PyTorch compilable with gcc-14 on ARM #157867

[Build] Make PyTorch compilable with gcc-14 on ARM #157867

Uh oh!

malfet commented Jul 8, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 8, 2025 •

edited

Loading

Uh oh!

atalman left a comment

Uh oh!

malfet commented Jul 8, 2025

Uh oh!

malfet commented Jul 8, 2025

Uh oh!

pytorchmergebot commented Jul 9, 2025

Uh oh!

pytorchmergebot commented Jul 9, 2025

Uh oh!

malfet commented Jul 9, 2025

Uh oh!

pytorchmergebot commented Jul 9, 2025

Uh oh!

robert-hardwick commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Build] Make PyTorch compilable with gcc-14 on ARM #157867

[Build] Make PyTorch compilable with gcc-14 on ARM #157867

Uh oh!

Conversation

malfet commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157867

⏳ No Failures, 46 Pending

Uh oh!

atalman left a comment

Choose a reason for hiding this comment

Uh oh!

malfet commented Jul 8, 2025

Uh oh!

malfet commented Jul 8, 2025

Uh oh!

pytorchmergebot commented Jul 9, 2025

Merge started

Uh oh!

pytorchmergebot commented Jul 9, 2025

Merge failed

Uh oh!

malfet commented Jul 9, 2025

Uh oh!

pytorchmergebot commented Jul 9, 2025

Merge started

Uh oh!

robert-hardwick commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

malfet commented Jul 8, 2025 •

edited

Loading

pytorch-bot bot commented Jul 8, 2025 •

edited

Loading