KEMBAR78
[CI] Update sleef submodule to v3.8 by malfet · Pull Request #151955 · pytorch/pytorch · GitHub
Skip to content

Conversation

@malfet
Copy link
Contributor

@malfet malfet commented Apr 22, 2025

Stack from ghstack (oldest at bottom):

Should help with RISC-V cross-compilation.
3.9.0 migration is blocked by sleef project switching to C++20

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Apr 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/151955

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 4 Pending

As of commit 697b3f0 with merge base 2f74cff (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

malfet added a commit that referenced this pull request Apr 22, 2025
Should help with RiscV cross-compilation

ghstack-source-id: cb51a22
Pull Request resolved: #151955
@pytorch-bot pytorch-bot bot added topic: not user facing topic category labels Apr 22, 2025
@malfet malfet added the ciflow/trunk Trigger trunk jobs on your pull request label Apr 22, 2025
malfet added a commit that referenced this pull request Apr 22, 2025
Should help with RiscV cross-compilation

ghstack-source-id: cb51a22
Pull Request resolved: #151955
malfet added a commit that referenced this pull request Apr 23, 2025
Should help with RiscV cross-compilation

ghstack-source-id: cb51a22
Pull Request resolved: #151955
@malfet
Copy link
Contributor Author

malfet commented Apr 23, 2025

Ok, we can't update to 3.9, as this moves base compiler compatibility to C++20

[ghstack-poisoned]
malfet added a commit that referenced this pull request Apr 23, 2025
Should help with RiscV cross-compilation

ghstack-source-id: 4bd1b3f
Pull Request resolved: #151955
@malfet malfet changed the title [CI] Update sleef submodule to v3.9.0 [CI] Update sleef submodule to v3.8.0 Apr 23, 2025
@malfet malfet changed the title [CI] Update sleef submodule to v3.8.0 [CI] Update sleef submodule to v3.8 Apr 23, 2025
@malfet
Copy link
Contributor Author

malfet commented Apr 23, 2025

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@malfet
Copy link
Contributor Author

malfet commented Apr 23, 2025

@pytorchbot merge -f "All builds passed, remaining tests seems irrelevant"

@pytorchmergebot
Copy link
Collaborator

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

pytorchmergebot pushed a commit that referenced this pull request Apr 24, 2025
As reported in #149292, according to manual, `vfmsq_f32` implements `c - a * b` rather than `a * b - c`, so it's call must be prefixed with `vnegq_f32`

Also, adjust the tests to use OpMath for FMA computation to avoid accuracy error accumulation due to non-fused multiply-and-add over lower precision dtypes

Note that `Vectorized::fmsub` is not currently instantiated anywhere, so it could safely remain broken

TODO:
 - Enable C++ testing on MacOS and/or aarch64 platforms (right now Mac tests are build without C++ tests)

Fixes #149292

Pull Request resolved: #152075
Approved by: https://github.com/swolchok
ghstack dependencies: #151955
wangkuiyi pushed a commit to wangkuiyi/pytorch that referenced this pull request Apr 25, 2025
As reported in pytorch#149292, according to manual, `vfmsq_f32` implements `c - a * b` rather than `a * b - c`, so it's call must be prefixed with `vnegq_f32`

Also, adjust the tests to use OpMath for FMA computation to avoid accuracy error accumulation due to non-fused multiply-and-add over lower precision dtypes

Note that `Vectorized::fmsub` is not currently instantiated anywhere, so it could safely remain broken

TODO:
 - Enable C++ testing on MacOS and/or aarch64 platforms (right now Mac tests are build without C++ tests)

Fixes pytorch#149292

Pull Request resolved: pytorch#152075
Approved by: https://github.com/swolchok
ghstack dependencies: pytorch#151955
pytorchbot pushed a commit that referenced this pull request May 7, 2025
As reported in #149292, according to manual, `vfmsq_f32` implements `c - a * b` rather than `a * b - c`, so it's call must be prefixed with `vnegq_f32`

Also, adjust the tests to use OpMath for FMA computation to avoid accuracy error accumulation due to non-fused multiply-and-add over lower precision dtypes

Note that `Vectorized::fmsub` is not currently instantiated anywhere, so it could safely remain broken

TODO:
 - Enable C++ testing on MacOS and/or aarch64 platforms (right now Mac tests are build without C++ tests)

Fixes #149292

Pull Request resolved: #152075
Approved by: https://github.com/swolchok
ghstack dependencies: #151955

(cherry picked from commit 2ea8653)
atalman pushed a commit that referenced this pull request May 7, 2025
[vec128] Fix fmsub NEON defintion (#152075)

As reported in #149292, according to manual, `vfmsq_f32` implements `c - a * b` rather than `a * b - c`, so it's call must be prefixed with `vnegq_f32`

Also, adjust the tests to use OpMath for FMA computation to avoid accuracy error accumulation due to non-fused multiply-and-add over lower precision dtypes

Note that `Vectorized::fmsub` is not currently instantiated anywhere, so it could safely remain broken

TODO:
 - Enable C++ testing on MacOS and/or aarch64 platforms (right now Mac tests are build without C++ tests)

Fixes #149292

Pull Request resolved: #152075
Approved by: https://github.com/swolchok
ghstack dependencies: #151955

(cherry picked from commit 2ea8653)

Co-authored-by: Nikita Shulga <nshulga@meta.com>
@github-actions github-actions bot deleted the gh/malfet/297/head branch June 7, 2025 02:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk Trigger trunk jobs on your pull request Merged topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants