[ROCM][CI] Introduce tests-to-include as rocm-test workflow input #110511
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110511
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit ac4ae2f with merge base 0fd856c. This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Seeing a failure |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
37181f5 to 84a7a0c
|
ROCm CI ran 1 shard as expected: https://github.com/pytorch/pytorch/actions/runs/6552121852/job/17795441333
Since the aim of this PR is to introduce a way to run a smaller set of core unit tests, we should exclude the cpp tests. |
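For readers following along, the selection idea described above can be sketched roughly as below. This is only an illustration, not the code in this PR: the function name `select_tests`, the space-separated format of the include string, the `cpp/` prefix, and the test names are all assumptions made for the example.

```python
from typing import Iterable, List


def select_tests(
    all_tests: Iterable[str],
    tests_to_include: str = "",
    cpp_prefix: str = "cpp/",  # assumed naming convention for cpp tests
) -> List[str]:
    """Return the tests to run, given a hypothetical include string.

    An empty include string means "run everything". Otherwise only the
    requested tests are kept, in their original order, and any test whose
    name starts with `cpp_prefix` is dropped even if it was requested.
    """
    requested = tests_to_include.split()
    if not requested:
        return list(all_tests)
    wanted = set(requested)
    return [
        name
        for name in all_tests
        if name in wanted and not name.startswith(cpp_prefix)
    ]


# Hypothetical test names, for illustration only.
all_tests = ["test_nn", "test_torch", "cpp/basic", "test_autograd"]
selected = select_tests(all_tests, "test_torch test_nn cpp/basic")
```

With these made-up inputs, `selected` keeps only the two requested Python tests and drops the cpp entry, which matches the intent of running a smaller core set without the cpp tests.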
b21f82a to 8e934ef
|
@huydhn @clee2000 The ROCm CI as part of
|
|
Ping @jithunnair-amd to see if there is any update on this one. The context is that we start to see some ROCm failures landing in trunk, i.e. https://hud.pytorch.org/pytorch/pytorch/commit/7ccca60927cdccde63d6a1d40480950f24e9877a, because the PR didn't have |
Just updated this PR to use ROCm5.7, but otherwise it looks good from my end if all ROCm tests pass. Requesting @jeffdaily to also take a look in case I'm missing something. |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
f17f649 to f39847d
|
@jithunnair-amd are the CI failures real? |
The torchvision build failure is real, but is due to some unsupported compiler flags: The previous CI run succeeded and used a different torchvision commit. There was another torchvision commit bump after the one in the most recent failing CI run. I'm assessing if that'll resolve this issue. |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
f39847d to ac4ae2f
Actually, 0a7eef9 fixed the issue wrt unsupported compiler flags, so expecting a rebase to help. |
|
ROCm CI passing with rebase: https://github.com/pytorch/pytorch/actions/runs/6854687390/job/18638805996
Merging as pre-rebase commit had all CI checks passing except ROCm.
@pytorchbot merge -f "ROCm CI check passed post-rebase; all other CI checks passed pre-rebase already" |
Merge started
Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…torch#110511) Fixes pytorch#110181 Pull Request resolved: pytorch#110511 Approved by: https://github.com/huydhn
Fixes #110181
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang