use explicitly non-returning GPU atomics #60607

jeffdaily · 2021-06-23T23:19:50Z

Enables an important performance optimization for ROCm, in light of the discussion in #41028.

CC @jithunnair-amd @sunway513
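
To illustrate what "non-returning" means in the title, here is a minimal host-side C++ sketch. It is not the code from this PR. An atomic add normally hands back the value that was stored before the update. A non-returning variant performs the same update but exposes no result, which the description above ties to ROCm performance. The names `atomic_add_returning`, `atomic_add_no_return`, and `sum_into` are hypothetical, and `std::atomic` stands in for device atomics.

```cpp
#include <atomic>
#include <vector>

// Returning form: hands back the value stored before the add.
inline int atomic_add_returning(std::atomic<int>& target, int value) {
    return target.fetch_add(value, std::memory_order_relaxed);
}

// Non-returning form: performs the same add but exposes no result,
// so callers cannot come to depend on the old value.
inline void atomic_add_no_return(std::atomic<int>& target, int value) {
    (void)target.fetch_add(value, std::memory_order_relaxed);
}

// Hypothetical accumulation helper: only the final total matters here,
// so the non-returning form is sufficient.
inline int sum_into(std::atomic<int>& total, const std::vector<int>& values) {
    for (int v : values) {
        atomic_add_no_return(total, v);
    }
    return total.load(std::memory_order_relaxed);
}
```

Call sites that only accumulate, as `sum_into` does, can switch to the non-returning form without any change in behavior. Call sites that read the previous value have to keep the returning form.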

facebook-github-bot · 2021-06-23T23:19:58Z

💊 CI failures summary and remediations

As of commit de042d3 (more details on the Dr. CI page and at hud.pytorch.org/pr/60607):

3/3 failures introduced in this PR

🕵️ 3 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

pytorch_xla_linux_bionic_py3_6_clang9_build (1/3)

Step: "(Optional) Merge target branch"

Automatic merge failed; fix conflicts and then commit the result.

CONFLICT (add/add): Merge conflict in .circleci/verbatim-sources/job-specs/job-specs-custom.yml
Auto-merging .circleci/verbatim-sources/job-specs/job-specs-custom.yml
CONFLICT (add/add): Merge conflict in .circleci/verbatim-sources/build-parameters/pytorch-build-params.yml
Auto-merging .circleci/verbatim-sources/build-parameters/pytorch-build-params.yml
CONFLICT (add/add): Merge conflict in .circleci/scripts/setup_ci_environment.sh
Auto-merging .circleci/scripts/setup_ci_environment.sh
CONFLICT (add/add): Merge conflict in .circleci/config.yml
Auto-merging .circleci/config.yml
CONFLICT (add/add): Merge conflict in .circleci/cimodel/data/pytorch_build_definitions.py
Auto-merging .circleci/cimodel/data/pytorch_build_definitions.py
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1

pytorch_linux_xenial_py3_6_gcc5_4_build (2/3)

Step: "(Optional) Merge target branch"

Automatic merge failed; fix conflicts and then commit the result.

CONFLICT (add/add): Merge conflict in .circleci/verbatim-sources/job-specs/job-specs-custom.yml
Auto-merging .circleci/verbatim-sources/job-specs/job-specs-custom.yml
CONFLICT (add/add): Merge conflict in .circleci/verbatim-sources/build-parameters/pytorch-build-params.yml
Auto-merging .circleci/verbatim-sources/build-parameters/pytorch-build-params.yml
CONFLICT (add/add): Merge conflict in .circleci/scripts/setup_ci_environment.sh
Auto-merging .circleci/scripts/setup_ci_environment.sh
CONFLICT (add/add): Merge conflict in .circleci/config.yml
Auto-merging .circleci/config.yml
CONFLICT (add/add): Merge conflict in .circleci/cimodel/data/pytorch_build_definitions.py
Auto-merging .circleci/cimodel/data/pytorch_build_definitions.py
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1

pytorch_macos_10_13_py3_test (3/3)

Step: "Test"

Jun 28 18:30:12 RuntimeError: test_quantization failed!

Jun 28 18:30:12 Generated XML report: test-reports/dist-gloo/test_quantization/TEST-quantization.core.test_quantized_op.TestQNNPackOps-20210628182712.xml
Jun 28 18:30:12 Generated XML report: test-reports/dist-gloo/test_quantization/TEST-quantization.fx.test_quantize_fx.TestQuantizeFx-20210628182712.xml
Jun 28 18:30:12 Generated XML report: test-reports/dist-gloo/test_quantization/TEST-quantization.fx.test_quantize_fx.TestQuantizeFxOps-20210628182712.xml
Jun 28 18:30:12 Generated XML report: test-reports/dist-gloo/test_quantization/TEST-quantization.eager.test_quantize_eager_ptq.TestQuantizeONNXExport-20210628182712.xml
Jun 28 18:30:12 Generated XML report: test-reports/dist-gloo/test_quantization/TEST-quantization.core.test_quantized_op.TestQuantizedEmbeddingOps-20210628182712.xml
Jun 28 18:30:12 Traceback (most recent call last):
Jun 28 18:30:12   File "test/run_test.py", line 1305, in <module>
Jun 28 18:30:12     main()
Jun 28 18:30:12   File "test/run_test.py", line 1284, in main
Jun 28 18:30:12     raise RuntimeError(err_message)
Jun 28 18:30:12 RuntimeError: test_quantization failed!
Jun 28 18:30:13 + cleanup
Jun 28 18:30:13 + retcode=1
Jun 28 18:30:13 + set +x


Exited with code exit status 1

This comment was automatically generated by Dr. CI (expand for details).

facebook-github-bot · 2021-06-26T00:31:55Z

@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ngimel · 2021-06-26T00:32:14Z

Importing to see how internal builds go.

facebook-github-bot · 2021-06-28T19:51:24Z

@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-06-29T01:19:09Z

@ngimel merged this pull request in d36ce61.

Summary:
Enables an important performance optimization for ROCm, in light of the discussion in pytorch#41028.

CC jithunnair-amd sunway513

Pull Request resolved: pytorch#60607

Reviewed By: jbschlosser

Differential Revision: D29409894

Pulled By: ngimel

fbshipit-source-id: effca258a0f37eaefa35674a7fd19459ca7dc95b

jeffdaily added 3 commits June 23, 2021 21:43

use explicitly non-returning atomics in caffe2

cc981be

use explicitly non-returning atomics in aten

6271dc6

missing update to cuda_to_hip_mappings.py for GpuAtomics

4d3ca63

jeffdaily requested a review from ngimel June 23, 2021 23:19

facebook-github-bot added the cla signed label Jun 23, 2021

pytorchbot added the open source label Jun 23, 2021

jeffdaily added 2 commits June 24, 2021 19:55

partial revert aten/src/ATen/native/cuda/KernelUtils.cuh

48f1da5

partial revert of caffe2/sgd/adagrad_fused_op_gpu.cuh

560aee5

ngimel reviewed Jun 26, 2021

aten/src/ATen/native/cuda/Sorting.cu (outdated, resolved)

aten/src/THC/THCAtomics.cuh (outdated, resolved)

remove use of atomicAdd() instead of gpuAtomicAdd() in Sorting.cu

de042d3

jbschlosser added the triaged label Jun 28, 2021

ngimel approved these changes Jun 28, 2021

jeffdaily mentioned this pull request Jun 28, 2021

use explicitly non-returning GPU atomics ROCm/pytorch#840

Merged

facebook-github-bot closed this in d36ce61 Jun 29, 2021

facebook-github-bot added the Merged label Jun 29, 2021
