Always synchronize on src and dst current streams when copying tensors #16966
Conversation
test/test_cuda.py (Outdated)

    _test_copy(self, x0, x1, torch.zeros(5, 5, device=d1))
    x2 = torch.zeros(5, 5, device=d0)
    _test_copy(self, x0, x2, torch.ones(5, 5, device=d1))
This behavior is expected (as we do not sync if src and dst are the same), but it looks weird to me. Should we do the sync using the dst default stream regardless of whether src and dst are different?
I don't understand. Which part is weird or unexpected here?
In these four lines, both calls copy a zero tensor into x0, but the results differ depending on whether src and dst are on the same device:

    x1 = torch.zeros(5, 5, device=d1)
    _test_copy(self, x0, x1, torch.zeros(5, 5, device=d1))
    x2 = torch.zeros(5, 5, device=d0)
    _test_copy(self, x0, x2, torch.ones(5, 5, device=d1))
To make the multi-device case behave like the single-device case, you would have to set the streams for both d0 and d1 (and synchronize with the current stream instead of the default stream). Something like:
    def _test_copy(self, x, y, output):
        x_plus_one = x + 1
        s0 = torch.cuda.Stream()
        s1 = torch.cuda.Stream()
        s2 = torch.cuda.Stream(device=y.device)
        s3 = torch.cuda.Stream(device=y.device)
        with torch.cuda.stream(s2), torch.cuda.stream(s0):
            torch.cuda._sleep(50000000)
            y.copy_(x_plus_one)
        with torch.cuda.stream(s3), torch.cuda.stream(s1):
            y.copy_(x)
        s0.synchronize()
        s1.synchronize()
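A note on the sketch above: torch.cuda.Stream() with no device argument creates a stream on the current device, so s0/s1 are source-side streams while s2/s3 live on y.device (assuming the helper is called with the source device current, as in the snippet quoted earlier). Entering both stream contexts makes each copy run with a non-default current stream active on both devices, and the final s0.synchronize()/s1.synchronize() waits on the source-side streams, presumably before the caller compares y against output.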
The copy is still synchronizing on the default stream on the destination. It should synchronize on the current stream on the destination. That way it appears as if the copy takes place in both the source's and the destination's streams, even though the kernel only runs on the current stream of the source's device. See:

pytorch/aten/src/ATen/native/cuda/Copy.cu Line 78 in c5be4c5
and
pytorch/aten/src/ATen/native/cuda/Copy.cu Line 153 in c5be4c5
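For illustration, here is a hypothetical Python sketch of that ordering using the public torch.cuda Stream/Event APIs; this is not the actual Copy.cu code, and both the helper name and the assumption that the copy kernel is enqueued on the source device's current stream are illustrative only:

    import torch

    def copy_visible_in_both_streams(dst, src):
        # Current streams of the two devices involved in the copy.
        src_stream = torch.cuda.current_stream(src.device)
        dst_stream = torch.cuda.current_stream(dst.device)
        # Assumption: the copy kernel runs on the source device's current stream.
        dst.copy_(src, non_blocking=True)
        # Record an event on the source stream after the copy, and make the
        # destination's current stream wait on it. Work later enqueued on
        # dst_stream then cannot overtake the copy, so the copy appears to
        # take place in both current streams.
        done = src_stream.record_event()
        dst_stream.wait_event(done)
        return dst

This mirrors the comment above: the kernel runs on a single stream, but the event makes the destination's current stream observe its completion.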
rerun tests after merging #17439
@mrshenli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: fixes #15568
Pull Request resolved: pytorch/pytorch#16966
Differential Revision: D14213144
Pulled By: mrshenli
fbshipit-source-id: 2fcf5e07895fde80b4aee72e2736b0def876d21f
fixes #15568