Move torch.logspace to ATen and parallelize on CPU. by gchanan · Pull Request #15438 · pytorch/pytorch

Conversation

@gchanan (Contributor) commented Dec 20, 2018

No description provided.

@facebook-github-bot (Contributor) left a comment:

@gchanan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@gchanan (Contributor, Author) commented Dec 20, 2018

Performance comparisons:
Old:

>>> timeit torch.logspace(0,5,512,device='cuda')
23.9 µs ± 2.79 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

>>> timeit torch.logspace(0,5,3*512*512,device='cuda')
23.3 µs ± 1.23 µs per loop (mean ± std. dev. of 7 runs, 100000 loops each)

>>> timeit torch.logspace(0,5,512)
44.1 µs ± 372 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

>>> timeit torch.logspace(0,5,3*512*512)
64.1 ms ± 732 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

OMP_NUM_THREADS=1
>>> timeit torch.logspace(0,5,512)
44.8 µs ± 586 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

OMP_NUM_THREADS=1
>>> timeit torch.logspace(0,5,3*512*512)
62.3 ms ± 1.07 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

New:

>>> timeit torch.logspace(0,5,512,device='cuda')
20.5 µs ± 1.41 µs per loop (mean ± std. dev. of 7 runs, 100000 loops each)

>>> timeit torch.logspace(0,5,3*512*512,device='cuda')
21.1 µs ± 1.87 µs per loop (mean ± std. dev. of 7 runs, 100000 loops each)

>>> timeit torch.logspace(0,5,512)
45.5 µs ± 316 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

>>> timeit torch.logspace(0,5,3*512*512)
3.73 ms ± 14.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

OMP_NUM_THREADS=1
>>> timeit torch.logspace(0,5,512)
46.9 µs ± 1.44 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

OMP_NUM_THREADS=1
>>> timeit torch.logspace(0,5,3*512*512)
65 ms ± 797 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
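
In short: the parallelized CPU path is roughly 17x faster on the large 3*512*512 case (64.1 ms → 3.73 ms), while the single-threaded (OMP_NUM_THREADS=1) and CUDA timings are essentially unchanged. For illustration only, a CPU logspace kernel parallelized with at::parallel_for might look roughly like the sketch below; the function name, grain size, and dispatch details are assumptions, not this PR's exact code.

// Minimal sketch of a parallelized CPU logspace kernel (illustrative only;
// names and constants are assumptions, not the code from this PR).
// Assumes a contiguous result tensor for brevity.
#include <ATen/ATen.h>
#include <ATen/Parallel.h>
#include <cmath>

at::Tensor& logspace_cpu_out_sketch(at::Tensor& result, double start, double end, int64_t steps) {
  AT_CHECK(steps >= 0, "number of steps must be non-negative");
  if (result.numel() != steps) {
    result.resize_({steps});
  }
  if (steps == 0) {
    return result;
  }
  if (steps == 1) {
    result.fill_(std::pow(10.0, start));
    return result;
  }
  AT_DISPATCH_FLOATING_TYPES(result.type(), "logspace_cpu", [&] {
    scalar_t* data = result.data<scalar_t>();
    double step = (end - start) / (steps - 1);
    // Each index is independent, so the fill splits cleanly across threads.
    at::parallel_for(0, steps, /*grain_size=*/2048, [&](int64_t begin, int64_t chunk_end) {
      for (int64_t i = begin; i < chunk_end; ++i) {
        data[i] = static_cast<scalar_t>(std::pow(10.0, start + i * step));
      }
    });
  });
  return result;
}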

@facebook-github-bot (Contributor) left a comment:

@gchanan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

AT_CHECK(steps >= 0, "number of steps must be non-negative");

if (result.numel() != steps) {
  result.resize_({steps});
}
Contributor:

Huh, interesting that we're willing to write into any tensor that has the correct numel. Well, I suppose it's handled correctly below.

Contributor (Author):

Yes, it's strange, but that was the existing behavior.
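
For reference, the usual ATen pattern for such out= factory kernels (plausibly the "handled correctly below" part, though the rest of the diff is not shown here) is to resize only when the element count differs and to write through a contiguous temporary, copying back if needed. A generic sketch, not necessarily this PR's exact code:

// Generic out= handling sketch: a result with the right numel is written into
// as-is; a non-contiguous result is filled via a contiguous temporary.
#include <ATen/ATen.h>

at::Tensor& fill_out_sketch(at::Tensor& result, int64_t steps) {
  AT_CHECK(steps >= 0, "number of steps must be non-negative");
  if (result.numel() != steps) {
    result.resize_({steps});  // only reshape when the element count differs
  }
  at::Tensor r = result.is_contiguous() ? result : result.contiguous();
  // ... fill r's flat storage with the steps values here ...
  if (!result.is_contiguous()) {
    result.copy_(r);  // write the values back into the original layout
  }
  return result;
}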

zdevito pushed a commit to zdevito/ATen that referenced this pull request Dec 21, 2018
Summary: Pull Request resolved: pytorch/pytorch#15438

Reviewed By: ezyang

Differential Revision: D13529626

Pulled By: gchanan

fbshipit-source-id: 896e8afee3d6b5a706c4f5815b91ba6bd8af6672
@ezyang added the merged label Jun 25, 2019