fix double backward for half softmax/logsoftmax by ngimel · Pull Request #17330 · pytorch/pytorch · GitHub

Conversation

@ngimel (Collaborator) commented Feb 21, 2019

Fix for #17261. @ssnl, do you have tests for it in your other PR? If not, I'll add them to this one. The example from #17261 no longer errors out (and the same goes for log_softmax).
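For reference, a minimal sketch of the kind of repro #17261 describes (not the exact snippet from the issue; assumes a CUDA device, since half ops run on GPU):

```python
import torch
import torch.nn.functional as F

# Double backward through softmax on a half tensor; before this fix,
# the second differentiation raised a dtype mismatch.
x = torch.randn(4, 8, device="cuda", dtype=torch.half, requires_grad=True)
y = F.softmax(x, dim=1)
g, = torch.autograd.grad(y.sum(), x, create_graph=True)
g.sum().backward()  # previously errored; the same pattern applies to F.log_softmax
```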

@ssnl (Collaborator) commented Feb 21, 2019

Thanks! Maybe removing this line is sufficient for a test.

# FIXME: add torch.half after https://github.com/pytorch/pytorch/issues/17261 is fixed
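Presumably that FIXME sits where torch.half is excluded from the test's dtype list, so dropping it re-enables half coverage. A hypothetical sketch of what such a half test checks, comparing against a float32 reference (numeric gradgradcheck needs double precision and can't run on half directly):

```python
import torch
import torch.nn.functional as F

x_half = torch.randn(4, 8, device="cuda", dtype=torch.half, requires_grad=True)
x_float = x_half.detach().float().requires_grad_()

grads = []
for x in (x_half, x_float):
    # double backward through log_softmax, accumulating into x.grad
    g, = torch.autograd.grad(F.log_softmax(x, dim=1).sum(), x, create_graph=True)
    g.sum().backward()
    grads.append(x.grad.float())

torch.testing.assert_allclose(grads[0], grads[1], rtol=1e-2, atol=1e-2)
```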

tools/autograd/derivatives.yaml (reconstructed diff view):

      - name: _log_softmax_backward_data(Tensor grad_output, Tensor output, int64_t dim, Tensor self)
    -     grad_output: grad - (grad * output.exp()).sum(dim, true)
    +     grad_output: grad.type_as(output) - (grad.type_as(output) * output.exp()).sum(dim, true)
          self: log_softmax_double_backward(grad, grad_output, dim, output).type_as(self)
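The failure mode, sketched with assumed dtypes (not the exact autograd internals): in double backward, `grad` can arrive as float32 while `output` was saved from the half forward, and 2019-era PyTorch did not implicitly promote mixed-dtype arithmetic, so the old formula errored out:

```python
import torch
import torch.nn.functional as F

grad = torch.randn(4, 8, device="cuda")  # float32 incoming gradient
output = F.log_softmax(torch.randn(4, 8, device="cuda", dtype=torch.half), dim=1)

# Old: grad - (grad * output.exp()).sum(1, keepdim=True)
#   -> mixed float32/float16 arithmetic, a type error at the time
# New: cast grad to output's dtype first
res = grad.type_as(output) - (grad.type_as(output) * output.exp()).sum(1, keepdim=True)
```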
@ssnl (Collaborator) commented on the diff, Feb 21, 2019

Could you use .to(output.dtype()) instead? I know it's not a problem here, but type_as ignores device index and I think we should generally avoid using it.
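A sketch of the distinction (illustrative tensors; assumes a machine with at least two GPUs): `type_as` converts to the other tensor's dtype and device *type*, moving a CPU tensor to the current CUDA device rather than to the other tensor's device index, while `.to(dtype)` only requests a dtype and never moves the tensor:

```python
import torch

a = torch.randn(2)                                     # cpu, float32
b = torch.randn(2, dtype=torch.half, device="cuda:1")

a.type_as(b)   # half on the *current* CUDA device (e.g. cuda:0),
               # not necessarily cuda:1 -- b's device index is ignored
a.to(b.dtype)  # half, stays on cpu: only the dtype is converted
```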

@ngimel (Collaborator, Author) replied

Also replaced type_as in the clamp derivative.
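For illustration, the same substitution on a clamp-style backward; the mask expression here is hypothetical, not the exact derivatives.yaml entry:

```python
import torch

self_t = torch.randn(5)
grad = torch.randn(5)
min_val, max_val = -0.5, 0.5

mask = (self_t >= min_val) & (self_t <= max_val)  # comparison yields a non-float mask
out_old = grad * mask.type_as(grad)               # before: type_as
out_new = grad * mask.to(grad.dtype)              # after: explicit .to(dtype)
```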

@ssnl (Collaborator)

Thanks!

@ssnl (Collaborator)

I also wonder if we can avoid calculating grad.type_as(output) twice, but I guess that requires putting this into a function, and can be left as a future optimization.
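The hoisted form would presumably look like this (a sketch, not code from the PR):

```python
import torch

grad = torch.randn(4, 8, device="cuda")                      # float32
output = torch.randn(4, 8, device="cuda", dtype=torch.half)  # stands in for the saved log_softmax output

g = grad.type_as(output)  # cast once, reuse twice
res = g - (g * output.exp()).sum(1, keepdim=True)
```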

@ngimel (Collaborator, Author)

Yeah, I've noticed it, but given that it's really a corner case (in most cases type_as will be a no-op), I decided to leave it as is for now.

@ssnl (Collaborator)

SGTM

@ssnl (Collaborator) left a review

This LGTM, but let's wait until CI comes up again.

@ssnl (Collaborator) commented Feb 21, 2019

@pytorchbot rebase this please

@ngimel (Collaborator, Author) commented Feb 21, 2019

I don't think the Windows build failure is related; it's coming from a thrust iterator:

15:10:39          C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v9.2/include\thrust/iterator/iterator_facade.h(517): error C2065: '__T0': undeclared identifier [C:\Jenkins\workspace\caffe2-builds\py2-cuda9.0-cudnn7-windows-build\build\caffe2\caffe2_gpu.vcxproj]

@facebook-github-bot (Contributor) left a comment

@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ngimel deleted the softmax_double_backward branch April 4, 2019 00:55
