at::native batch norm kernel launch config update by jjsjann123 · Pull Request #17047 · pytorch/pytorch · GitHub

Conversation

@jjsjann123
Collaborator

limit block dimension to avoid configuration error on batch norm kernel launch

This should resolve #16998
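For context, the failure this addresses (issue #16998) surfaces at the Python level when batch norm runs over a very large batch on CUDA and the kernel launch fails with a configuration error. A minimal sketch of that kind of workload, with a placeholder batch size rather than the exact sizes from the issue:

```python
import torch
import torch.nn as nn

# Hedged repro sketch: the batch size here is a placeholder standing in for
# "very large batch", not the exact size from issue #16998. Before this change,
# workloads shaped like this could fail at the CUDA batch norm kernel launch;
# with the block dimension capped, the launch stays within device limits.
if torch.cuda.is_available():
    bn = nn.BatchNorm2d(1).cuda().float()
    data = torch.rand(1_000_000, 1, 1, 1, device="cuda", dtype=torch.float32)
    bn(data).sum().backward()
```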

@ezyang
Contributor

ezyang commented Feb 13, 2019

Failure is real

Feb 13 09:00:38 ======================================================================
Feb 13 09:00:38 ERROR: test_batchnorm_large_batch (__main__.TestNN)
Feb 13 09:00:38 ----------------------------------------------------------------------
Feb 13 09:00:38 Traceback (most recent call last):
Feb 13 09:00:38   File "test_nn.py", line 5469, in test_batchnorm_large_batch
Feb 13 09:00:38     out = bn(data).sum().backward()
Feb 13 09:00:38   File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/module.py", line 491, in __call__
Feb 13 09:00:38     result = self.forward(*input, **kwargs)
Feb 13 09:00:38   File "/opt/conda/lib/python3.6/site-packages/torch/nn/modules/batchnorm.py", line 76, in forward
Feb 13 09:00:38     exponential_average_factor, self.eps)
Feb 13 09:00:38   File "/opt/conda/lib/python3.6/site-packages/torch/nn/functional.py", line 1667, in batch_norm
Feb 13 09:00:38     training, momentum, eps, torch.backends.cudnn.enabled
Feb 13 09:00:38 RuntimeError: expected scalar type Float but found Double
Feb 13 09:00:38 
Feb 13 09:00:38 ----------------------------------------------------------------------

@jjsjann123
Collaborator Author

Of course, I only updated the separate test inside the container and never pushed that back to the main repo...
Let me update float to float32.

@jjsjann123
Collaborator Author

Hmmm, it's a surprise that test_nn defaults the layer to double...
I was missing a type conversion for the layer rather than the input. Fixed in the last commit.
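A sketch of what that test-side fix looks like, assuming the test harness defaults tensors to double: the layer itself, not only the input, has to be converted so parameters and data agree on float32 (shapes below are illustrative):

```python
import torch
import torch.nn as nn

# Hedged sketch of the fix being described: if the harness defaults to double,
# building the layer without an explicit dtype leaves its parameters in double,
# and passing float32 data then raises
# "expected scalar type Float but found Double". Converting the layer as well
# as the input keeps everything in float32.
torch.set_default_dtype(torch.double)    # stand-in for the test_nn default
bn = nn.BatchNorm2d(3).float()           # convert the layer's parameters to float32
data = torch.rand(8, 3, 4, 4, dtype=torch.float32)
bn(data).sum().backward()                # dtypes now agree, so this runs
torch.set_default_dtype(torch.float)     # restore the usual default
```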

@jjsjann123
Collaborator Author

@pytorchbot retest this please

@jjsjann123
Collaborator Author

Bumped the master merge just to rerun the failed ci/circleci tests.

Contributor

@facebook-github-bot left a comment


@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Feb 20, 2019
Summary:
limit block dimension to avoid configuration error on batch norm kernel launch

This should resolve #16998
Pull Request resolved: pytorch/pytorch#17047

Differential Revision: D14142132

Pulled By: soumith

fbshipit-source-id: 9c8c52dcd1d108cda1f65f5227e625b8fe6e12a0

Successfully merging this pull request may close these issues.

Batchnormalization fails with CUDA on very large batches
