DOC Adds code comment for _ConvNd.reset_parameters #58931
Conversation
💊 CI failures summary and remediations — as of commit 0e77bc5 (more details on the Dr. CI page):

🕵️ 3 new failures recognized by patterns. The following CI failures do not appear to be due to upstream breakages:

| Job | Step | Action |
|---|---|---|
| Unknown | | 🔁 rerun |
This comment was automatically generated by Dr. CI.
Thanks for adding the clarification!
```python
def reset_parameters(self) -> None:
    # Setting a=sqrt(5) in kaiming_uniform is the same as initializing with
    # uniform(-1/sqrt(k), 1/sqrt(k)), where k = weight.size(1) * prod(*kernel_size)
```
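The equivalence stated in the comment can be checked algebraically. As a sketch (using the standard `kaiming_uniform_` formulas, where samples are drawn from `U(-bound, bound)` with `bound = gain * sqrt(3 / fan_in)` and `gain = sqrt(2 / (1 + a**2))`; the fan-in value below is an arbitrary example):

```python
import math

# kaiming_uniform_ with negative slope a samples from U(-bound, bound), where
#   gain  = sqrt(2 / (1 + a**2))
#   bound = gain * sqrt(3 / fan_in)
# For a conv weight, fan_in = weight.size(1) * prod(kernel_size) = k.
a = math.sqrt(5)
gain = math.sqrt(2.0 / (1.0 + a ** 2))  # = sqrt(1/3)

k = 60 * 3 * 8  # example fan-in: (in_channels / groups) * kernel_h * kernel_w
bound = gain * math.sqrt(3.0 / k)

# With a = sqrt(5), the sqrt(3) factors cancel and the bound collapses to 1/sqrt(k):
assert math.isclose(bound, 1.0 / math.sqrt(k))
```

In other words, `sqrt(2 / (1 + 5)) * sqrt(3 / k) = sqrt(1/3) * sqrt(3) / sqrt(k) = 1 / sqrt(k)`, which is exactly the `uniform(-1/sqrt(k), 1/sqrt(k))` initialization the comment describes.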
Can you also add a line to the effect of: "See for more details.", where the link points to Soumith's comment explaining the calculation (#15314 (comment))?
@jbschlosser has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@jbschlosser merged this pull request in 8130f2f.
Summary: Fixes pytorch#55741 by adding a comment regarding the behavior of `kaiming_uniform_`. The docstring is correct in this case. For example:

```python
import math

import matplotlib.pyplot as plt
import torch
import torch.nn as nn

in_channels = 120
groups = 2
kernel = (3, 8)
m = nn.Conv2d(in_channels=in_channels, groups=groups, out_channels=100, kernel_size=kernel)

k = math.sqrt(groups / (in_channels * math.prod(kernel)))
print(f"k: {k:0.6f}")
print(f"min weight: {m.weight.min().item():0.6f}")
print(f"max weight: {m.weight.max().item():0.6f}")
```

outputs:

```
k: 0.026352
min weight: -0.026352
max weight: 0.026352
```

And when we plot the distribution, it is uniform with the correct bounds:

```python
_ = plt.hist(m.weight.detach().numpy().ravel())
```

(histogram image: weight values uniformly distributed within the computed bounds)

Pull Request resolved: pytorch#58931
Reviewed By: anjali411
Differential Revision: D28689863
Pulled By: jbschlosser
fbshipit-source-id: 98eebf265dfdaceed91f1991fc4b1592c0b3cf37