use all_weights instead of _parameters in _flat_weights in rnn #15766
Conversation
failing tests

Build and ASan failures seem unrelated (cannot install moreutils, temporary failure in name resolution); cuda9_cudnn7_py2_test looks real, I'll take a look next week.

Remaining macOS failures are unrelated.
@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Well it really looks like a hot patch for a much deeper issue, which is that weight norm and similar don't play well with our module design... Not being able to reliably access parameters in any way other than by name is just bad.
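For context, a minimal sketch of the failure mode being discussed, assuming a recent PyTorch build; `_parameters` and `_all_weights` are internals of `nn.RNNBase` and may differ across versions:

```python
import torch.nn as nn
from torch.nn.utils import weight_norm

lstm = nn.LSTM(input_size=4, hidden_size=8)

# Before weight_norm, the raw weight is registered in _parameters.
assert 'weight_hh_l0' in lstm._parameters

# weight_norm removes the original Parameter and registers
# weight_hh_l0_g / weight_hh_l0_v in its place; the effective weight is
# recomputed and set as a plain attribute before each forward.
weight_norm(lstm, name='weight_hh_l0')
assert 'weight_hh_l0' not in lstm._parameters
assert 'weight_hh_l0_g' in lstm._parameters

# Name-based lookup still finds every weight, which is why building
# _flat_weights from all_weights (attribute names) is more robust than
# reading _parameters directly.
flat = [getattr(lstm, name) for names in lstm._all_weights for name in names]
```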
Summary: **WIP** Attempt 2 at #14831

This adds `nn.LSTM` to the jit standard library. Necessary changes to the module itself are detailed in comments. The main limitation is the lack of a true `PackedSequence`; instead, this PR uses an ordinary `tuple` to stand in for `PackedSequence`.

Most of the new code in `rnn.py` is copied from `nn.RNNBase` into `nn.LSTM` to specialize it for LSTM, since for LSTM `hx` is a `Tuple[Tensor, Tensor]` (rather than just a `Tensor` as in the other RNN modules). As a hack, it adds an internal annotation `@_parameter_list` to mark that a function returns all the parameters of a module.

The weights for `RNN` modules are passed to the corresponding op as a `List[Tensor]`. In Python this list has to be gathered dynamically, since Parameters could be moved from CPU to GPU or be deleted and replaced (e.g. if someone calls `weight_norm` on their module, #15766), but in the JIT parameter lists are immutable; hence a builtin handles this differently in Python and the JIT.

Pull Request resolved: #15744
Differential Revision: D14173198
Pulled By: driazati
fbshipit-source-id: 4ee8113159b3a8f29a9f56fe661cfbb6b30dffcd
Fixes #15749
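To illustrate the dynamic gathering described in the summary above, here is a hedged sketch; `gather_flat_weights` is a hypothetical helper for illustration, not the actual builtin added by the PR, and `_all_weights` is an internal attribute that may vary across versions.

```python
import torch
import torch.nn as nn

def gather_flat_weights(module, weight_names):
    # Hypothetical helper: rebuild the list on every call. In eager Python
    # this stays correct even if parameters were moved (.cuda()) or replaced
    # (weight_norm) after the module was constructed; in the JIT, parameter
    # lists are immutable, so that case is handled differently.
    return [getattr(module, name) for name in weight_names]

lstm = nn.LSTM(input_size=4, hidden_size=8)
names = [n for group in lstm._all_weights for n in group]
weights = gather_flat_weights(lstm, names)

# For nn.LSTM the hidden state is a Tuple[Tensor, Tensor] (h_0, c_0),
# unlike the single Tensor used by nn.RNN and nn.GRU.
x = torch.randn(5, 3, 4)    # (seq_len, batch, input_size)
h0 = torch.zeros(1, 3, 8)   # (num_layers, batch, hidden_size)
c0 = torch.zeros(1, 3, 8)
out, (hn, cn) = lstm(x, (h0, c0))
```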