[jit] Add LSTM to standard library by driazati · Pull Request #15744 · pytorch/pytorch · GitHub

Conversation

driazati (Contributor) commented Jan 4, 2019

WIP

Attempt 2 at #14831

This adds nn.LSTM to the JIT standard library. Necessary changes to the module itself are detailed in comments. The main limitation is the lack of a true PackedSequence; instead, this PR uses an ordinary tuple to stand in for PackedSequence.
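For orientation, here is a rough sketch of what the stand-in looks like from the caller's side. It assumes the tuple carries the same (data, batch_sizes) fields that PackedSequence holds; this is an illustration, not the PR's literal interface:

import torch
from torch.nn.utils.rnn import pack_padded_sequence

# Eager mode: pack_padded_sequence returns a real PackedSequence.
seqs = torch.randn(5, 3, 10)                      # (seq_len, batch, input_size)
packed = pack_padded_sequence(seqs, [5, 4, 2])    # lengths sorted in decreasing order

# Under the JIT, a plain tuple plays the same role (field layout assumed here):
packed_as_tuple = (packed.data, packed.batch_sizes)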

Most of the new code in rnn.py is copied from nn.RNNBase into nn.LSTM to specialize it, since for LSTM hx is a Tuple[Tensor, Tensor] rather than a single Tensor as in the other RNN modules.
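To illustrate the type difference driving the copy, here is a hedged sketch of the signatures involved; the function names are illustrative, not the exact annotations in this PR:

from typing import Optional, Tuple
from torch import Tensor

# GRU/RNN: the hidden state is a single tensor.
def rnn_forward(input: Tensor, hx: Optional[Tensor] = None) -> Tuple[Tensor, Tensor]: ...

# LSTM: the hidden state is an (h_0, c_0) pair, so hx is a tuple of two tensors.
def lstm_forward(input: Tensor,
                 hx: Optional[Tuple[Tensor, Tensor]] = None
                 ) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: ...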

As a hack, it adds an internal annotation @_parameter_list to mark that a function returns all the parameters of a module. The weights for RNN modules are passed to the corresponding op as a List[Tensor]. In Python this list has to be gathered dynamically, since Parameters could be moved from CPU to GPU or be deleted and replaced (e.g. if someone calls weight_norm on their module, #15766), but in the JIT parameter lists are immutable, hence a builtin to handle this differently in Python and in the JIT.
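A minimal sketch of the dynamic gathering that the annotation papers over in eager mode; the method and attribute names below are illustrative rather than the exact ones this PR adds:

from typing import List
from torch import Tensor

# This would live on the RNN module; `self._all_weights` is assumed to be a
# nested list of parameter names (e.g. weight_ih_l0, weight_hh_l0, ...).
def _get_flat_weights(self) -> List[Tensor]:
    # Re-fetch the parameters on every call so the list stays valid even if
    # weights were replaced (e.g. by weight_norm) or moved to another device.
    # In the JIT, this list is instead treated as an immutable parameter list.
    return [getattr(self, name) for names in self._all_weights for name in names]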

facebook-github-bot added the oncall: jit label Jan 4, 2019
driazati changed the title from Lstm ov to [jit] Add LSTM to standard library Jan 4, 2019
driazati closed this Feb 5, 2019
driazati reopened this Feb 5, 2019
driazati requested a review from zdevito February 15, 2019 22:34
zdevito (Contributor) left a comment

Looks good -- the hack is a bit messy, but we discussed this in person and think it is best to get this merged and then work on the underlying functionality that would let the parameter list be presented as a first-class object. That requires having non-tensor, non-constant model attributes accessible from TorchScript.

facebook-github-bot (Contributor) left a comment

@driazati has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

mcarilli (Collaborator) commented Feb 24, 2019

Edit: Disregard all this, we figured out a workaround on our end.

@driazati @zdevito As far as I can tell, you removed 'LSTM' from _rnn_impls https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/rnn.py#L15-L19 because that layer of indirection isn't necessary anymore for LSTM (now that LSTM has its own full-fledged forward and doesn't need to lean on the forward inherited from RNNBase).

Unfortunately for us, the _rnn_impls dict is what Amp interposes on to insert its arg-casting wrapper (we can't patch _VF.* because that comes directly from C++; see the sketch after this comment). This means that Amp is broken for LSTMs on current master. Is it ok if LSTM continues to use the layer of indirection, i.e., restore

_rnn_impls = {
    'LSTM': _VF.lstm,
    'GRU': _VF.gru,
    'RNN_TANH': _VF.rnn_tanh,
    'RNN_RELU': _VF.rnn_relu,
}

and change the _VF.lstm() calls to _rnn_impls['LSTM']()? I realize this is not necessary for you and is purely in support of our own use case, but it is helpful to us and doesn't seem to do any harm. If you don't object, I can PR it.
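For reference, a rough sketch of the kind of interposition described above; this is purely illustrative and not Apex's actual wrapper code:

import torch.nn.modules.rnn as rnn_module

def _wrap_with_casts(fn):
    def wrapper(*args, **kwargs):
        # Apex would cast floating-point tensor arguments to the desired
        # precision here before dispatching to the underlying _VF op.
        return fn(*args, **kwargs)
    return wrapper

# Patching the dict entry is possible from Python; a direct _VF.lstm call is not.
for name, impl in list(rnn_module._rnn_impls.items()):
    rnn_module._rnn_impls[name] = _wrap_with_casts(impl)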

mcarilli pushed a commit to NVIDIA/apex that referenced this pull request Feb 25, 2019
driazati (Contributor, Author) commented

@mcarilli The change was due to some technical limitations in the JIT, since it uses the same module code. It's good that you found a fix, since we likely wouldn't be able to address the limitation for a few weeks.
