Add train() / eval() / is_training() to C++ ScriptModule API by yf225 · Pull Request #16044 · pytorch/pytorch · GitHub

Conversation

@yf225
Contributor

@yf225 yf225 commented Jan 15, 2019

This PR aims to fix https://discuss.pytorch.org/t/how-to-change-a-loaded-model-to-evaluation-mode-in-c/32330 by adding train() / eval() / is_training() to the C++ ScriptModule API.
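For context, intended usage on a loaded module would look roughly like this (a sketch assuming the API this PR adds; the model path is only an example taken from the test below):

#include <torch/script.h>
#include <iostream>

int main() {
  // Load a serialized ScriptModule (PyTorch 1.0-era API: load() returns a shared_ptr).
  std::shared_ptr<torch::jit::script::Module> module =
      torch::jit::load("dropout_model.pt");

  module->eval();                                   // switch to inference behavior (e.g. dropout off)
  std::cout << module->is_training() << std::endl;  // prints 0

  module->train();                                  // back to training mode
  std::cout << module->is_training() << std::endl;  // prints 1
  return 0;
}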

@facebook-github-bot facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label Jan 15, 2019
@yf225 yf225 changed the title from "[WIP] Add train() / eval() / is_training() to C++ ScriptModule API" to "Add train() / eval() / is_training() to C++ ScriptModule API" on Jan 16, 2019
@yf225 yf225 requested review from gchanan, goldsborough and zdevito and removed request for goldsborough January 16, 2019 20:06
void testEvalModeForLoadedModule() {
  std::string module_path = "dropout_model.pt";
  std::shared_ptr<torch::jit::script::Module> module = torch::jit::load(module_path);
  // Test eval mode
Contributor

I think the comments here don't add too much

Contributor Author

removed

}
void train(bool on = true) {
  for (auto& submod : get_modules()) {
    submod.value().module->train(on);
Contributor

you can also write submod->module->train(on)

for (auto& submod : get_modules()) {
  submod.value().module->train(on);
}
auto t = autograd::make_variable(at::full({}, on ? 1 : 0, at::kLong));
Contributor

You can replace

auto t = autograd::make_variable(at::full({}, on ? 1 : 0, at::kLong));

with

auto t = torch::tensor(on ? 1 : 0, at::kLong);

To clarify: torch:: factory functions create variables, while at:: functions create tensors. torch::tensor / at::tensor is like torch.tensor in Python (it creates a tensor with the values you give it).
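A minimal standalone illustration of that point (a sketch, not code from this PR): the torch::tensor call already yields a Variable, so the explicit make_variable wrapping becomes unnecessary, and the stored flag can be read back with item<int64_t>().

#include <torch/torch.h>
#include <iostream>

int main() {
  bool on = true;
  // Like torch.tensor in Python: builds an int64 tensor holding the given value,
  // already wrapped as a Variable by the torch:: factory.
  auto flag = torch::tensor(on ? 1 : 0, at::kLong);
  std::cout << flag.item<int64_t>() << std::endl;  // prints 1
  return 0;
}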

Contributor

And then just embed it in the register_parameter call:

register_parameter("training", torch::tensor(on ? 1 : 0, at::kLong), /*is_buffer=*/true);
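Put together, the suggestions above would make train() look roughly like the following (a sketch assembled from the review snippets, not necessarily the exact merged code):

void train(bool on = true) {
  // Propagate the mode to all submodules first.
  for (auto& submod : get_modules()) {
    submod->module->train(on);
  }
  // Record the mode as an integer buffer named "training".
  register_parameter("training", torch::tensor(on ? 1 : 0, at::kLong), /*is_buffer=*/true);
}
void eval() {
  train(/*on=*/false);
}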

}
bool is_training() {
  if (auto p = find_parameter("training")) {
    return (*p->slot()).item().toLong() == 1;
Contributor

Here you are converting the tensor first to a Scalar, and then to a long. You can just use item<T> to get the value of the tensor without the intermediate Scalar:

(*p->slot()).item<int64_t>()
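Side by side, the two accessors read the same scalar value (a small sketch for illustration):

at::Tensor t = torch::tensor(1, at::kLong);
int64_t via_scalar = t.item().toLong();   // Tensor -> Scalar -> long
int64_t direct     = t.item<int64_t>();   // templated accessor, no intermediate Scalar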

@goldsborough goldsborough left a comment

Looks good! Some friendly nits which you can address if you feel like it, or not

Contributor

nit: p->slot()->

Contributor

nit: remove else block, just return true + comment (since you return in the if block)
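With those nits folded in, is_training() might read as follows (a sketch based on the snippet above; the fallback comment is an assumption about modules that lack the buffer):

bool is_training() {
  if (auto p = find_parameter("training")) {
    return p->slot()->item<int64_t>() == 1;
  }
  // No "training" buffer registered: assume training mode, matching the
  // Python nn.Module default.
  return true;
}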

@facebook-github-bot facebook-github-bot left a comment

@yf225 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@yf225 yf225 force-pushed the scriptmodule_eval branch from 82be773 to 4c36017 on January 30, 2019 17:27
@yf225 yf225 force-pushed the scriptmodule_eval branch from f5a442d to c03b1e5 on January 31, 2019 05:34
@facebook-github-bot facebook-github-bot left a comment

@yf225 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

