add `OpInfo` for `torch.nn.functional.mse_loss` by pmeier · Pull Request #62254 · pytorch/pytorch · GitHub

Conversation

pmeier (Collaborator) commented on Jul 27, 2021
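
For orientation, OpInfo entries are added to the op_db list in torch/testing/_internal/common_methods_invocations.py. The snippet below is only a rough sketch of the general shape such an entry takes, not this PR's actual diff; the sample_inputs_mse_loss helper is a hypothetical name standing in for whatever generator the entry references.

```python
# Rough sketch of an OpInfo entry for nn.functional.mse_loss -- not this PR's
# actual diff. Assumes the conventions of
# torch/testing/_internal/common_methods_invocations.py, where such entries are
# appended to the op_db list.
import torch
from torch.testing._internal.common_methods_invocations import OpInfo, SampleInput


def sample_inputs_mse_loss(op_info, device, dtype, requires_grad, **kwargs):
    # Hypothetical generator name; a fuller small-shape sketch appears later in the thread.
    inp = torch.randn(2, 3, device=device, dtype=dtype, requires_grad=requires_grad)
    target = torch.randn(2, 3, device=device, dtype=dtype)
    return [SampleInput(inp, args=(target,))]


mse_loss_opinfo = OpInfo(
    "nn.functional.mse_loss",               # resolved to torch.nn.functional.mse_loss
    dtypes=(torch.float32, torch.float64),  # dtypes the generated tests exercise on CPU
    sample_inputs_func=sample_inputs_mse_loss,
    supports_out=False,                     # mse_loss has no out= variant
)
```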

facebook-github-bot (Contributor) commented on Jul 27, 2021

💊 CI failures summary and remediations

As of commit 4c62164 (more details on the Dr. CI page):


  • 1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_macos_10_13_py3_test (1/1)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

Aug 02 18:50:28 [E request_callback_no_python.c...yUniqueId(created_on=0, local_id=0) to be created.
Aug 02 18:50:17 ok (8.853s)
Aug 02 18:50:19   test_remote_message_script_delay_timeout (__main__.FaultyFaultyAgentRpcTestWithSpawn) ... [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:19 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:19 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:19 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:24 ok (7.376s)
Aug 02 18:50:26   test_remote_message_script_delay_timeout_to_self (__main__.FaultyFaultyAgentRpcTestWithSpawn) ... [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:26 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:26 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:26 [W tensorpipe_agent.cpp:186] Failed to look up the IP address for the hostname (EAI_NONAME: unknown node or service (this error originated at tensorpipe/transport/uv/utility.cc:97)), defaulting to 127.0.0.1
Aug 02 18:50:28 [E request_callback_no_python.cpp:555] Received error while processing request type 260: falseINTERNAL ASSERT FAILED at "../torch/csrc/distributed/rpc/rref_context.cpp":387, please report a bug to PyTorch. Expected OwnerRRef with id GloballyUniqueId(created_on=0, local_id=0) to be created.
Aug 02 18:50:28 Exception raised from getOwnerRRef at ../torch/csrc/distributed/rpc/rref_context.cpp:387 (most recent call first):
Aug 02 18:50:28 frame #0: c10::Error::Error(c10::SourceLocation, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >) + 98 (0x10bf30652 in libc10.dylib)
Aug 02 18:50:28 frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&) + 106 (0x10bf2edca in libc10.dylib)
Aug 02 18:50:28 frame #2: c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&) + 64 (0x10bf2f000 in libc10.dylib)
Aug 02 18:50:28 frame #3: torch::distributed::rpc::RRefContext::getOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, bool) + 1711 (0x11b2675ff in libtorch_cpu.dylib)
Aug 02 18:50:28 frame #4: torch::distributed::rpc::RequestCallbackNoPython::assignOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, torch::distributed::rpc::GloballyUniqueId const&, c10::intrusive_ptr<c10::ivalue::Future, c10::detail::intrusive_target_default_null_type<c10::ivalue::Future> >) const + 86 (0x11b251e56 in libtorch_cpu.dylib)
Aug 02 18:50:28 frame #5: torch::distributed::rpc::RequestCallbackImpl::processScriptRemoteCall(torch::distributed::rpc::RpcCommandBase&, std::__1::vector<c10::Stream, std::__1::allocator<c10::Stream> >) const + 376 (0x1171e6768 in libtorch_python.dylib)
Aug 02 18:50:28 frame #6: torch::distributed::rpc::RequestCallbackNoPython::processRpc(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::__1::vector<c10::Stream, std::__1::allocator<c10::Stream> >) const + 437 (0x11b250aa5 in libtorch_cpu.dylib)
Aug 02 18:50:28 frame #7: torch::distributed::rpc::RequestCallbackImpl::processRpcWithErrors(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::__1::vector<c10::Stream, std::__1::allocator<c10::Stream> >) const + 74 (0x1171e74da in libtorch_python.dylib)
Aug 02 18:50:28 frame #8: c10::intrusive_ptr<c10::ivalue::Future, c10::detail::intrusive_target_default_null_type<c10::ivalue::Future> > c10::ivalue::Future::thenAsync<torch::distributed::rpc::RequestCallbackNoPython::processMessage(torch::distributed::rpc::Message&, std::__1::vector<c10::Stream, std::__1::allocator<c10::Stream> >) const::$_1>(torch::distributed::rpc::RequestCallbackNoPython::processMessage(torch::distributed::rpc::Message&, std::__1::vector<c10::Stream, std::__1::allocator<c10::Stream> >) const::$_1, std::__1::shared_ptr<c10::Type>)::'lambda'(c10::ivalue::Future&)::operator()(c10::ivalue::Future&) + 223 (0x11b25876f in libtorch_cpu.dylib)

This comment was automatically generated by Dr. CI.

pmeier requested review from mruberry and zou3519 on Jul 27, 2021, 09:52
pmeier (Collaborator, Author) commented on Jul 27, 2021

Not sure about this failure: https://app.circleci.com/pipelines/github/pytorch/pytorch/356512/workflows/199959e4-7e76-4245-972e-88fa1de4872d/jobs/15029588/tests#failed-test-0

A lot of other OpInfos seem to skip it, so I'm not sure whether it is warranted to do the same here.
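
For reference, the skip mechanism other OpInfo entries use for this generated test is the skips= field of the entry. A minimal sketch of that pattern follows, assuming the DecorateInfo helper from common_methods_invocations.py; it is illustrative only and may not match the skip this PR eventually landed with.

```python
# Minimal sketch of how other OpInfo entries skip a known-failing generated
# test -- illustrative only, and possibly not the exact form this PR used.
# Assumes DecorateInfo from torch/testing/_internal/common_methods_invocations.py.
import unittest

from torch.testing._internal.common_methods_invocations import DecorateInfo

mse_loss_skips = (
    # Targets TestJit.test_variant_consistency_jit, the test failing in the CI
    # run linked above (the alias-annotation check hits an internal assert).
    DecorateInfo(unittest.skip("Skipped!"), "TestJit", "test_variant_consistency_jit"),
)
# The tuple would then be attached to the entry as OpInfo(..., skips=mse_loss_skips).
```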

zou3519 (Contributor) commented on Jul 27, 2021

Test failures look real:

======================================================================
ERROR [0.070s]: test_variant_consistency_jit_nn_functional_mse_loss_cpu_float32 (__main__.TestJitCPU)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_device_type.py", line 378, in instantiated_test
    raise rte
  File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_device_type.py", line 373, in instantiated_test
    result = test(self, **param_kwargs)
  File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_device_type.py", line 780, in test_wrapper
    return test(*args, **kwargs)
  File "test_ops.py", line 731, in test_variant_consistency_jit
    func_type=func_type, aten_name=op.aten_name)
  File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/jit_metaprogramming_utils.py", line 490, in check_alias_annotation
    torch._C._jit_check_alias_annotation(CU.the_method.graph, tuple(tensors), aten_name)
RuntimeError: aliasOp != torch::jit::getOperatorAliasMap().end()INTERNAL ASSERT FAILED at "/var/lib/jenkins/workspace/torch/csrc/jit/passes/utils/check_alias_annotation.cpp":159, please report a bug to PyTorch. 

I'm not sure what the right way to handle this is. @mruberry, @eellison -- is adding a Skip and then filing an issue the way to go?

ejguan added the `triaged` label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) on Jul 27, 2021
pmeier added the `module: nn` (Related to torch.nn) and `module: testing` (Issues related to the torch.testing module, not tests) labels on Jul 28, 2021
mruberry (Collaborator) commented:

> Test failures look real:
>
> ERROR [0.070s]: test_variant_consistency_jit_nn_functional_mse_loss_cpu_float32 (__main__.TestJitCPU)
> [... full traceback quoted above ...]
> RuntimeError: aliasOp != torch::jit::getOperatorAliasMap().end()INTERNAL ASSERT FAILED at "/var/lib/jenkins/workspace/torch/csrc/jit/passes/utils/check_alias_annotation.cpp":159, please report a bug to PyTorch.
>
> I'm not sure what the right way to handle this is. @mruberry, @eellison -- is adding a Skip and then filing an issue the way to go?

Skipping seems OK for now. My guess is there's an issue with the test. I wouldn't bother filing an issue unless @eellison would like one.

mruberry (Collaborator) commented:

The mypy errors are unrelated. I would just skip the test, and if you could tweak the sample inputs generator that would be great -- large sample inputs can take a long time to test.

@zou3519 would you shepherd this through?
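
On the sample inputs point above: a generator that keeps shapes small might look roughly like the sketch below. The function name, the specific shapes, and the import locations are assumptions following the SampleInput/make_tensor conventions of common_methods_invocations.py, not the PR's actual code.

```python
# Illustrative sketch of a small-shape sample-inputs generator -- not the PR's
# actual code. Assumes the SampleInput / make_tensor conventions of
# torch/testing/_internal/common_methods_invocations.py.
from functools import partial

from torch.testing import make_tensor
from torch.testing._internal.common_methods_invocations import SampleInput


def sample_inputs_mse_loss(op_info, device, dtype, requires_grad, **kwargs):
    # Small shapes keep gradcheck and the JIT variant-consistency tests fast,
    # which is the concern raised above about large sample inputs.
    make_input = partial(make_tensor, device=device, dtype=dtype, requires_grad=requires_grad)
    make_target = partial(make_tensor, device=device, dtype=dtype)

    samples = [
        SampleInput(make_input(()), args=(make_target(()),)),         # scalar input/target
        SampleInput(make_input((2, 3)), args=(make_target((2, 3)),)),  # default reduction="mean"
    ]
    for reduction in ("none", "mean", "sum"):
        samples.append(
            SampleInput(
                make_input((2, 3)),
                args=(make_target((2, 3)),),
                kwargs={"reduction": reduction},
            )
        )
    return samples
```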

facebook-github-bot (Contributor) commented:

@zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor) commented:

@zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor) commented:

@zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor) commented:

@zou3519 merged this pull request in 2cf4d81.

Labels: cla signed, Merged, module: nn, module: testing, open source, triaged
