Fix dot reference checks by kundaMwiza · Pull Request #138596 · pytorch/pytorch · GitHub

Conversation

@kundaMwiza
Copy link
Collaborator

@kundaMwiza kundaMwiza commented Oct 22, 2024

The `dot` reference implementation should be consistent with the CPU / CUDA implementations, since it may be used for meta dispatch.

For example:

```python
import torch

x = torch.tensor([1, 2, 3], dtype=torch.float32)
y = torch.tensor([4, 5, 6], dtype=torch.float16)
x.dot(y)
```

```
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
RuntimeError: dot : expected both vectors to have same dtype, but found Float and Half
```

However, the same call on meta tensors does not raise an exception:

```python
x.to("meta").dot(y.to("meta"))
```
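As a minimal pure-Python sketch of the check the reference implementation needs (names here are hypothetical; `_check` only mimics `torch._check` semantics, raising `RuntimeError` when the condition is false):

```python
# Hypothetical sketch, not PyTorch's actual code: `_check` mimics
# torch._check, which raises RuntimeError with the message from
# `msg_fn` when `cond` is False.
def _check(cond, msg_fn):
    if not cond:
        raise RuntimeError(msg_fn())


def dot_dtype_check(self_dtype, other_dtype):
    # Mirror the CPU / CUDA error so meta dispatch fails the same way.
    _check(
        self_dtype == other_dtype,
        lambda: "dot : expected both vectors to have same dtype, but found "
        f"{self_dtype} and {other_dtype}",
    )
```

With this in place, mismatched dtypes raise the same error on every backend instead of silently succeeding on meta.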

Fixes #ISSUE_NUMBER

@kundaMwiza kundaMwiza requested a review from mruberry as a code owner October 22, 2024 15:51
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138596

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f38dd0f with merge base 10a34dc:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@kundaMwiza
Copy link
Collaborator Author

For some more context, see slack discussion: https://pytorch.slack.com/archives/C3PDTEV8E/p1728898894696069

@bdhirsh bdhirsh added the release notes: composability release notes category label Oct 23, 2024
Copy link
Contributor

@bdhirsh bdhirsh left a comment


lgtm

@kundaMwiza kundaMwiza force-pushed the mwizak/fix-dot-meta-py-decomp branch 2 times, most recently from 1ab84ec to b848e92 Compare October 25, 2024 11:20
@kundaMwiza
Copy link
Collaborator Author

kundaMwiza commented Oct 25, 2024

@bdhirsh I've updated the PR to only apply type promotion after the checks. This should hopefully fix the failing tests

Copy link
Contributor


Isn't this going to skip the type promotion in the `self.is_complex()` path?

It might be cleaner just to take the existing decomps, and wrap them in a new function that does the dtype checks first:

```python
@elementwise_type_promotion_wrapper(
    type_promoting_args=("self", "other"),
    type_promotion_kind=ELEMENTWISE_TYPE_PROMOTION_KIND.DEFAULT,
)
def dot_helper(self, other):
    ...  # existing fn


@register_decomposition(aten.dot)
@out_wrapper()
def dot(self, other):
    torch._check(
        self.dtype == other.dtype,
        lambda: "dot : expected both vectors to have same dtype, but found "
        f"{self.dtype} and {other.dtype}",
    )
    return dot_helper(self, other)
```

wdyt?

Copy link
Collaborator Author


I did it this way because the complex branch is effectively the same implementation as the C++ one, e.g. for dot CPU:

```cpp
Tensor dot(const Tensor &self, const Tensor &other){
```

So the computation and result dtype in this branch should be correct, as `torch.dot` / `vdot` would do this if necessary.

The actual decomposition into an elementwise product followed by a reduction is only for the case when the inputs are real, so type promotion is applied.

That being said, if the computation and result dtypes for `dot`, `vdot` and the product + reduction path should be the same, then it would be cleaner to do what you suggest; I just didn't want to make the assumption that they are.
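For illustration, a toy sketch (not PyTorch's actual code) of the real-input path discussed here: promote both inputs to a common dtype, then decompose into an elementwise product followed by a sum reduction. The promotion lattice below is a hypothetical subset of the real rules.

```python
# Toy model of the real-input dot decomposition: default elementwise
# type promotion, then multiply + sum. The dtype lattice is a small
# illustrative subset of PyTorch's actual promotion rules.
_ORDER = ["float16", "float32", "float64"]


def promote(a, b):
    # Higher-precision dtype wins within this toy lattice.
    return max(a, b, key=_ORDER.index)


def dot_decomp(xs, x_dtype, ys, y_dtype):
    out_dtype = promote(x_dtype, y_dtype)
    # Elementwise product followed by a sum reduction.
    return sum(x * y for x, y in zip(xs, ys)), out_dtype
```

For `[1, 2, 3]` (float32) and `[4, 5, 6]` (float16) this yields the value `32` at the promoted dtype `float32`, matching the promotion the eager kernels apply after the dtype check passes.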

Copy link
Collaborator Author

@kundaMwiza kundaMwiza Oct 25, 2024


Looks like `dot_naive` uses the same type promotion rules anyway:

```cpp
scalar_t dot_naive(
```

I'll change the code to your suggestion

@kundaMwiza kundaMwiza force-pushed the mwizak/fix-dot-meta-py-decomp branch from 7ef5d6e to f38dd0f Compare October 25, 2024 19:43
@kundaMwiza
Copy link
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 28, 2024
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@bdhirsh
Copy link
Contributor

bdhirsh commented Oct 28, 2024

Thanks!

rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Oct 29, 2024
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024

Labels

ciflow/trunk · Merged · open source · release notes: composability

4 participants