Convert mul to use opmath_gpu_kernel_with_scalars by ezyang · Pull Request #64019 · pytorch/pytorch · GitHub

Conversation

@ezyang
Contributor

@ezyang ezyang commented Aug 26, 2021

Stack from ghstack:

Note that previously the functor operated on scalar_t and
this modifies it to operate on opmath_t, but this is not
a problem as half precision was implemented by performing the
compute in float anyway.

Signed-off-by: Edward Z. Yang ezyang@fb.com

Differential Revision: D30575282

@pytorch-probot

pytorch-probot bot commented Aug 26, 2021

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/pytorch/pytorch/blob/aa0789f0a113042d66bc914c1ee6f9b976d6c5d6/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default

Triggered Workflows

| Workflow | Labels | Status |
| --- | --- | --- |
| linux-bionic-py3.8-gcc9-coverage | ciflow/all, ciflow/coverage, ciflow/cpu, **ciflow/default**, ciflow/linux | ✅ triggered |
| linux-xenial-cuda11.3-py3.6-gcc7 | ciflow/all, ciflow/cuda, **ciflow/default**, ciflow/linux | ✅ triggered |
| linux-xenial-py3.6-gcc5.4 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux | ✅ triggered |
| linux-xenial-py3.6-gcc7-bazel-test | ciflow/all, ciflow/bazel, ciflow/cpu, **ciflow/default**, ciflow/linux | ✅ triggered |
| win-vs2019-cpu-py3 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/win | ✅ triggered |
| win-vs2019-cuda10.1-py3 | ciflow/all, ciflow/cuda, **ciflow/default**, ciflow/win | ✅ triggered |

Skipped Workflows

| Workflow | Labels | Status |
| --- | --- | --- |
| libtorch-linux-xenial-cuda10.2-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux | 🚫 skipped |
| libtorch-linux-xenial-cuda11.3-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux | 🚫 skipped |
| linux-bionic-cuda10.2-py3.9-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow | 🚫 skipped |
| linux-xenial-cuda10.2-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow | 🚫 skipped |
| periodic-libtorch-linux-xenial-cuda11.1-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled | 🚫 skipped |
| periodic-linux-xenial-cuda11.1-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled | 🚫 skipped |
| periodic-win-vs2019-cuda11.1-py3 | ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win | 🚫 skipped |
| win-vs2019-cuda11.3-py3 | ciflow/all, ciflow/cuda, ciflow/win | 🚫 skipped |

You can add a comment to the PR and tag @pytorchbot with the following commands:
# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

@facebook-github-bot
Contributor

facebook-github-bot commented Aug 26, 2021

🔗 Helpful links

💊 CI failures summary and remediations

As of commit aa0789f (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

@ezyang ezyang changed the title from "Convert mul to use acc_gpu_kernel_with_scalars" to "Convert mul to use opmath_gpu_kernel_with_scalars" Aug 26, 2021
@ezyang ezyang requested a review from ngimel August 26, 2021 14:31
@pytorch-probot pytorch-probot bot assigned pytorchbot and unassigned pytorchbot Aug 26, 2021
ezyang added a commit that referenced this pull request Aug 26, 2021
Note that previously the functor operated on scalar_t and
this modifies it to operate on accscalar_t, but this is not
a problem as half precision was implemented by performing the
compute in float anyway.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ghstack-source-id: 161cfe1
Pull Request resolved: #64019
@ezyang
Contributor Author

ezyang commented Aug 26, 2021

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@codecov

codecov bot commented Aug 26, 2021

Codecov Report

Merging #64019 (aa0789f) into gh/ezyang/1067/base (167c9ab) will decrease coverage by 0.00%.
The diff coverage is n/a.

@@                   Coverage Diff                   @@
##           gh/ezyang/1067/base   #64019      +/-   ##
=======================================================
- Coverage                66.85%   66.85%   -0.01%     
=======================================================
  Files                      695      695              
  Lines                    90759    90759              
=======================================================
- Hits                     60674    60673       -1     
- Misses                   30085    30086       +1     
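The delta in the table follows from its hit counts; a quick sanity check with plain arithmetic (this is not Codecov's algorithm, just the raw percentages):

```python
# Coverage numbers taken from the Codecov table above.
lines = 90759
hits_before, hits_after = 60674, 60673

cov_before = 100 * hits_before / lines   # ~66.8518%
cov_after = 100 * hits_after / lines     # ~66.8507%
delta = cov_after - cov_before           # ~-0.0011 percentage points

print(f"{cov_before:.2f}% -> {cov_after:.2f}% (delta {delta:+.4f} pp)")
```

Losing a single covered line out of 90,759 shifts coverage by roughly a thousandth of a percentage point, small enough that the headline ("0.00%") and the delta column ("-0.01%") round the same change differently.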

@facebook-github-bot
Contributor

@ezyang merged this pull request in b23e4f6.

4 participants