[optim] Allow torch.float64 scalars for forloop + foreach implementations #115841
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/115841
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 8260136 with merge base 0978482.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```python
def _get_scalar_dtype(is_fused=None):
    if is_fused:
        return torch.float32
    return torch.float64 if torch.get_default_dtype() == torch.float64 else torch.float32
```
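To illustrate how the helper above behaves, here is a rough sketch; `_get_scalar_dtype` is a private utility, so the import path shown is an assumption and may differ across versions:

```python
import torch
from torch.optim.adam import _get_scalar_dtype  # assumed location of the helper above

torch.set_default_dtype(torch.float32)
print(_get_scalar_dtype())               # torch.float32
print(_get_scalar_dtype(is_fused=True))  # torch.float32 (fused keeps a float32 step)

torch.set_default_dtype(torch.float64)
print(_get_scalar_dtype())               # torch.float64 (forloop/foreach follow the default dtype)
print(_get_scalar_dtype(is_fused=True))  # torch.float32 (fused still pins the step to float32)
```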
I don't remember the story for default dtype vs default Tensor type. Do we need to check both here?
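(For reference, a minimal sketch of the two settings in question, using current public APIs; this is illustrative context rather than part of the PR:)

```python
import torch

# The default dtype only controls the floating-point type of newly created tensors.
torch.set_default_dtype(torch.float64)
print(torch.get_default_dtype())  # torch.float64
print(torch.empty(1).dtype)       # torch.float64

# The older default *tensor type* API also implies a default dtype (and can pin a device).
torch.set_default_tensor_type(torch.DoubleTensor)
print(torch.get_default_dtype())  # torch.float64
```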
…implementations" Should allow for uses cases mentioned in #110940 This would allow scalars to also be float64s in the foreach implementation. The fused implementation would still create a float32 step on Adam and AdamW. This PR also does NOT worry about performance and is mainly for enablement. Next steps: - Relax the constraint on fused adam(w) and allow torch.float64 scalars there - Allow _performant_ mixed dtypes in foreach (a bigger project in itself). This PR will conflict with my other PRs, I will figure out a landing order [ghstack-poisoned]
Sounds good!
…implementations" Should allow for uses cases mentioned in #110940 This would allow scalars to also be float64s in the foreach implementation. The fused implementation would still create a float32 step on Adam and AdamW. This PR also does NOT worry about performance and is mainly for enablement. Next steps: - Relax the constraint on fused adam(w) and allow torch.float64 scalars there - Allow _performant_ mixed dtypes in foreach (a bigger project in itself). This PR will conflict with my other PRs, I will figure out a landing order [ghstack-poisoned]
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions or feedback? Please reach out to the PyTorch DevX Team.

Should allow for use cases mentioned in #110940
This would allow scalars to also be float64 in the foreach implementation. The fused implementation would still create a float32 step on Adam and AdamW. This PR also does NOT worry about performance and is mainly for enablement.
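For example, a minimal sketch of the use case this enables (the model and hyperparameters here are illustrative, not from the PR):

```python
import torch

torch.set_default_dtype(torch.float64)

model = torch.nn.Linear(4, 2)  # parameters are created as float64 under the new default
opt = torch.optim.Adam(model.parameters(), lr=1e-3, foreach=True)

loss = model(torch.randn(8, 4)).sum()
loss.backward()
opt.step()  # optimizer scalars (e.g. the step counter) may now be float64 rather than forced to float32
```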
Next steps:
- Relax the constraint on fused adam(w) and allow torch.float64 scalars there
- Allow _performant_ mixed dtypes in foreach (a bigger project in itself)
This PR will conflict with my other PRs; I will figure out a landing order.
Stack from ghstack (oldest at bottom):