-
Notifications
You must be signed in to change notification settings - Fork 25.7k
To add Adamax algorithm to documentation #63903
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 6aada3f (more details on the Dr. CI page):
🕵️ 2 new failures recognized by patternsThe following CI failures do not appear to be due to upstream breakages:
|
7d4f467 to
8391319
Compare
Codecov Report
@@ Coverage Diff @@
## master #63903 +/- ##
=======================================
Coverage 66.81% 66.82%
=======================================
Files 695 698 +3
Lines 90845 90881 +36
=======================================
+ Hits 60701 60733 +32
- Misses 30144 30148 +4 |
23ac602 to
f485036
Compare
f485036 to
088903a
Compare
torch/optim/adamax.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The code actually adds an epsilon to the absolute value no?
088903a to
3f765ae
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
3f765ae to
6aada3f
Compare
|
@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
|
@iramazanli merged this pull request in 39ce801. |
It has been discussed before that adding description of Optimization algorithms to PyTorch Core documentation may result in a nice Optimization research tutorial. In the following tracking issue we mentioned about all the necessary algorithms and links to the originally published paper #63236.
In this PR we are adding description of Adamax Algorithm to the documentation. For more details, we refer to the paper https://arxiv.org/abs/1412.6980