KEMBAR78
[doc][hackathon] To add AdamW Optimizer to the documentation by iramazanli · Pull Request #63252 · pytorch/pytorch · GitHub
Skip to content

Conversation

@iramazanli
Copy link
Contributor

@iramazanli iramazanli commented Aug 13, 2021

It has been discussed before that adding description of Optimization algorithms to PyTorch Core documentation may result in a nice Optimization research tutorial. In the following tracking issue we mentioned about all the necessary algorithms and links to the originally published paper #63236.

In this PR we are adding description of AdamW Algorithm to the documentation. For more details, we refer to the paper here https://arxiv.org/abs/1711.05101

AdamWalgo

cc @vincentqb @iramazanli

@facebook-github-bot
Copy link
Contributor

facebook-github-bot commented Aug 13, 2021

🔗 Helpful links

💊 CI failures summary and remediations

As of commit 9671c27 (more details on the Dr. CI page):


  • 1/1 failures introduced in this PR

1 failure not recognized by patterns:

Job Step Action
GitHub Actions linux-xenial-py3.6-gcc5.4 / build-docs (cpp) Unknown 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

@iramazanli iramazanli force-pushed the adamw_algorithm_doc branch 4 times, most recently from 128aaea to d985b28 Compare August 15, 2021 17:33
@iramazanli iramazanli requested a review from albanD August 15, 2021 18:13
@iramazanli iramazanli force-pushed the adamw_algorithm_doc branch from d985b28 to fd9304f Compare August 24, 2021 18:21
@codecov
Copy link

codecov bot commented Aug 24, 2021

Codecov Report

Merging #63252 (9671c27) into master (92318a9) will decrease coverage by 0.11%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #63252      +/-   ##
==========================================
- Coverage   66.76%   66.65%   -0.12%     
==========================================
  Files         710      710              
  Lines       92354    92395      +41     
==========================================
- Hits        61658    61582      -76     
- Misses      30696    30813     +117     

@iramazanli iramazanli force-pushed the adamw_algorithm_doc branch 3 times, most recently from a711fbe to 409c1e7 Compare August 27, 2021 22:50
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is that correct? We don't actually do weight decay this way for this one no?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, you're completely right !

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where is the \lambda \theta_{t-1} coming from?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think it shouldn't exist, you're right again !

@iramazanli iramazanli force-pushed the adamw_algorithm_doc branch 5 times, most recently from 45cdbf8 to 7b1b7ea Compare September 8, 2021 20:36
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is missing a multiplication by \gamma no?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, i added it. thanks for pointing it out

Copy link
Collaborator

@albanD albanD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok

@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@iramazanli merged this pull request in 5b21f17.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants