[Doctests] Fix all T5 doc tests #16646

patrickvonplaten · 2022-04-07T10:04:33Z

What does this PR do?

Corrects the T5 model docs and adds them to the doc tests.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2022-04-07T10:33:24Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks for fixing those. The examples are still a bit arcane, with special values hard-coded in the middle. Could you explain those a little better?

sgugger · 2022-04-07T11:48:33Z

docs/source/en/model_doc/byt5.mdx

+>>> input_ids = (
+...     torch.tensor([list("Life is like a box of chocolates.".encode("utf-8"))]) + 3
+... )  # add 3 for special tokens
+>>> labels = (
+...     torch.tensor([list("La vie est comme une boîte de chocolat.".encode("utf-8"))]) + 3
+... )  # add 3 for special tokens


Can the comment go once above to avoid the formatting on several lines? And also maybe be more helpful because I have no idea what "add 3 for special tokens" means.
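
For readers following the thread, the offset can be illustrated without loading a model. The sketch below assumes that ByT5's vocabulary reserves ids 0, 1, and 2 for the padding, end-of-sequence, and unknown tokens, so every UTF-8 byte value is shifted up by 3. The helper names `encode_bytes` and `decode_bytes` are hypothetical and not part of the library.

```python
from typing import List

# Assumption: ids 0, 1, 2 are reserved for special tokens,
# so raw byte values 0..255 map to token ids 3..258.
NUM_SPECIAL_TOKENS = 3


def encode_bytes(text: str) -> List[int]:
    """Map each UTF-8 byte of `text` to a token id by adding the offset."""
    return [b + NUM_SPECIAL_TOKENS for b in text.encode("utf-8")]


def decode_bytes(ids: List[int]) -> str:
    """Undo the offset and decode the resulting bytes back into a string."""
    return bytes(i - NUM_SPECIAL_TOKENS for i in ids).decode("utf-8")


input_ids = encode_bytes("Life is like a box of chocolates.")
labels = encode_bytes("La vie est comme une boîte de chocolat.")
```

Note that `labels` has one more id than the French sentence has characters, because "î" occupies two bytes in UTF-8.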

sgugger · 2022-04-07T11:49:34Z

docs/source/en/model_doc/byt5.mdx

+>>> # Now Mask
+>>> # Note that we can add "{extra_id_...}" to the string directly
+>>> # as the Byte tokenizer would incorrectly merge the tokens
+>>> # We need to work on the character level directly here
+>>> # => mask to "The dog [258]a ball [257]park."


We can or we can't? I don't understand this comment and why it results in using 258 and 257.

Good point - Added more explanation!
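
To make the ids in this thread concrete: with 256 byte values shifted by 3, 258 and 257 are the two highest byte ids (255 + 3 and 254 + 3). The following is a minimal sketch of masking on the byte level, assuming those ids act as the mask tokens as the quoted comment suggests. The sentence, the span boundaries, and the helper `mask_spans` are illustrative assumptions, not the code from the PR.

```python
from typing import List, Tuple

OFFSET = 3  # assumption: ids 0..2 are reserved for special tokens


def mask_spans(
    text: str, spans: List[Tuple[int, int]], first_sentinel: int = 258
) -> List[int]:
    """Replace each (start, end) byte span with a single sentinel id.

    Spans must be sorted and non-overlapping. Sentinel ids count down
    from `first_sentinel` (258, 257, ...).
    """
    ids = [b + OFFSET for b in text.encode("utf-8")]
    out: List[int] = []
    cursor, sentinel = 0, first_sentinel
    for start, end in spans:
        out.extend(ids[cursor:start])  # keep the bytes before the span
        out.append(sentinel)           # one sentinel stands in for the span
        sentinel -= 1
        cursor = end
    out.extend(ids[cursor:])           # keep the bytes after the last span
    return out


text = "The dog chases a ball in the park."
# Mask "chases" (bytes 8..13) and " in the" (bytes 21..27).
masked = mask_spans(text, [(8, 14), (21, 28)])
```

The sentinels are inserted as ids rather than as text such as "{extra_id_...}" because anything written into the string would itself be split into bytes.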


* [Doctests] Fix all T5 doc tests
* make style
* Update docs/source/en/model_doc/t5.mdx
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply Sylvains comments
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

[Doctests] Fix all T5 doc tests

b79e50c

This was referenced Apr 7, 2022

Improving T5 Docs #16614

Closed

How to do mask prediction with ByT5? #16303

Closed

make style

841ff74

patrickvonplaten requested review from NielsRogge and sgugger April 7, 2022 10:15

sgugger approved these changes Apr 7, 2022


patrickvonplaten and others added 3 commits April 12, 2022 17:48

Update docs/source/en/model_doc/t5.mdx

5fbd7e9

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Apply Sylvains comments

102d300

Merge branch 'correct_t5_model_docs' of https://github.com/patrickvon…

294f9a0

…platen/transformers into correct_t5_model_docs

patrickvonplaten commented Apr 12, 2022


patrickvonplaten added 2 commits April 12, 2022 18:03

Apply suggestions from code review

d74ae86

Merge branch 'main' into correct_t5_model_docs

f296606

patrickvonplaten merged commit b24201f into huggingface:main Apr 13, 2022

patrickvonplaten deleted the correct_t5_model_docs branch April 13, 2022 09:36
