Fix code examples for doctests #17013

stevhliu · 2022-04-29T20:03:56Z

This PR fixes some code examples to pass the doctests for the pipeline and AutoClass tutorials.

I was unable to pass the audio code examples on my local machine because soundfile is not supported on M1 yet. I was able to run and reproduce the code snippets in Colab though so I think they should also pass on the CI.

HuggingFaceDocBuilderDev · 2022-04-29T20:18:23Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks for fixing!

sgugger · 2022-04-29T20:57:58Z

docs/source/en/pipeline_tutorial.mdx

+>>> print(preds)
+[{'score': 0.4403, 'label': 'lynx, catamount'}, {'score': 0.0343, 'label': 'cougar, puma, catamount, mountain lion, painter, panther, Felis concolor'}, {'score': 0.0321, 'label': 'snow leopard, ounce, Panthera uncia'}, {'score': 0.0235, 'label': 'Egyptian cat'}, {'score': 0.023, 'label': 'tiger cat'}]


Can we show the results as above? The doc test will pass if your replace any space by a new line, it only cares that there is whitespace.

docs/source/en/pipeline_tutorial.mdx

ydshieh · 2022-04-30T10:50:45Z

Thanks for the work.

Other than the >>> things, there are 2 failures when I ran it.

For audio pipeline:

Expected:
    [{'label': 'calm', 'score': 0.1315},
     {'label': 'neutral', 'score': 0.1307},
     {'label': 'sad', 'score': 0.1274},
     {'label': 'fearful', 'score': 0.1261},
     {'label': 'happy', 'score': 0.1242}]
Got:
    [{'score': 0.1315, 'label': 'calm'}, {'score': 0.1307, 'label': 'neutral'}, {'score': 0.1274, 'label': 'sad'}, {'score': 0.1261, 'label': 'fearful'}, {'score': 0.1242, 'label': 'happy'}]

(this is just a format issue I think)

For vision pipeline:

Expected:
    [{'score': 0.4403, 'label': 'lynx, catamount'}, {'score': 0.0343, 'label': 'cougar, puma, catamount, mountain lion, painter, panther, Felis concolor'}, {'score': 0.0321, 'label': 'snow leopard, ounce, Panthera uncia'}, {'score': 0.0235, 'label': 'Egyptian cat'}, {'score': 0.023, 'label': 'tiger cat'}]
Got:
    [{'score': 0.4335, 'label': 'lynx, catamount'}, {'score': 0.0348, 'label': 'cougar, puma, catamount, mountain lion, painter, panther, Felis concolor'}, {'score': 0.0324, 'label': 'snow leopard, ounce, Panthera uncia'}, {'score': 0.0239, 'label': 'Egyptian cat'}, {'score': 0.0229, 'label': 'tiger cat'}]

~~(This might be due to some random ops. I remembered I have similar situations before. I can take a look too.)~~
I get deterministic results, which is on Ubuntu 20.04. It's not very clear why the result is different than the previous one in the doc. I also get the same results on my local Windows machine. Maybe we could just update the values, cc @sgugger?

sgugger · 2022-05-02T11:51:36Z

I don't know why you ask me @ydshieh this is not my PR ;-)

ydshieh · 2022-05-02T12:09:41Z

I don't know why you ask me @ydshieh this is not my PR ;-)

I know. Just to make sure you are also fine with my suggestion about just update the values. But I guess I should be more confident 😄

…gface#17007) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

…gface#16999)

…e#16974)

…gface#16893)

) This reverts commit 4f3a14e.

) * Replace dict/BatchEncoding instance checks by Mapping * Typo

* Result of new doc style with fixes * Add last two files * Bump hf-doc-builder

* Add the check * add missing ckpts * add a list to ignore * call the added check script * better regex pattern Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Add translating guide

* Add meta proxy * Uses meta data to trace data dependent control-flow * Remove commented class * Handles torch creating functions * Added type annotation to fix tracing * Tracing works for everything but T5 and GPT-J * Almost all previously supported models pass * All architectures can be traced except T5 * Intermediate commit to have a trace of the comparison operators for HFProxy * Everything works, except loss computation * Everything works * Removed unused import * Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version * Fix torch_matmul_override * Change attributes reference to deepcopy * Remove breakpoint and add torch_index_override * Small fix * Fix typo * Replace asserts by explicit exceptions

…#17036) * [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module` * [FlaxEncoderDecoder] Fix bug in `decoder_module`

* add torch.no_grad when in eval mode * make style quality

* Clean up tests * Make fixup Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

… easy overriding (huggingface#17043)

* Propagate and fix imports

…uggingface#17046) * Update all examples to properly calculate progress bar

* First draft * Add YolosForObjectDetection * Make forward pass work * Add mid position embeddings * Add interpolation of position encodings * Add expected values * Add YOLOS to tests * Add integration test * Support tiny model as well * Support all models in conversion script * Remove mid_pe_size attribute * Make more tests pass * Add model to README and fix config * Add copied from statements * Rename base_model_prefix to vit * Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP * Apply suggestions from code review * Apply more suggestions from code review * Convert remaining checkpoints * Improve docstrings * Add YolosFeatureExtractor * Add feature extractor to docs * Add corresponding tests * Fix style * Fix docs * Apply suggestion from code review * Fix bad rebase * Fix some more bad rebase * Fix missing character * Improve docs and variable names Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* Make sacremoses optional * Pickle

* Clean up setup.py * Trigger CI * Upgrade Python used

huggingface#16990) * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * correct t5 tokenizer * correct t5 tokenizer * fix test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * finish Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [FlaxBert] Add ForCausalLM * make style * fix output attentions * Add RobertaForCausalLM * remove comment * fix fx-to-pt model loading * remove comment * add modeling tests * add enc-dec model tests * add big_bird * add electra * make style * make repo-consitency * add to docs * remove roberta test * quality * amend cookiecutter * fix attention_mask bug in flax bert model tester * tighten pt-fx thresholds to 1e-5 * add 'copied from' statements * amend 'copied from' statements * amend 'copied from' statements * quality

* move test model folders (TODO: fix imports and others) * fix (potentially partially) imports (in model test modules) * fix (potentially partially) imports (in tokenization test modules) * fix (potentially partially) imports (in feature extraction test modules) * fix import utils.test_modeling_tf_core * fix path ../fixtures/ * fix imports about generation.test_generation_flax_utils * fix more imports * fix fixture path * fix get_test_dir * update module_to_test_file * fix get_tests_dir from wrong transformers.utils * update config.yml (CircleCI) * fix style * remove missing imports * update new model script * update check_repo * update SPECIAL_MODULE_TO_TEST_MAP * fix style * add __init__ * update self-scheduled * fix add_new_model scripts * check one way to get location back * python setup.py build install * fix import in test auto * update self-scheduled.yml * update slack notification script * Add comments about artifact names * fix for yolos Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Make Trainer compatible with sharded checkpoints * Add doc

…17055) * Fix RNG reload in resume training from epoch checkpoint * Fix test

…er (huggingface#16894)

📝 add doctests

413ebb1

🖍 make style

f194e40

stevhliu requested review from sgugger and ydshieh April 29, 2022 20:53

sgugger approved these changes Apr 29, 2022

View reviewed changes

stevhliu and others added 21 commits May 3, 2022 13:36

🖍 apply yih-dar review

423d7d8

Update README to latest release (huggingface#16997)

2edf4fa

use scale=1.0 in floats_tensor called in speech model testers (huggin…

db85031

…gface#17007) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Update all require decorators to use skipUnless when possible (huggin…

bfbcd5f

…gface#16999)

TF: XLA bad words logits processor and list of processors (huggingfac…

310ba18

…e#16974)

Make create_extended_attention_mask_for_decoder static method (huggin…

2a330e6

…gface#16893)

Update README_zh-hans.md (huggingface#16977)

d4e636c

Updating variable names. (huggingface#16445)

6fac074

Revert "Updating variable names. (huggingface#16445)" (huggingface#17011

4694766

) This reverts commit 4f3a14e.

Replace dict/BatchEncoding instance checks by Mapping (huggingface#17014

52d8c08

) * Replace dict/BatchEncoding instance checks by Mapping * Typo

Result of new doc style with fixes (huggingface#17015)

e64a6fa

* Result of new doc style with fixes * Add last two files * Bump hf-doc-builder

Add a check on config classes docstring checkpoints (huggingface#17012)

dbb67d2

* Add the check * add missing ckpts * add a list to ignore * call the added check script * better regex pattern Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Add translating guide (huggingface#17004)

7578824

* Add translating guide

update docs of length_penalty (huggingface#17022)

2f454f4

[FlaxGenerate] Fix bug in decoder_start_token_id (huggingface#17035)

fa07659

Fix style

4c213ba

[Flax(Speech)EncoderDecoder] Fix bug in decoder_module (huggingface…

de20bfb

…#17036) * [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module` * [FlaxEncoderDecoder] Fix bug in `decoder_module`

Fix typo in RetriBERT docstring (huggingface#17018)

3dba687

add torch.no_grad when in eval mode (huggingface#17020)

eb11d6e

* add torch.no_grad when in eval mode * make style quality

Disable Flax GPU tests on push (huggingface#17042)

f48684b

NielsRogge and others added 16 commits May 3, 2022 14:12

Clean up vision tests (huggingface#17024)

c1e90b7

* Clean up tests * Make fixup Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

[Trainer] Move logic for checkpoint loading into separate methods for…

c190e2b

… easy overriding (huggingface#17043)

Update no_trainer examples to use new logger (huggingface#17044)

abb74da

* Propagate and fix imports

Fix no_trainer examples to properly calculate the number of samples (h…

f923885

…uggingface#17046) * Update all examples to properly calculate progress bar

Allow all imports from transformers (huggingface#17050)

82507cc

Make the sacremoses dependency optional (huggingface#17049)

08f5ced

* Make sacremoses optional * Pickle

Clean up setup.py (huggingface#17045)

fc9d25e

* Clean up setup.py * Trigger CI * Upgrade Python used

Make Trainer compatible with sharded checkpoints (huggingface#17053)

0d069b8

* Make Trainer compatible with sharded checkpoints * Add doc

Remove Python and use v2 action (huggingface#17059)

02e2828

Fix RNG reload in resume training from epoch checkpoint (huggingface#…

ced44b3

…17055) * Fix RNG reload in resume training from epoch checkpoint * Fix test

Remove fetch in model templates test

669bfc1

Remove device parameter from create_extended_attention_mask_for_decod…

4dbd9ef

…er (huggingface#16894)

stevhliu mentioned this pull request May 3, 2022

Fix pipeline doctests #17073

Merged

stevhliu closed this May 3, 2022

stevhliu deleted the add-doctests branch May 4, 2022 00:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix code examples for doctests #17013

Fix code examples for doctests #17013

Uh oh!

stevhliu commented Apr 29, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Apr 29, 2022 •

edited

Loading

Uh oh!

sgugger left a comment

Uh oh!

sgugger Apr 29, 2022

Uh oh!

Uh oh!

ydshieh commented Apr 30, 2022 •

edited

Loading

Uh oh!

sgugger commented May 2, 2022

Uh oh!

ydshieh commented May 2, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

19 participants

		>>> print(preds)
		[{'score': 0.4403, 'label': 'lynx, catamount'}, {'score': 0.0343, 'label': 'cougar, puma, catamount, mountain lion, painter, panther, Felis concolor'}, {'score': 0.0321, 'label': 'snow leopard, ounce, Panthera uncia'}, {'score': 0.0235, 'label': 'Egyptian cat'}, {'score': 0.023, 'label': 'tiger cat'}]

Fix code examples for doctests #17013

Fix code examples for doctests #17013

Uh oh!

Conversation

stevhliu commented Apr 29, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Apr 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

sgugger Apr 29, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ydshieh commented Apr 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger commented May 2, 2022

Uh oh!

ydshieh commented May 2, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

19 participants

HuggingFaceDocBuilderDev commented Apr 29, 2022 •

edited

Loading

ydshieh commented Apr 30, 2022 •

edited

Loading