Add semantic segmentation example script, no trainer #16630
Conversation
…ielsRogge/transformers into add_semantic_script_no_trainer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
This reverts commit b1a7dfe.
* refactor TF beam search
* refactored generate can now properly use attention masks
* add force bos/eos logit processors
* Update modeling_mpnet.py
* Update modeling_ctrl.py
* Formatting
* Annotated FSMT
* Added annotations for LED
* Added annotations for M2M
* Added annotations for Nystromformer
* Added annotations for OpenAI
* Added annotations for RAG
* Removed unused imports
* Fix isort errors
* Removed inputs_embeds docstring, corrected original
* flake8 fixes
* doc-builder fixes
Adds logging and save/loading to the Accelerate scripts (huggingface#16617)
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix doc
* Make fixup
Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>
Thanks @NielsRogge!
I added a few suggestions based on the recent upgrades to Accelerate and the new script changes that got pushed.
The other two capabilities we're trying to include in the scripts are resuming from a checkpoint, as well as save_state by batch or by epoch (this is just torch.save, so slightly different from the transformers save; pick your poison on those, I think).
Nice start!
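For reference, a minimal sketch of what per-step or per-epoch checkpointing with Accelerate's `save_state`/`load_state` could look like (the flag names `checkpointing_steps` and `resume_from_checkpoint` are illustrative, and `completed_steps` comes from the script's training loop):

```python
import os

# Save the full training state (model, optimizer, RNG, ...). This uses
# torch.save under the hood, so it is restored with load_state, not
# with from_pretrained.
if args.checkpointing_steps == "epoch":
    accelerator.save_state(os.path.join(args.output_dir, f"epoch_{epoch}"))
elif args.checkpointing_steps is not None and completed_steps % int(args.checkpointing_steps) == 0:
    accelerator.save_state(os.path.join(args.output_dir, f"step_{completed_steps}"))

# On startup, optionally resume from a previously saved state:
if args.resume_from_checkpoint:
    accelerator.load_state(args.resume_from_checkpoint)
```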
* Add inputs vector to calculate metric method
* Include inputs for evaluation metrics with backwards compatibility
* Prevent inputs from creating OOM issues and add documentation details
* Update style and code documentation
* Fix style formatting issues
* Update files format with make style
Very nice addition! Agreed with all the changes Zach suggested, as Accelerate can now handle the logging for you, and the scheduler should be passed along to `prepare` for better support with DeepSpeed.
This will make your example rely on Accelerate master for a couple of weeks until the next release (probably next week), so you should add a warning in the README to install Accelerate from source. I'll put a note to adapt this after Accelerate's next release.
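Put together, the suggested changes would look roughly like this (a sketch assuming Accelerate's tracking API; the tracker name and logged keys are illustrative):

```python
from accelerate import Accelerator

# Let Accelerate manage experiment tracking instead of calling wandb directly.
accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers("semantic_segmentation_no_trainer", config=vars(args))

# Pass the scheduler through prepare() as well, for better DeepSpeed support.
model, optimizer, train_dataloader, eval_dataloader, lr_scheduler = accelerator.prepare(
    model, optimizer, train_dataloader, eval_dataloader, lr_scheduler
)

# Inside the training loop:
accelerator.log({"train_loss": loss.item()}, step=completed_steps)

# After training, flush and close the trackers:
accelerator.end_training()
```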
…ate_dict (huggingface#16643)
* Updated _load_pretrained_model_low_mem to check if keys are in the stored state_dict
* Update after conversions
* Update README.md Support Image: updates the Support image linking to our EAP page (to give it a refresh + help avoid image fatigue). Slack thread checking in with #open-source-internal on this update (https://huggingface.slack.com/archives/C021H1P1HKR/p1648838903316709)
* Compressed updated Support image
* Improved Support image logo + height: updated the image based on logo + size feedback. Big thanks to Bibi for making quick edits to this image.
* base model done
* make style
* done
* added files
* Apply suggestions from code review
* Trigger doc build
* resolved conversations
* seer models
* minor changes
* make fixup
* glob variables
* fix copies
* config when possible
* resolved conflicts
* CI
* conversion script for 10b param
* fixed for 10b model
* minor updates in the doc + make style
* removed unused code
* updated modeling_utils from main
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
* Add TapexTokenizer
* Improve docstrings and provide option to provide answer
* Remove option for pretokenized inputs
* Add TAPEX to README
* Fix copies
* Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification
* Draft a README file for running the script and introducing some background; remove unused code lines in tabfact script; disable the default `pad_to_max_length` option, which is memory-consuming
* Support `as_target_tokenizer` function for TapexTokenizer; fix the do_lower_case behaviour of TapexTokenizer; add unit tests for target scenarios and cased/uncased scenarios for both source and target
* Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function; fix typos in tapex example README
* Fix the evaluation script: remove the property `task_name`
* Make the label space more clear for tabfact tasks
* Use a new fine-tuning script for tapex-base on tabfact
* Remove the lowercase code outside the tokenizer (we use the tokenizer to control whether to do_lower_case); guarantee the hyper-parameters can run without out-of-memory on a 16GB card and report the new reproduced number on wikisql
* Remove the default tokenizer_name option; provide evaluation command
* Support for WikiTableQuestions dataset; fix a typo in README
* Fix the dataset's key name in WikiTableQuestions
* Run make fixup and move test to folder
* Fix quality
* Apply suggestions from code review
* Improve docstrings
* Overwrite failing test
* Improve comment in example scripts
* Fix rebase
* Add TAPEX to Auto mapping
* Add TAPEX to auto config mappings
* Put TAPEX higher than BART in auto mapping
* Add TAPEX to doc tests
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: SivilTaram <qianlxc@outlook.com>
Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
* add vit tf doctest with @add_code_sample_docstrings
* add labels string back in
Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team>
The default value of `padding` in `DataCollatorWithPadding` is `True`, not `False`.
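A quick illustration of that default (a sketch; the checkpoint name is just an example):

```python
from transformers import AutoTokenizer, DataCollatorWithPadding

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
# padding defaults to True: each batch is dynamically padded to its longest member.
collator = DataCollatorWithPadding(tokenizer=tokenizer)
batch = collator([tokenizer("short"), tokenizer("a somewhat longer example")])
print(batch["input_ids"].shape)  # both rows padded to the same length
```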
* fix QA sample
* For TF_QUESTION_ANSWERING_SAMPLE
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* Fixed some bugs involving saving during epochs
* Added tests mimicking the existing examples tests
* Added JSON exporting to all `no_trainer` examples for consistency
* [Trainer] tf32 arg doc
* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* ✨ update audio examples with minds dataset
* 🖍 make style
* 🖍 minor fixes for doctests
I've added all requested changes. As you can see, I'm updating the repo using commits rather than creating subfolders.
```python
# Optionally push to the hub
if args.push_to_hub and accelerator.is_main_process:
    feature_extractor.save_pretrained(args.output_dir)
    repo.push_to_hub(
```
Note that the checkpoint saved here is saved with torch.save and not save_pretrained, so it isn't usable with from_pretrained.
Also, you'll need to adapt the resume-from-checkpoint logic to check out the right branch in the folder if you go this way, since everything is saved at the root of output_dir.
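For context, a minimal sketch of the two saving paths being contrasted here (directory names are illustrative; `accelerator`, `args`, `model`, and `feature_extractor` come from the script):

```python
import os

# 1) Training-state checkpoint: accelerator.save_state uses torch.save under
#    the hood, so it must be restored with accelerator.load_state, not with
#    from_pretrained.
accelerator.save_state(os.path.join(args.output_dir, f"epoch_{epoch}"))

# 2) Hub-ready weights: unwrap the model first, then use save_pretrained so
#    the result can be reloaded with from_pretrained (and pushed to the hub).
unwrapped_model = accelerator.unwrap_model(model)
unwrapped_model.save_pretrained(args.output_dir, save_function=accelerator.save)
feature_extractor.save_pretrained(args.output_dir)
```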
I've removed this logic and updated the script to be aligned with the other example scripts.
Updated the script to create subfolders like the other examples (and these aren't pushed to the hub, based on the gitignore file).

Some issue with git; closing this PR in favor of a new one.
What does this PR do?
This PR adds an example script for fine-tuning any model supported by the `AutoModelForSemanticSegmentation` API on a semantic segmentation dataset, including regularly pushing to the hub during training as well as WandB logging. I switched to using Accelerate, as I had a bug with the Trainer.
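At its core, the script boils down to something like this (a minimal sketch; the checkpoint and label mapping are illustrative, not taken from the script):

```python
from transformers import AutoFeatureExtractor, AutoModelForSemanticSegmentation

checkpoint = "nvidia/mit-b0"  # e.g. a SegFormer encoder pretrained on ImageNet
id2label = {0: "background", 1: "road"}  # your dataset's classes
label2id = {label: idx for idx, label in id2label.items()}

feature_extractor = AutoFeatureExtractor.from_pretrained(checkpoint)
model = AutoModelForSemanticSegmentation.from_pretrained(
    checkpoint, id2label=id2label, label2id=label2id
)
```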
To do: