Add self training code for text classification #16738

tuvuumass · 2022-04-12T21:34:40Z

This is an implementation of the self-training algorithm (without task augmentation) for classification tasks proposed in the EMNLP 2021 paper: STraTA: Self-Training with Task Augmentation for Better Few-shot Learning. For the original codebase, please check out https://github.com/google-research/google-research/tree/master/STraTA. Note that this code can be used as a tool for automatic data labeling.

The pull request includes a README.md file with detailed instructions on how to set up a virtual environment and install necessary packages. It also includes a demo run.sh on how to perform self-training with a BERT Base model on the SciTail science entailment dataset using 8 labeled examples per class.

HuggingFaceDocBuilderDev · 2022-04-12T21:49:55Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Very nice, thanks a lot for adding this new example! Just to be sure, the empty strata file is intended? I didn't get why it's there.

tuvuumass · 2022-04-13T15:54:05Z

Very nice, thanks a lot for adding this new example! Just to be sure, the empty strata file is intended? I didn't get why it's there.

Good catch, @sgugger. Just removed the empty strata file. Thanks!

* Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Add self-training code for text-classification * Delete strata

tuvuumass added 2 commits April 12, 2022 17:18

Add self-training code for text-classification

ff36b04

Add self-training code for text-classification

768f782

tuvuumass mentioned this pull request Apr 12, 2022

Is it fine if we do not pass the optimizer through accelerator.prepare() in DDP? #15656

Closed

tuvuumass added 3 commits April 12, 2022 17:51

Add self-training code for text-classification

7491c3d

Add self-training code for text-classification

bd0a2ce

Add self-training code for text-classification

7a8905e

sgugger approved these changes Apr 13, 2022

View reviewed changes

Delete strata

0fc63c9

sgugger merged commit 34ef029 into huggingface:main Apr 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add self training code for text classification #16738

Add self training code for text classification #16738

Uh oh!

tuvuumass commented Apr 12, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 12, 2022 •

edited

Loading

Uh oh!

sgugger left a comment

Uh oh!

tuvuumass commented Apr 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add self training code for text classification #16738

Add self training code for text classification #16738

Uh oh!

Conversation

tuvuumass commented Apr 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

tuvuumass commented Apr 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tuvuumass commented Apr 12, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Apr 12, 2022 •

edited

Loading