set eos_token_id to None to generate until max length #16989

ydshieh · 2022-04-28T12:42:07Z

What does this PR do?

Update check_encoder_decoder_model_generate to generate until max length.
Otherwise, this check

self.assertEqual(generated_output.shape, (input_ids.shape[0],) + (decoder_config.max_length,))

might fail.

Remark

In generate(), we have

transformers/src/transformers/generation_utils.py

Lines 1129 to 1133 in dced262

    
           eos_token_id = eos_token_id if eos_token_id is not None else self.config.eos_token_id 
        
           if eos_token_id is None and hasattr(self.config, "decoder"): 
        
               eos_token_id = self.config.decoder.eos_token_id

So I think the (original) logic about Generate until max length in check_encoder_decoder_model_generate should be updated too. The case won't really happen in the tests, but in general, config might still have eos_token_id.

I also leave the corresponding flax tests untouched for now.

This PR will fix

FAILED tests/models/vision_encoder_decoder/test_modeling_vision_encoder_decoder.py::Swin2BartModelTest::test_encoder_decoder_model_generate

tests/vision_encoder_decoder/test_modeling_vision_encoder_decoder.py:280: in check_encoder_decoder_model_generate
self.assertEqual(generated_output.shape, (inputs.shape[0],) + (decoder_config.max_length,))
AssertionError: torch.Size([13, 2]) != (13, 20)

…r_decoder_model_generate

HuggingFaceDocBuilderDev · 2022-04-28T12:58:45Z

The documentation is not available anymore as the PR was closed or merged.

patrickvonplaten

Thanks!

gante

👍

) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

set eos_token_id to None to generate until max length in check_encode…

834be7d

…r_decoder_model_generate

ydshieh changed the title ~~set eos_token_id to None to generate until max length in check_encode…~~ set eos_token_id to None to generate until max length Apr 28, 2022

ydshieh requested review from gante and patrickvonplaten April 28, 2022 12:42

patrickvonplaten approved these changes Apr 28, 2022

View reviewed changes

gante approved these changes Apr 28, 2022

View reviewed changes

ydshieh merged commit 5af5735 into huggingface:main Apr 28, 2022

ydshieh deleted the fix_check_encoder_decoder_model_generate branch April 28, 2022 17:47

elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022

set eos_token_id to None to generate until max length (huggingface#16989

4953c37

) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

set eos_token_id to None to generate until max length #16989

set eos_token_id to None to generate until max length #16989

Uh oh!

ydshieh commented Apr 28, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 28, 2022 •

edited

Loading

Uh oh!

patrickvonplaten left a comment

Uh oh!

gante left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	eos_token_id = eos_token_id if eos_token_id is not None else self.config.eos_token_id

	if eos_token_id is None and hasattr(self.config, "decoder"):
	eos_token_id = self.config.decoder.eos_token_id

set eos_token_id to None to generate until max length #16989

set eos_token_id to None to generate until max length #16989

Uh oh!

Conversation

ydshieh commented Apr 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Remark

Uh oh!

HuggingFaceDocBuilderDev commented Apr 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

patrickvonplaten left a comment

Choose a reason for hiding this comment

Uh oh!

gante left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Apr 28, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Apr 28, 2022 •

edited

Loading