propagate "attention_mask" dtype for "use_past" in OnnxConfig.generate_dummy_inputs by arampacha · Pull Request #17105 · huggingface/transformers

Conversation

@arampacha
Contributor

What does this PR do?

Fixes #16538

The mask_dtype is propagated to the torch.ones() call that produces the "attention_mask" for past_key_values in generate_dummy_inputs. This ensures the input datatype expected by the ONNX model matches the default "attention_mask" dtype.
The fix is applied to all configs where this pattern was used (see the sketch below).
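
For illustration, a minimal sketch of the pattern being fixed (shapes and the `common_inputs` name are illustrative; see the diff for the exact change):

```python
import torch

# Dummy shapes for illustration only.
batch, seq_len, past_key_values_length = 2, 8, 3
common_inputs = {"attention_mask": torch.ones(batch, seq_len, dtype=torch.int64)}

# Before the fix, torch.ones(...) defaulted to float32, so concatenating it
# onto the int64 mask promoted the whole mask to float32, and the export then
# declared a float attention_mask input instead of the expected int64.
# The fix reads the dtype from the existing mask and propagates it:
mask_dtype = common_inputs["attention_mask"].dtype
common_inputs["attention_mask"] = torch.cat(
    [
        common_inputs["attention_mask"],
        torch.ones(batch, past_key_values_length, dtype=mask_dtype),
    ],
    dim=1,
)
assert common_inputs["attention_mask"].dtype == torch.int64
```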

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

The existing tests use the *OnnxConfig.generate_dummy_inputs method to produce the inputs passed to session.run(...). For this reason the issue was not caught during testing: the same inputs are used for both export and testing. I'm not sure whether specific tests for the input datatypes are required.
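
If such a dtype check were added, it could look roughly like this (a hypothetical sketch, not an existing test; the model path is an assumption):

```python
import onnxruntime

# Hypothetical check that the exported graph declares integer inputs.
onnx_model_path = "model.onnx"  # assumed path to a causal-lm-with-past export
session = onnxruntime.InferenceSession(onnx_model_path)
declared = {inp.name: inp.type for inp in session.get_inputs()}

# Both the token ids and the mask should be int64, matching what
# generate_dummy_inputs (and real tokenizers) produce by default.
assert declared["input_ids"] == "tensor(int64)"
assert declared["attention_mask"] == "tensor(int64)"
```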

Here is a notebook for verifying the fix works as expected.

Who can review?

@lewtun

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented May 5, 2022

The documentation is not available anymore as the PR was closed or merged.

@lewtun lewtun requested a review from michaelbenayoun May 6, 2022 10:42
@lewtun lewtun (Member) left a comment

Thanks for improving all these types for the attention masks @arampacha!

It looks good to me, so gently pinging @patil-suraj and @sgugger for their perspective :)

For context for the reviewers: these changes ensure that the data types of input_ids and attention_mask are the same (i.e. ints) when these models are exported to ONNX.
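
To make the "i.e. ints" concrete, tokenizers return int64 tensors for both fields by default (a quick illustrative check; gpt2 is just an example checkpoint):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
encoded = tokenizer("hello world", return_tensors="pt")

# Both default to int64, so the exported graph must declare the
# attention_mask input as int64 as well.
print(encoded["input_ids"].dtype)       # torch.int64
print(encoded["attention_mask"].dtype)  # torch.int64
```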

@sgugger sgugger (Collaborator) left a comment

Thanks for fixing!

@patil-suraj patil-suraj (Contributor) left a comment

Thanks for the fix!

@sgugger sgugger merged commit 0645b07 into huggingface:main May 11, 2022
@arampacha arampacha deleted the causal-lm-with-past-onnx-config branch May 11, 2022 11:54
ArthurZucker pushed a commit to ArthurZucker/transformers that referenced this pull request May 12, 2022
…e_dummy_inputs (huggingface#17105)

* propagate attention_mask dtype

* fixup&style
elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022
…e_dummy_inputs (huggingface#17105)

* propagate attention_mask dtype

* fixup&style


Development

Successfully merging this pull request may close these issues.

ONNX causal-lm-with-past conversion: attention_mask dtype changed
