Fix TF_MASKED_LM_SAMPLE by ydshieh · Pull Request #16698 · huggingface/transformers

Conversation

@ydshieh
Collaborator

@ydshieh ydshieh commented Apr 11, 2022

What does this PR do?

Fix TF_MASKED_LM_SAMPLE: there is currently a dimension issue with mask_token_index and predicted_token_id, which makes the PT and TF masked LM code samples give different results:

PT: paris
TF: p a r i s

See below for details.

(This is related to #16523)
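
To see the shape issue in isolation, here is a minimal sketch on toy ids (hypothetical values, not the real BERT vocabulary):

import tensorflow as tf

mask_token_id = 103  # hypothetical id
input_ids = tf.constant([[5, 7, mask_token_id, 9]])

# main: `tf.where` on the 2-D comparison returns (row, col) pairs,
# and `[0][1]` strips both dimensions down to a 0-d scalar
print(tf.where(input_ids == mask_token_id)[0][1])  # tf.Tensor(2, shape=(), dtype=int64)

# this PR: `tf.where` on the first row alone keeps the row dimension
print(tf.where((input_ids == mask_token_id)[0]))  # tf.Tensor([[2]], shape=(1, 1), dtype=int64)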

PT_MASKED_LM_SAMPLE

from transformers import BertTokenizer, BertForMaskedLM
import torch

mask = "[MASK]",
checkpoint = "bert-base-uncased"

tokenizer = BertTokenizer.from_pretrained(f"{checkpoint}")
model = BertForMaskedLM.from_pretrained(f"{checkpoint}")

inputs = tokenizer(f"The capital of France is {mask}.", return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# retrieve index of {mask}
mask_token_index = (inputs.input_ids == tokenizer.mask_token_id)[0].nonzero(as_tuple=True)[0]
predicted_token_id = logits[0, mask_token_index].argmax(axis=-1)
expected_output = tokenizer.decode(predicted_token_id)


print(mask_token_index)  # tensor([6]): row dimension from `nonzero()`
print(predicted_token_id)  # tensor([3000])
print(expected_output)  # paris
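
For reference, the PyTorch side keeps the row dimension because `nonzero(as_tuple=True)` returns one 1-D index tensor per dimension; a minimal sketch on a toy mask (hypothetical values):

import torch

flags = torch.tensor([False, True, False])
# one index tensor per dimension, so a single match still has shape (1,)
print(flags.nonzero(as_tuple=True))     # (tensor([1]),)
print(flags.nonzero(as_tuple=True)[0])  # tensor([1])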

TF_MASKED_LM_SAMPLE (on main)

from transformers import BertTokenizer, TFBertForMaskedLM
import tensorflow as tf

mask = "[MASK]"
checkpoint = "bert-base-uncased"

tokenizer = BertTokenizer.from_pretrained(checkpoint)
model = TFBertForMaskedLM.from_pretrained(checkpoint)

inputs = tokenizer(f"The capital of France is {mask}.", return_tensors="tf")
logits = model(**inputs).logits

# retrieve index of {mask}
mask_token_index = tf.where(inputs.input_ids == tokenizer.mask_token_id)[0][1]
predicted_token_id = tf.math.argmax(logits[0, mask_token_index], axis=-1)
expected_output = tokenizer.decode(predicted_token_id)

print(mask_token_index)  # tf.Tensor(6, shape=(), dtype=int64): no row dimension
print(predicted_token_id)  # tf.Tensor(3000, shape=(), dtype=int64)
print(tokenizer.decode(predicted_token_id))  # p a r i s (not good)
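
The mangled output comes from the decoding step: with the scalar index above, tokenizer.decode receives a single int rather than a sequence of ids. Using the id reported above:

print(tokenizer.decode([3000]))  # paris
print(tokenizer.decode(3000))    # p a r i s  (scalar path, per the output above)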

TF_MASKED_LM_SAMPLE (this PR)

# (imports, model and `inputs` as in the sample above)

# retrieve index of {mask}
mask_token_index = tf.where((inputs.input_ids == tokenizer.mask_token_id)[0])
selected_logits = tf.gather_nd(logits[0], indices=mask_token_index)
predicted_token_id = tf.math.argmax(selected_logits, axis=-1)
expected_output = tokenizer.decode(predicted_token_id)

print(mask_token_index)  # tf.Tensor([[6]], shape=(1, 1), dtype=int64): with row dimension
print(predicted_token_id)  # tf.Tensor([3000], shape=(1,), dtype=int64)
print(tokenizer.decode(predicted_token_id))  # paris
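
For completeness, `tf.gather_nd` with indices of shape (num_masks, 1) selects one full logit row per masked position, which is what keeps predicted_token_id 1-D; a toy sketch with made-up shapes and values:

import tensorflow as tf

logits_0 = tf.constant([[0.1, 0.9], [0.8, 0.2], [0.3, 0.7]])  # (seq_len, vocab_size), toy values
mask_token_index = tf.constant([[1]], dtype=tf.int64)  # one masked position, row dimension kept

selected_logits = tf.gather_nd(logits_0, indices=mask_token_index)
print(selected_logits.shape)  # (1, 2): one logit row per masked position
print(tf.math.argmax(selected_logits, axis=-1))  # tf.Tensor([0], shape=(1,), dtype=int64)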

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Apr 11, 2022

The documentation is not available anymore as the PR was closed or merged.

@ydshieh ydshieh marked this pull request as ready for review April 11, 2022 11:42
Collaborator

@sgugger sgugger left a comment


LGTM, thanks for fixing!

@ydshieh ydshieh merged commit 40618ec into huggingface:main Apr 11, 2022
@ydshieh ydshieh deleted the fix_tf_masked_lm_code_sample branch April 11, 2022 16:19
elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>