Add `tokenizer_kwargs` argument to the text generation pipeline by Joshua-Chin · Pull Request #40364 · huggingface/transformers

Conversation

@Joshua-Chin (Contributor)

What does this PR do?

This PR adds a tokenizer_kwargs argument to the TextGenerationPipeline, allowing users to pass arbitrary arguments to the tokenizer during preprocessing. In particular, this lets users set chat template arguments, such as the enable_thinking flag for Qwen3 or SmolLM3.
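For illustration, a minimal usage sketch of the proposed API (the checkpoint and prompt are assumptions, and the argument was later renamed to `tokenizer_encode_kwargs` during review, see below):

```python
from transformers import pipeline

# Any chat model whose template supports `enable_thinking` would work;
# Qwen3 is used here as an assumed example.
generator = pipeline("text-generation", model="Qwen/Qwen3-0.6B")

messages = [{"role": "user", "content": "What is 2 + 2?"}]

# `tokenizer_kwargs` is forwarded to the tokenizer during preprocessing,
# so chat template arguments such as `enable_thinking` can be set per call.
result = generator(
    messages,
    max_new_tokens=64,
    tokenizer_kwargs={"enable_thinking": False},
)

# For chat inputs, `generated_text` holds the updated message list.
print(result[0]["generated_text"])
```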

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

@Joshua-Chin (Contributor, Author)

The test failure seems to be an unrelated flake:

self = <test_accelerate_examples.ExamplesTestsNoTrainer testMethod=test_run_swag_no_trainer>

    @mock.patch.dict(os.environ, {"WANDB_MODE": "offline", "DVCLIVE_TEST": "true"})
    def test_run_swag_no_trainer(self):
        tmp_dir = self.get_auto_remove_tmp_dir()
        testargs = f"""
            {self.examples_dir}/pytorch/multiple-choice/run_swag_no_trainer.py
            --model_name_or_path google-bert/bert-base-uncased
            --train_file tests/fixtures/tests_samples/swag/sample.json
            --validation_file tests/fixtures/tests_samples/swag/sample.json
            --output_dir {tmp_dir}
            --max_train_steps=20
            --num_warmup_steps=2
            --learning_rate=2e-4
            --per_device_train_batch_size=2
            --per_device_eval_batch_size=1
            --with_tracking
        """.split()
    
        run_command(self._launch_args + testargs)
        result = get_results(tmp_dir)
>       self.assertGreaterEqual(result["eval_accuracy"], 0.8)
E       AssertionError: 0.4 not greater than or equal to 0.8

examples/pytorch/test_accelerate_examples.py:225: AssertionError

Pushing an empty commit to re-run the CI.

@Joshua-Chin (Contributor, Author)

A disjoint set of tests has failed in the re-run.

@Joshua-Chin (Contributor, Author)

@Rocketknight1 Please review this PR when you have a chance. The CI failures seem to be caused by unrelated, flaky tests.

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from 92b4d49 to 2fe0979 on August 22, 2025 01:03
@Rocketknight1 (Member) left a comment

This LGTM! cc @gante just in case you have opinions about the `max_length` generate kwarg clash.

@Rocketknight1 (Member)

Also @Joshua-Chin, you may need to rebase to fix some conflicts before we can merge the PR! That should also clear up the CI issues.

@gante (Member) left a comment

One question about variable names, otherwise lgtm :)

```
- `None`: default strategy where nothing in particular happens
- `"hole"`: Truncates left of input, and leaves a gap wide enough to let generation happen (might
  truncate a lot of the prompt, and is not suitable when generation exceeds the model capacity)
tokenizer_kwargs (`dict`, *optional*):
```
@gante (Member) commented Aug 22, 2025

perhaps `tokenizer_encode_kwargs`? There are also kwargs used at decode time, and we don't want to mix the two.

cc @Rocketknight1
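
For context, a sketch of the encode/decode split the rename guards against, using standard tokenizer calls (the checkpoint is an assumption):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")

# Encode-time kwargs control how text is turned into input IDs.
enc = tok("Hello world", truncation=True, max_length=32)

# Decode-time kwargs control how IDs are turned back into text.
text = tok.decode(enc["input_ids"], skip_special_tokens=True)
```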

@Joshua-Chin (Contributor, Author)

@gante I updated the argument to `tokenizer_encode_kwargs`. Please take another look when you have a chance.
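
With the rename, the earlier sketch (same assumptions as above) becomes:

```python
result = generator(messages, tokenizer_encode_kwargs={"enable_thinking": False})
```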

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from 2fe0979 to e484dbb on August 22, 2025 17:26
@Joshua-Chin (Contributor, Author)

The CI is currently failing because of the following test, added by a recently merged change (HunYuan opensource #39606):

FAILED tests/models/hunyuan_v1_moe/test_modeling_hunyuan_v1_moe.py::HunYuanMoEV1ModelTest::test_generate_compile_model_forward_fullgraph - torch._dynamo.exc.Unsupported: Dynamic shape operator

@Joshua-Chin force-pushed the text-generation-pipeline-tokenizer-kwargs branch from d80a814 to f1d1dc1 on August 22, 2025 20:46
@gante (Member) left a comment

thank you for iterating 🤗

@gante (Member) commented Aug 25, 2025

@Joshua-Chin I've clicked "Update branch" (i.e., pulled the new code from main), in hopes the CI error was a transient issue in our codebase 🤗

@gante enabled auto-merge (squash) on August 25, 2025 09:33
@gante (Member) commented Aug 25, 2025

Nope, that didn't fix it. It's unrelated to this PR; I'll have a look to see what's wrong.

@gante merged commit d8f2edc into huggingface:main on Aug 25, 2025. 24 checks passed.
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
