[Glm4.5V] fix vLLM support #40696
Conversation
[For maintainers] Suggested jobs to run (before merge): run-slow: glm4v

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks for fixing! I'm trusting you, I'm a bit out of context 😄
 @dataclass
-class VideoMetadata:
+class VideoMetadata(Mapping):
What is the motivation for making it iterable?
So that we can use VideoMetadata the same way as a dict. We currently allow users to pass both formats.
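For illustration, here is a minimal sketch of the pattern (the field names are placeholders, not the real VideoMetadata definition): subclassing collections.abc.Mapping lets a dataclass be consumed anywhere a plain dict is expected.

```python
# Illustrative sketch only; field names are assumptions, not the actual
# VideoMetadata fields in transformers.
from collections.abc import Mapping
from dataclasses import dataclass, fields


@dataclass
class VideoMetadata(Mapping):
    fps: float = 24.0
    total_num_frames: int = 0

    # Implementing the three abstract Mapping methods is enough; the
    # Mapping mixins then supply keys(), values(), items(), ** unpacking,
    # and dict() conversion for free.
    def __getitem__(self, key):
        return getattr(self, key)

    def __iter__(self):
        return (f.name for f in fields(self))

    def __len__(self):
        return len(fields(self))


meta = VideoMetadata(fps=30.0, total_num_frames=120)
assert meta["fps"] == 30.0                                   # dict-style access
assert dict(meta) == {"fps": 30.0, "total_num_frames": 120}  # dict conversion
```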
 patch_size = images_kwargs.get("patch_size", self.patch_size)
 merge_size = images_kwargs.get("merge_size", self.merge_size)
-size = images_kwargs.get("size", self.size)
+size = images_kwargs.get("size", {"shortest_edge": 112 * 112, "longest_edge": 28 * 28 * 15000})
should we align self.size instead?
The defaults are already using these values. It is the model checkpoint that has another set of values saved, and for some reason we never use them. I didn't want to break BC, but using the config values is actually correct.
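A hypothetical sketch of the two fallback strategies being weighed here (the function and constant names are made up for illustration):

```python
# Hypothetical names; resolve_size_* and HARDCODED_SIZE do not exist in
# transformers and only illustrate the precedence under discussion.
HARDCODED_SIZE = {"shortest_edge": 112 * 112, "longest_edge": 28 * 28 * 15000}


def resolve_size_bc(images_kwargs: dict) -> dict:
    # Behavior kept by this PR: per-call kwargs win, otherwise fall back
    # to the hardcoded defaults, ignoring the checkpoint's saved value.
    return images_kwargs.get("size", HARDCODED_SIZE)


def resolve_size_from_config(images_kwargs: dict, config_size: dict) -> dict:
    # Planned follow-up: fall back to the value saved in the checkpoint
    # config (self.size), which the authors confirmed is correct.
    return images_kwargs.get("size", config_size)
```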
Hmmm, can we update the config values instead? Or will the Hub PR not be merged?
We can try, let me ask one of the authors who is on Slack.
Asked the authors, and the config values are the correct ones. So I will update the processor in the next PR to use config defaults and mark it as slightly breaking.
Alright, thanks a lot!
What does this PR do?
GLM-4.5V inference does not work in the latest release, and this PR fixes it. Specifically, users can now pass a VideoMetadata object to the processor's call, which is needed for backwards compatibility (adding a test for this, one sec). There is also a tiny fix in image processing to align a helper utility with self._preprocess.
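A hedged usage sketch of what the fix enables; the checkpoint id, metadata field names, and processor keyword arguments below are assumptions based on typical transformers usage, so check the model card and processor signature before relying on them.

```python
# Sketch only: checkpoint id and kwarg names are assumptions, not
# verified against the GLM-4.5V processor.
import numpy as np
from transformers import AutoProcessor
from transformers.video_utils import VideoMetadata

processor = AutoProcessor.from_pretrained("zai-org/GLM-4.5V")

video = np.zeros((16, 224, 224, 3), dtype=np.uint8)      # placeholder frames
metadata = VideoMetadata(fps=2.0, total_num_frames=16)   # or an equivalent dict

inputs = processor(
    text="<video> Describe the clip.",
    videos=[video],
    video_metadata=[metadata],  # dict or VideoMetadata both accepted after this fix
    return_tensors="pt",
)
```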