GLM-4.5V Model Support #39805
Conversation
Perfect! I think there are a few things we can still remove and let the modular conversion copy them. Overall very clean
Hey! Looks nice overall 🤗 But we absolutely want to rename ALL classes with Glm4vMoe prefixes instead of Glm4v_moe, as we enforce CamelCasing! Made a few other comments as well 🤗
```python
class Glm4v_moeVisionConfig(Glm4vVisionConfig):
    pass
```
We want the model name to be Glm4vMoe everywhere, not Glm4v_moe!!! We only use CamelCasing!
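For illustration, a minimal sketch of the requested rename (the modular subclassing stays the same; the import path is an assumption):

```python
# Hypothetical modular-style definition with the CamelCase prefix the
# review asks for: Glm4vMoe instead of Glm4v_moe.
from transformers.models.glm4v.configuration_glm4v import Glm4vVisionConfig


class Glm4vMoeVisionConfig(Glm4vVisionConfig):
    pass
```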
Fixed.
```python
def apply_multimodal_rotary_pos_emb(q, k, cos, sin, mrope_section, unsqueeze_dim=1):
    """Applies Rotary Position Embedding with Multimodal Sections to the query and key tensors (https://qwenlm.github.io/blog/qwen2-vl/).
```
This should be imported, not redefined here (I see it was already mentioned as a review comment)
The inner code of apply_multimodal_rotary_pos_emb has changed, so it can't be imported as-is.
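For reference, the Qwen2-VL-style baseline of this function in transformers looks roughly like the sketch below; per the comment above, the GLM-4.5V variant differs in its inner code, so treat this only as the starting point being discussed:

```python
import torch


def rotate_half(x):
    # Standard RoPE helper: rotates half the hidden dims of the input.
    x1 = x[..., : x.shape[-1] // 2]
    x2 = x[..., x.shape[-1] // 2 :]
    return torch.cat((-x2, x1), dim=-1)


def apply_multimodal_rotary_pos_emb(q, k, cos, sin, mrope_section, unsqueeze_dim=1):
    # cos/sin carry three position streams (temporal, height, width) in their
    # leading dim; each chunk of the head dim (sized per mrope_section) takes
    # its rotary angles from the matching stream, then the usual RoPE
    # rotation is applied to the query and key tensors.
    mrope_section = mrope_section * 2
    cos = torch.cat(
        [m[i % 3] for i, m in enumerate(cos.split(mrope_section, dim=-1))], dim=-1
    ).unsqueeze(unsqueeze_dim)
    sin = torch.cat(
        [m[i % 3] for i, m in enumerate(sin.split(mrope_section, dim=-1))], dim=-1
    ).unsqueeze(unsqueeze_dim)
    q_embed = (q * cos) + (rotate_half(q) * sin)
    k_embed = (k * cos) + (rotate_half(k) * sin)
    return q_embed, k_embed
```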
```python
if self.use_qk_norm:  # main diff from Llama
```
If it's always True, let's simply use it instead of having an if ...
For the latest model, this setting is False, and it's not yet certain whether any models with it set to True have been released, just like our GLM-4.5.
> For the latest model, this setting is False,

In that case we can remove the codepath! If a new model is released we'll add it for you!
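A minimal sketch of the branch under discussion; only the `if` line appears in the diff, so the module shape and the q_norm/k_norm body here are assumptions based on typical QK-norm layers:

```python
import torch
from torch import nn


class AttentionQKNormSketch(nn.Module):
    # Hypothetical module illustrating the optional QK-norm branch. The
    # review suggestion is to delete the flag and branch entirely, since
    # released checkpoints ship with use_qk_norm=False.
    def __init__(self, head_dim: int, use_qk_norm: bool = False):
        super().__init__()
        self.use_qk_norm = use_qk_norm
        self.q_norm = nn.RMSNorm(head_dim)  # assumed RMS-style per-head norm
        self.k_norm = nn.RMSNorm(head_dim)

    def maybe_qk_norm(self, query_states, key_states):
        if self.use_qk_norm:  # main diff from Llama
            query_states = self.q_norm(query_states)
            key_states = self.k_norm(key_states)
        return query_states, key_states
```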
Thanks for your hard work!
nice! 🧼
Very nice and minimal! 🚀
run-slow: auto, glm4_moe, glm4v, glm4v_moe

This comment contains run-slow, running the specified jobs: models: ['models/auto', 'models/glm4_moe', 'models/glm4v', 'models/glm4v_moe']

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

[For maintainers] Suggested jobs to run (before merge) run-slow: auto, glm4_moe, glm4v, glm4v_moe
@bot /style |
## Summary

This PR adds support for GLM4.1V (GLM-4 Vision) models to the Liger Kernel (#855, https://huggingface.co/zai-org/GLM-4.5). This model has been merged in huggingface/transformers#39805.

## Testing Done

Found that `python3 -m pytest test/convergence/bf16/test_mini_models.py -k 'glm4v_moe' -rF` fails with `AssertionError: [Loss]Number of mismatched elements: 14`:

<details>
<summary>Test result</summary>

```
AssertionError: [Loss]Number of mismatched elements: 14
Mismatch at index (0, 5): tensor1[(0, 5)] = 8.733983993530273, tensor2[(0, 5)] = 8.52511215209961
Mismatch at index (0, 8): tensor1[(0, 8)] = 7.2776618003845215, tensor2[(0, 8)] = 7.524500846862793
Mismatch at index (0, 9): tensor1[(0, 9)] = 6.917590618133545, tensor2[(0, 9)] = 7.175967216491699
Mismatch at index (0, 13): tensor1[(0, 13)] = 5.685216426849365, tensor2[(0, 13)] = 5.427236557006836
Mismatch at index (0, 14): tensor1[(0, 14)] = 5.337466239929199, tensor2[(0, 14)] = 5.049449443817139
... and 9 more mismatched elements.
```

</details>

- Hardware Type: <BLANK>
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [x] run `make test-convergence` to ensure convergence

Co-authored-by: Shao Tang <tangshao28@gmail.com>
Co-authored-by: Steven Shimizu <shimizust@gmail.com>
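For context, the `[Loss]` mismatch report above is the output of an element-wise closeness assertion over per-step losses. A minimal sketch of that kind of check; the helper name, tolerances, and message layout are assumptions, not Liger Kernel's exact test code:

```python
import torch


def assert_verbose_allclose(tensor1, tensor2, rtol=1e-5, atol=1e-5, max_print=5):
    # Flag every element outside the given tolerances, then report the
    # first few mismatches with their indices and values, like the log above.
    mismatch = ~torch.isclose(tensor1, tensor2, rtol=rtol, atol=atol)
    indices = mismatch.nonzero(as_tuple=False)
    if indices.numel() == 0:
        return
    lines = []
    for idx in indices[:max_print]:
        i = tuple(idx.tolist())
        lines.append(
            f"Mismatch at index {i}: tensor1[{i}] = {tensor1[i].item()}, "
            f"tensor2[{i}] = {tensor2[i].item()}"
        )
    if indices.size(0) > max_print:
        lines.append(f"... and {indices.size(0) - max_print} more mismatched elements.")
    raise AssertionError(
        f"Number of mismatched elements: {indices.size(0)}\n" + "\n".join(lines)
    )
```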

This PR will accomplish two things.