Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers #14367

suiyoubi · 2025-07-30T22:57:22Z

…formers

Important

The Update branch button must only be pressed in very rare occassions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

Add specific line by line info of high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

…formers Signed-off-by: Ao Tang <aot@nvidia.com>

nemo/collections/vlm/gemma3vl/model/gemma3vl.py

Signed-off-by: Ao Tang <aot@nvidia.com>

Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>

Signed-off-by: Ao Tang <aot@nvidia.com>

Signed-off-by: oliver könig <okoenig@nvidia.com>

Signed-off-by: Ao Tang <aot@nvidia.com>

ko3n1g · 2025-08-01T07:08:53Z

Nemo2 tests are passing, speech are known flaky

…nsformers (NVIDIA-NeMo#14367) * Fix Gemma3VL & Llava Next & Llama4 conversion issue when latest transformers Signed-off-by: Ao Tang <aot@nvidia.com> * bump transformers Signed-off-by: Ao Tang <aot@nvidia.com> * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * Remove unused import of Gemma3Model in gemma3vl.py Signed-off-by: Ao Tang <aot@nvidia.com> * build: Lift upper boundary of transformers Signed-off-by: oliver könig <okoenig@nvidia.com> * Fix key check Signed-off-by: Ao Tang <aot@nvidia.com> --------- Signed-off-by: Ao Tang <aot@nvidia.com> Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Signed-off-by: oliver könig <okoenig@nvidia.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: oliver könig <okoenig@nvidia.com> Signed-off-by: Oren Amsalem <oren.a4@gmail.com>

…nsformers (NVIDIA-NeMo#14367) * Fix Gemma3VL & Llava Next & Llama4 conversion issue when latest transformers Signed-off-by: Ao Tang <aot@nvidia.com> * bump transformers Signed-off-by: Ao Tang <aot@nvidia.com> * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * Remove unused import of Gemma3Model in gemma3vl.py Signed-off-by: Ao Tang <aot@nvidia.com> * build: Lift upper boundary of transformers Signed-off-by: oliver könig <okoenig@nvidia.com> * Fix key check Signed-off-by: Ao Tang <aot@nvidia.com> --------- Signed-off-by: Ao Tang <aot@nvidia.com> Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Signed-off-by: oliver könig <okoenig@nvidia.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: oliver könig <okoenig@nvidia.com>

…nsformers (NVIDIA-NeMo#14367) * Fix Gemma3VL & Llava Next & Llama4 conversion issue when latest transformers Signed-off-by: Ao Tang <aot@nvidia.com> * bump transformers Signed-off-by: Ao Tang <aot@nvidia.com> * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * Remove unused import of Gemma3Model in gemma3vl.py Signed-off-by: Ao Tang <aot@nvidia.com> * build: Lift upper boundary of transformers Signed-off-by: oliver könig <okoenig@nvidia.com> * Fix key check Signed-off-by: Ao Tang <aot@nvidia.com> --------- Signed-off-by: Ao Tang <aot@nvidia.com> Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Signed-off-by: oliver könig <okoenig@nvidia.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: oliver könig <okoenig@nvidia.com> Signed-off-by: Guyue Huang <guyueh@nvidia.com>

Fix Gemma3VL & Llava Next & Llama4 conversion issue when latest trans…

9ca1661

…formers Signed-off-by: Ao Tang <aot@nvidia.com>

suiyoubi changed the title ~~Fix Gemma3VL & Llava Next & Llama4 conversion issue when latest trans…~~ Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue when latest trans… Jul 30, 2025

github-advanced-security bot found potential problems Jul 30, 2025

View reviewed changes

nemo/collections/vlm/gemma3vl/model/gemma3vl.py Fixed Show fixed Hide fixed

bump transformers

f2593b1

Signed-off-by: Ao Tang <aot@nvidia.com>

suiyoubi force-pushed the aot/cicd-conversion-fix branch from 3a47405 to f2593b1 Compare July 30, 2025 23:25

suiyoubi added the Run CICD label Jul 30, 2025

ko3n1g added Run CICD and removed Run CICD labels Jul 30, 2025

Apply isort and black reformatting

e4049f4

Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>

ko3n1g added Run CICD and removed Run CICD labels Jul 30, 2025

github-actions bot removed the Run CICD label Jul 30, 2025

Remove unused import of Gemma3Model in gemma3vl.py

ca42c97

Signed-off-by: Ao Tang <aot@nvidia.com>

suiyoubi added the Run CICD label Jul 30, 2025

suiyoubi temporarily deployed to test July 30, 2025 23:36 — with GitHub Actions Inactive

build: Lift upper boundary of transformers

8d082f0

Signed-off-by: oliver könig <okoenig@nvidia.com>

ko3n1g added Run CICD and removed Run CICD labels Jul 31, 2025

ko3n1g temporarily deployed to test July 31, 2025 07:24 — with GitHub Actions Inactive

github-actions bot removed the Run CICD label Jul 31, 2025

Fix key check

e08303a

Signed-off-by: Ao Tang <aot@nvidia.com>

suiyoubi added the Run CICD label Jul 31, 2025

suiyoubi changed the title ~~Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue when latest trans…~~ Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers Jul 31, 2025

suiyoubi temporarily deployed to test July 31, 2025 13:13 — with GitHub Actions Inactive

github-actions bot removed the Run CICD label Jul 31, 2025

ko3n1g approved these changes Jul 31, 2025

View reviewed changes

ko3n1g merged commit a82dce9 into main Aug 1, 2025
450 of 464 checks passed

ko3n1g deleted the aot/cicd-conversion-fix branch August 1, 2025 07:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers #14367

Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers #14367

Uh oh!

suiyoubi commented Jul 30, 2025

Uh oh!

Uh oh!

ko3n1g commented Aug 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers #14367

Fix Gemma2/3 & Llava (Next) & Llama4 conversion issue with latest transformers #14367

Uh oh!

Conversation

suiyoubi commented Jul 30, 2025

What does this PR do ?

Changelog

Usage

GitHub Actions CI

Before your PR is "Ready for review"

Who can review?

Additional Information

Uh oh!

Uh oh!

ko3n1g commented Aug 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants