-
Notifications
You must be signed in to change notification settings - Fork 30.9k
Fix processing tests #40379
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix processing tests #40379
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would love a bit more detailed in the PR description 🙏 (no need to be a thesis, but something that we can know what/why the previous PR break and a short explanation of the changes in this PR)
I will check this afternoon, but I need a bit more context (so my comment above 🙏 ) |
|
|
||
| @property | ||
| def model_input_names(self): | ||
| # InternVL renames video inputs to `pixel_values` before returning |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice to have comment.
Might would be even better to explain a bit more:
- does this simply "what we are doing here" (and if so, why?)
- or does this mean because
InternVL renames video inputs topixel_valuesbefore returning, so we need to have this change here (otherwise something would go wrong)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah, that is the reason to override it, will make it clear
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you
|
[For maintainers] Suggested jobs to run (before merge) run-slow: aria, chameleon, deepseek_vl_hybrid, gemma3, glm4v, grounding_dino, hunyuan_v1_moe, internvl, llava, mistral3, mllama, pix2struct, qwen2_vl, smolvlm |
What does this PR do?
I broke the CI in #40339 🙈 No idea why the tests weren't triggered before merging
The newly added test is failing for InetrnVL, QwenAudio (and others) because the number of video/audio placeholders doesn't match number of actual video/audios. The fix is to allow to prepare dummy text with different modality placeholder, especially useful when a processor can process more than one modality