fix: Google: use maxTokens from params #734
Conversation
Thank you, @nicolasf, for catching this!
Let's add or modify some tests to avoid regressions in the future.
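A regression test could pin that behaviour down. A minimal sketch, assuming a stand-in helper (the real client internals will differ):

```kotlin
import kotlin.test.Test
import kotlin.test.assertEquals

// Hypothetical shape of such a regression test. resolveMaxOutputTokens is an
// illustrative stand-in for the client logic, not Koog's actual internals.
class GoogleMaxTokensTest {
    // Stand-in: prefer the caller-supplied value, fall back to the old default.
    private fun resolveMaxOutputTokens(requested: Int?): Int = requested ?: 2048

    @Test
    fun usesMaxTokensFromParams() {
        assertEquals(4096, resolveMaxOutputTokens(4096))
    }

    @Test
    fun fallsBackToDefaultWhenUnset() {
        assertEquals(2048, resolveMaxOutputTokens(null))
    }
}
```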
no tests
Force-pushed d5fea78 to 174924f.
Force-pushed 174924f to bd14e68.
Hi @kpavlov.
Thank you, @nicolasf.
I would also add a check that the max tokens value is positive, but in general this makes things better.
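A minimal sketch of such a guard, assuming a nullable maxTokens parameter (names are illustrative, not the actual Koog API):

```kotlin
// Forward maxTokens only when it is a positive value; otherwise treat it as
// unset so the backend default applies. Illustrative sketch, not Koog's API.
fun resolveMaxOutputTokens(maxTokens: Int?): Int? =
    maxTokens?.takeIf { it > 0 }
```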
Hi @kpavlov. Is it possible to get this merged before 0.5.0 is released?
Thanks for fixing this @nicolasf! @Ololoshechkin, are there plans to release 0.5.0 or 0.4.3 soon (or even a feature or SNAPSHOT build)? We're loving Koog so far, but we cannot migrate to it fully, as this issue prevents the use of Gemini models.
(cherry picked from commit 45012a3)
Current implementation has a hardcoded maxOutputTokens=2048. Use the value from prompt.params.maxTokens similar to the OpenAILLMClient.
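A minimal sketch of the described change, using simplified stand-in types (not the actual GoogleLLMClient code; the fallback when maxTokens is unset is an assumption):

```kotlin
// Simplified stand-ins for the real prompt types; illustrative only.
data class LLMParams(val maxTokens: Int? = null)
data class Prompt(val params: LLMParams)

// Before the fix, the Google client always sent maxOutputTokens = 2048.
// After: forward the caller-supplied value, mirroring OpenAILLMClient.
fun maxOutputTokens(prompt: Prompt): Int =
    prompt.params.maxTokens ?: 2048 // assumed fallback to the previous default
```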
Motivation and Context
During execution of structured output requests using Gemini 2.5, I encountered unexpected failures due to reaching MAX_TOKENS.
After checking the GoogleLLMClient implementation, I noticed a hardcoded maxOutputTokens = 2048. Checking the other clients, I'm replicating the same logic that exists in OpenAILLMClient, which applies the value from prompt.params.maxTokens.
Breaking Changes
No breaking changes.
Type of the changes
Checklist
develop as the base branch
Additional steps for pull requests adding a new feature