Make `max_model_len` configurable by Yard1 · Pull Request #972 · vllm-project/vllm

Conversation

Collaborator

@Yard1 Yard1 commented Sep 7, 2023

Allows the user to override the derived `max_model_len` if they so desire.

We can also warn the user if `max_model_len` is set above what vLLM has derived; let me know if you think that's a good idea!
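For context, a rough usage sketch of what this enables, assuming `max_model_len` is forwarded from the `LLM` entrypoint through `EngineArgs` as in this change (the model name and values below are purely illustrative, not part of the diff):

```python
# Illustrative sketch: cap the context window below the value vLLM would
# otherwise derive from the model's config. The `max_model_len` kwarg is
# assumed to be plumbed through to EngineArgs by this PR; when it is not
# given, the derived value is used as before.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m", max_model_len=512)

outputs = llm.generate(
    ["Hello, my name is"],
    SamplingParams(max_tokens=16),
)
print(outputs[0].outputs[0].text)
```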

Member

@zhuohan123 zhuohan123 left a comment


LGTM! Thanks for your contribution. And yes, please warn the user if `max_model_len` is set above what vLLM has derived.

Collaborator Author

@Yard1 Yard1 commented Sep 12, 2023

@zhuohan123 added warning
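For reference, a minimal sketch of what the added warning could look like; the function name and structure here are illustrative rather than the merged code:

```python
# Sketch only (not the merged diff): keep the user override of max_model_len,
# but warn when it exceeds the value derived from the model config.
import logging
from typing import Optional

logger = logging.getLogger(__name__)


def resolve_max_model_len(derived_max_len: int, user_max_len: Optional[int]) -> int:
    """Return the context length to use, preferring the user override."""
    if user_max_len is None:
        return derived_max_len
    if user_max_len > derived_max_len:
        logger.warning(
            "User-specified max_model_len (%d) is greater than the derived "
            "max_model_len (%d); using it may lead to incorrect model outputs "
            "or runtime errors.",
            user_max_len,
            derived_max_len,
        )
    return user_max_len
```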

Member

@zhuohan123 zhuohan123 left a comment


LGTM! Thanks for the contribution!

@zhuohan123 zhuohan123 merged commit 0bb1e88 into vllm-project:main Sep 12, 2023
@Yard1 Yard1 deleted the configurable_model_max_len branch September 13, 2023 01:03
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request May 12, 2025: "Enable fp32 softmax in flat_pa_mla for accuracy."