rope_theta and max_position_embeddings from config #1096
Conversation
Hmm, good question. The HF Transformers behavior is to load it from the config. I have no strong feelings one way or another, though I lean towards consistency between vLLM and Transformers. We can raise an exception if this value is set below the max model len. My bad about this being a duplicate; happy to close this one if needed.
My concern is that the bug will happen when the user-specified maximum model length is larger than the model's configured maximum. To my knowledge, at least mathematically, increasing …
No worries. As this is an urgent bug fix, I think we can take this PR and have @wanmok as a co-author (if you are ok with it).
Ofc I am fully ok with co-authorship! How about that exception, then? If there is a mismatch between the two, I feel it's better to let the user know explicitly and have them fix it, instead of trying to magic it away.
Got it. Then what's the role of the user-specified maximum model length?
It can be used to set the length below the model maximum, e.g. limiting the context length when serving or when memory-constrained.
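For instance, a minimal sketch of that use case (assuming vLLM's offline `LLM` entrypoint and its `max_model_len` argument):

```python
from vllm import LLM

# Serve CodeLlama with a context window capped below the model's configured
# maximum, e.g. to fit the KV cache in limited GPU memory.
llm = LLM(model="codellama/CodeLlama-7b-hf", max_model_len=4096)
```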
Got it. Then I'm good with your idea to raise an error.
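A minimal sketch of the check being discussed (not the exact code that landed in `config.py`; the function and variable names are illustrative):

```python
def verify_max_model_len(user_max_model_len, derived_max_model_len):
    """Validate a user-specified max model length against the config-derived one."""
    if user_max_model_len is None:
        # No override: use the maximum derived from the model config.
        return derived_max_model_len
    if user_max_model_len > derived_max_model_len:
        # Surface the mismatch instead of silently extending the context.
        raise ValueError(
            f"User-specified max model length ({user_max_model_len}) is "
            f"greater than the maximum length derived from the model config "
            f"({derived_max_model_len}).")
    # Capping the length below the model maximum is fine.
    return user_max_model_len
```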
I think this PR will also fix #905
@Yard1 LGTM! I slightly refactored the code in config.py to make the logic a bit clearer.
No, this PR cannot fix it.
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: wnma3mz <wnma3mz@gmail.com>
This PR lets `rope_theta` and `max_position_embeddings` be read from the model config instead of hardcoding them. Notably, this allows CodeLlama to work without issues at longer contexts. Fixes #904
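A minimal sketch of the idea (the attribute names follow HF Transformers' Llama-family configs; the fallback value for `rope_theta` is an assumption based on the common Llama default):

```python
from transformers import AutoConfig

hf_config = AutoConfig.from_pretrained("codellama/CodeLlama-7b-hf")

# Read the RoPE base and the maximum context length from the config instead of
# hardcoding them; older configs may omit rope_theta, so fall back to the
# common Llama default of 10000.
rope_theta = getattr(hf_config, "rope_theta", 10000.0)
max_position_embeddings = hf_config.max_position_embeddings
```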