fix qwen-14b model #1173

Sanster · 2023-09-25T09:50:45Z

Support both qwen-7b and qwen-14b.

ZPerling · 2023-09-26T06:26:04Z

I've tested using this code and qwen-14b still can not load

Sanster · 2023-09-27T01:37:45Z

I've tested using this code and qwen-14b still can not load

What error did you encounter?

fengtong-xiao · 2023-09-27T03:33:16Z

Still encountered this error:

RuntimeError: shape '[3, 32, 128]' is invalid for input of size 15360

Sanster · 2023-09-27T06:47:58Z

Still encountered this error:

RuntimeError: shape '[3, 32, 128]' is invalid for input of size 15360

I tested it again and it works. Please make sure the changes to your local vllm code have taken effect.

fengtong-xiao · 2023-09-27T09:05:45Z

Still encountered this error:
RuntimeError: shape '[3, 32, 128]' is invalid for input of size 15360

I tested it again and it works. Please make sure the changes to your local vllm code have taken effect.

Thanks! This works for me, turns out I need to restart the cluster instead of detach-and-reattach.

zhuohan123

LGTM! Tested with Qwen/Qwen-14B-Chat and it works well. Thanks for your contribution!

Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>

fix qwen-14b model

11fcd9d

Sanster force-pushed the fix_qwen_14b branch from 28ac647 to 11fcd9d Compare September 25, 2023 09:54

Sanster mentioned this pull request Sep 25, 2023

Can not load Qwen-14B-chat #1172

Closed

xusenlinzy mentioned this pull request Sep 25, 2023

💡 [REQUEST] - 需要支持Qwen-14B-chat xusenlinzy/api-for-open-llm#134

Closed

zhuohan123 approved these changes Sep 27, 2023

View reviewed changes

zhuohan123 merged commit 28e616c into vllm-project:main Sep 27, 2023

330570902 mentioned this pull request Oct 8, 2023

[BUG] 运行Qwen14B时报错 chatchat-space/Langchain-Chatchat#1646

Closed

jklj077 mentioned this pull request Jan 17, 2024

[BUG] vLLM推理Qwen-14b-chat出现个别乱码和个别字母无法生成 QwenLM/Qwen#974

Closed

2 tasks

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

fix qwen-14b model (vllm-project#1173)

66a9693

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request May 8, 2025

[V1] Set dynamo cache size even if warmup is skipped (vllm-project#1173)

6c214b6

Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix qwen-14b model #1173

fix qwen-14b model #1173

Uh oh!

Sanster commented Sep 25, 2023

Uh oh!

ZPerling commented Sep 26, 2023

Uh oh!

Sanster commented Sep 27, 2023

Uh oh!

fengtong-xiao commented Sep 27, 2023

Uh oh!

Sanster commented Sep 27, 2023

Uh oh!

fengtong-xiao commented Sep 27, 2023

Uh oh!

zhuohan123 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

fix qwen-14b model #1173

fix qwen-14b model #1173

Uh oh!

Conversation

Sanster commented Sep 25, 2023

Uh oh!

ZPerling commented Sep 26, 2023

Uh oh!

Sanster commented Sep 27, 2023

Uh oh!

fengtong-xiao commented Sep 27, 2023

Uh oh!

Sanster commented Sep 27, 2023

Uh oh!

fengtong-xiao commented Sep 27, 2023

Uh oh!

zhuohan123 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants