KEMBAR78
Add safetensors support for quantized models by WoosukKwon · Pull Request #1073 · vllm-project/vllm · GitHub
Skip to content

Conversation

@WoosukKwon
Copy link
Collaborator

Fixes #1071

The error is because the safetensors need to be converted to regular tensors before the transpose operation.

Copy link
Member

@zhuohan123 zhuohan123 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for the fix!

@zhuohan123 zhuohan123 merged commit cc796b1 into main Sep 18, 2023
@WoosukKwon WoosukKwon deleted the awq-safetensors branch September 18, 2023 18:59
@TheBloke
Copy link

Awesome, thanks guys. AWQs will start flooding HF in the next few hours!

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

AWQ: vLLM cannot load AWQ models in Safetensors format

3 participants