Add comments on RoPE initialization #1176

WoosukKwon · 2023-09-25T23:18:02Z

Added more comments that can be useful for understanding the differences from HF.

casper-hansen · 2023-09-26T10:57:21Z

Does this explain the difference between HF and vLLM? I.e. if you enable the same CPU initialization with greedy sampling, we get same outputs?

WoosukKwon · 2023-09-26T17:05:55Z

@casper-hansen Not always. Because floating-point arithmetics is not associative, different kernel implementations might lead to different outputs. The difference is more significant when reduction operation is involved. Therefore, our custom CUDA kernels for attention and RMS normalization do not produce exactly the same outputs as the original HF implementation, and thus outputs of vLLM models can be different from the HF models. We've checked that the outputs usually match when using FP32 and greedy sampling, but there are some cases where the outputs do not match. However, please note that this does not hurt task accuracy, as vLLM's implementation is mathematically equivalent to HF's.

zhuohan123

LGTM! Thanks for the fix!

Fix accuracy issue for llama 3.2 vision models that is caused by is_causal setting to False

Add comments

06b9e95

WoosukKwon requested a review from zhuohan123 September 26, 2023 17:23

zhuohan123 approved these changes Sep 26, 2023

View reviewed changes

WoosukKwon merged commit 03ffd0a into main Sep 26, 2023

WoosukKwon deleted the fix-rope branch September 26, 2023 17:48

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Add comments on RoPE initialization (vllm-project#1176)

582ccbe

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request May 8, 2025

Fix accuracy issue for llama 3.2 vision models. (vllm-project#1176)

3f0c2f7

Fix accuracy issue for llama 3.2 vision models that is caused by is_causal setting to False

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add comments on RoPE initialization #1176

Add comments on RoPE initialization #1176

Uh oh!

WoosukKwon commented Sep 25, 2023 •

edited

Loading

Uh oh!

casper-hansen commented Sep 26, 2023 •

edited

Loading

Uh oh!

WoosukKwon commented Sep 26, 2023

Uh oh!

zhuohan123 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Add comments on RoPE initialization #1176

Add comments on RoPE initialization #1176

Uh oh!

Conversation

WoosukKwon commented Sep 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

casper-hansen commented Sep 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WoosukKwon commented Sep 26, 2023

Uh oh!

zhuohan123 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WoosukKwon commented Sep 25, 2023 •

edited

Loading

casper-hansen commented Sep 26, 2023 •

edited

Loading