Add Model Revision Support #1014

ghost · 2023-09-11T08:23:53Z

This PR adds a `revision` attribute to the LLM engine. The revision option specifies which commit of the model to download, for consistency and reliability.

For API server usage:

python3 -m vllm.entrypoints.api_server --model facebook/opt-125m --revision 507a3991d874042a92e7581eb6e7cc7074b0c77e

For LLM engine usage:

from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m", revision="507a3991d874042a92e7581eb6e7cc7074b0c77e")
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)
outputs = llm.generate(["hello, my name is "], sampling_params)
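As background on why a commit hash makes a reliable pin: on the Hugging Face Hub a revision can be a branch name, a tag, or a commit hash, and only a full commit hash always points at the same files. The helper below is a hypothetical sketch, not part of this PR or of vLLM; the name `looks_like_commit_hash` is an assumption made for illustration. It only checks whether a revision string has the shape of a full 40-character commit hash.

```python
import re

# Hypothetical helper (not part of vLLM): a full Git commit hash is
# 40 lowercase hexadecimal characters.
_FULL_SHA = re.compile(r"[0-9a-f]{40}")


def looks_like_commit_hash(revision):
    """Return True if `revision` has the shape of a full commit hash.

    Branch names and tags (for example "main") return False, as does None.
    """
    return revision is not None and _FULL_SHA.fullmatch(revision) is not None


if __name__ == "__main__":
    # The revision used in the examples above is a full commit hash.
    print(looks_like_commit_hash("507a3991d874042a92e7581eb6e7cc7074b0c77e"))
    # A branch name can move over time, so it is not an immutable pin.
    print(looks_like_commit_hash("main"))
```

A check like this could be used to warn when a deployment passes a branch name where a reproducible pin was intended.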

ghost · 2023-09-11T10:04:39Z

Hi @WoosukKwon,

Could you review this PR?

Thanks.

zhuohan123

Thank you for your contribution! In general, LGTM! I left a small comment on the default value. We can merge this branch after that is fixed. Also, please format your script with `format.sh`.

vllm/engine/arg_utils.py
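The review comment about the default value can be illustrated with a minimal `argparse` sketch. This is an assumption-laden illustration, not the actual contents of `vllm/engine/arg_utils.py`, which this thread does not show; the function name `build_parser` and the help text are hypothetical. It shows a `--revision` flag whose default is `None` rather than a hard-coded branch name, matching the later commit "Change main to None to follow HF's style".

```python
import argparse


def build_parser():
    """Build a toy parser with a --revision flag (illustrative only)."""
    parser = argparse.ArgumentParser(description="Sketch of a --revision flag")
    parser.add_argument("--model", type=str, default="facebook/opt-125m")
    # Defaulting to None (instead of "main") leaves the choice of the
    # default version to whatever loads the model downstream.
    parser.add_argument(
        "--revision",
        type=str,
        default=None,
        help="Model version to use: a branch name, a tag name, or a "
             "commit id. If unspecified, the default version is used.",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["--revision", "507a3991d874042a92e7581eb6e7cc7074b0c77e"]
    )
    print(args.model, args.revision)
```

With no `--revision` flag, `args.revision` is `None`, so the caller can tell "not specified" apart from an explicit request for a particular branch.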

Co-authored-by: Jasmond Loh <Jasmond.Loh@hotmail.com> Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>

### What this PR does / why we need it?
Add benchmark workflows

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Run locally

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

…#1039)

### What this PR does / why we need it?
This is a follow-up patch to vllm-project#1014 with some convenience optimizations:
- Set a cached dataset path for speed
- Use PyPI to install escli-tool
- Add a benchmark results conversion script to produce developer-friendly results
- Patch `benchmark_dataset.py` to disable streaming load for internet
- Add more triggers for different purposes: `pr` for debugging, `schedule` for daily tests, `dispatch` and `pr-labled` for manual testing of a single (current) commit
- Disable the latency test for `qwen-2.5-vl` (this script does not support multi-modal yet)

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
CI passed

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

JasmondL added 2 commits September 11, 2023 14:25

Add revision attribute

90c1a46

Remove experimental code

7d78736

ghost changed the title ~~Add revision attribute~~ Add Model Revision Support Sep 12, 2023

zhuohan123 approved these changes Sep 12, 2023

vllm/engine/arg_utils.py (resolved)

vllm/engine/arg_utils.py (outdated, resolved)

JasmondL and others added 5 commits September 13, 2023 16:50

Update Format

6b9f417

Merge branch 'main' into revision-patch-1

4343f20

Update Comments for Revision Support

216acba

Change main to None to follow HF's style

1b81f4a

fix

88efa64

zhuohan123 merged commit ab019ee into vllm-project:main Sep 13, 2023

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Add Model Revision Support (vllm-project#1014)

0d7a22f

Co-authored-by: Jasmond Loh <Jasmond.Loh@hotmail.com> Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
