Fix get_max_num_running_seqs for waiting seq groups #1034

Yard1 · 2023-09-13T22:32:11Z

Currently, get_max_num_running_seqs always returns 0 for a waiting sequence group when best_of == self.num_seqs(). As a result, the scheduler can schedule more requests than max_num_seqs.
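To make the failure mode concrete, here is a minimal sketch using a toy stand-in class, not the real vLLM `SequenceGroup`. The names `ToySeqGroup`, `Status`, and `admit` are hypothetical. The final fallback that counts only running sequences is an assumption inferred from the description above, and the admission loop is a simplified guess at how a budget check might consume the value.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import List, Optional


class Status(Enum):
    WAITING = auto()
    RUNNING = auto()


@dataclass
class ToySeqGroup:
    """Hypothetical stand-in for a sequence group; not the vLLM class."""
    best_of: int
    statuses: List[Status] = field(default_factory=list)
    use_beam_search: bool = False

    def num_seqs(self, status: Optional[Status] = None) -> int:
        # Count all sequences, or only those in the given status.
        if status is None:
            return len(self.statuses)
        return sum(1 for s in self.statuses if s is status)

    def get_max_num_running_seqs(self) -> int:
        # Mirrors the pre-change logic as described (assumed fallback).
        if self.use_beam_search:
            return self.best_of
        if self.best_of > self.num_seqs():
            return self.best_of
        return self.num_seqs(status=Status.RUNNING)


def admit(groups: List[ToySeqGroup], max_num_seqs: int) -> List[ToySeqGroup]:
    """Toy admission loop: admit a group while its reported need fits the budget."""
    admitted: List[ToySeqGroup] = []
    budget_used = 0
    for group in groups:
        need = group.get_max_num_running_seqs()
        if budget_used + need > max_num_seqs:
            break
        budget_used += need
        admitted.append(group)
    return admitted


# A waiting group with best_of == num_seqs() == 1 reports a need of 0.
waiting_group = ToySeqGroup(best_of=1, statuses=[Status.WAITING])
reported = waiting_group.get_max_num_running_seqs()

# Because every waiting group reports 0, the budget is never consumed.
groups = [ToySeqGroup(best_of=1, statuses=[Status.WAITING]) for _ in range(5)]
admitted = admit(groups, max_num_seqs=2)
```

In this sketch all five groups are admitted even though the budget is 2, which is the over-scheduling symptom the description reports.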

Yard1 · 2023-09-16T02:53:38Z

cc @zhuohan123

zhuohan123 · 2023-09-17T08:42:55Z

vllm/sequence.py

            return self.sampling_params.best_of
        else:
-            if self.sampling_params.best_of > self.num_seqs():
+            if self.sampling_params.best_of >= self.num_seqs():


Thanks for reporting the issue! This fix would make the if condition always true. Please find the correct change in #1068.
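The review point can be checked with a short sketch. It assumes that, outside beam search, a group never holds more sequences than best_of, so `best_of >= num_seqs` holds for every reachable state. The function names below are hypothetical. The alternative shown, falling back to a count of unfinished sequences, is only one possible direction and is not confirmed here as the content of #1068.

```python
def proposed_condition(best_of: int, num_seqs: int) -> bool:
    """The condition from the proposed diff."""
    return best_of >= num_seqs


# Enumerate states under the assumption 1 <= num_seqs <= best_of.
always_true = all(
    proposed_condition(best_of, num_seqs)
    for best_of in range(1, 9)
    for num_seqs in range(1, best_of + 1)
)


def max_running_alternative(
    best_of: int,
    num_seqs: int,
    num_unfinished: int,
    use_beam_search: bool = False,
) -> int:
    """Hypothetical alternative: keep the strict comparison, but fall back to
    the number of unfinished sequences instead of only running ones."""
    if use_beam_search:
        return best_of
    if best_of > num_seqs:
        # Still in the prompt stage: the group will fan out to best_of.
        return best_of
    # Waiting or swapped sequences are unfinished, so they are counted too.
    return num_unfinished
```

With the proposed `>=`, the fallback branch is unreachable under the stated assumption, so a group in which some sequences have finished would still reserve best_of slots. The alternative keeps that branch reachable while avoiding the 0 result for waiting groups.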

Fix get_max_num_running_seqs for waiting seq groups

3bd548f

zhuohan123 mentioned this pull request Sep 17, 2023

Fix get_max_num_running_seqs for waiting and swapped seq groups #1068

Merged

zhuohan123 reviewed Sep 17, 2023

Yard1 closed this Sep 17, 2023
