KEMBAR78
Add documentation to Triton server tutorial by tanmayv25 · Pull Request #983 · vllm-project/vllm · GitHub
Skip to content

Conversation

@tanmayv25
Copy link
Contributor

There has been a growing interest in how to deploy vLLM within Triton. Adding this documentation would make it easier for vLLM to find Triton as a serving platform. #541 (comment)

Copy link
Collaborator

@WoosukKwon WoosukKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tanmayv25 Sorry for the delayed response, and thanks for your hard work on the integration! I've only requested a minor change.

Let's further promote this! On our side, I think we can further enhance the document with a running example and a performance benchmark.

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
@tanmayv25
Copy link
Contributor Author

@WoosukKwon Thanks for the comments! We welcome any improvements from the community.

Copy link
Collaborator

@WoosukKwon WoosukKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks for the PR!

@WoosukKwon WoosukKwon merged commit 6f2dd6c into vllm-project:main Sep 20, 2023
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants