Document using local/self-hosted models #101

dagardner-nv · 2025-04-08T17:03:23Z

Description

Document using locally hosted models with NIMs and with vLLM.
Add an OpenAI LangChain embedding client (required for vLLM).

Closes #34
Closes #37
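
For context on why an OpenAI-style client pairs with vLLM: a vLLM server exposes an OpenAI-compatible HTTP API, so an embedding client only needs a base URL, a model name, and an API key value. The sketch below is not the client added in this PR; it is a minimal standard-library illustration of the request such a client would send. The URL, model name, and key are hypothetical placeholders, and the helper names are invented for this example.

```python
import json
import urllib.request

# Assumed values for illustration only; none of these come from the PR.
VLLM_BASE_URL = "http://localhost:8000/v1"   # hypothetical local vLLM endpoint
EMBEDDING_MODEL = "example/embedding-model"  # hypothetical model name
API_KEY = "EMPTY"                            # placeholder; a local server may not check it


def build_embedding_request(texts, base_url=VLLM_BASE_URL,
                            model=EMBEDDING_MODEL, api_key=API_KEY):
    """Build (but do not send) an OpenAI-style POST request to /embeddings."""
    payload = json.dumps({"model": model, "input": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/embeddings",
        data=payload,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def embed(texts, **kwargs):
    """Send the request and return one embedding vector per input text."""
    with urllib.request.urlopen(build_embedding_request(texts, **kwargs)) as resp:
        body = json.load(resp)
    # OpenAI-style responses list results under "data", each with an "embedding".
    return [item["embedding"] for item in body["data"]]
```

Calling `embed(["some text"])` requires a running server at the assumed URL. Many OpenAI-style clients expect a non-empty key even when the server ignores it, which is one plausible reason for a placeholder `api_key` value in a local configuration.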

By Submitting this PR I confirm:

- I am familiar with the Contributing Guidelines.
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
  - Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

Signed-off-by: David Gardner <dagardner@nvidia.com>

mdemoret-nv · 2025-04-15T17:46:50Z

/merge

* Document using locally hosted models using NIMs and using vLLM
* Adds an OpenAI LangChain embedding client (required for the vLLM).

Closes NVIDIA#34
Closes NVIDIA#37

## By Submitting this PR I confirm:
- I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/AgentIQ/blob/develop/docs/source/advanced/contributing.md).
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
  - Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

Authors:
  - David Gardner (https://github.com/dagardner-nv)

Approvers:
  - Michael Demoret (https://github.com/mdemoret-nv)

URL: NVIDIA#101

Signed-off-by: Yuchen Zhang <134643420+yczhang-nv@users.noreply.github.com>

dagardner-nv added 9 commits April 7, 2025 16:05

WIP

4f25ab9

Signed-off-by: David Gardner <dagardner@nvidia.com>

Add an openai langchain embedder

091d495

Signed-off-by: David Gardner <dagardner@nvidia.com>

Register openai embedder

6e101d6

Signed-off-by: David Gardner <dagardner@nvidia.com>

Fix type-o in docstring

9bc25e5

Signed-off-by: David Gardner <dagardner@nvidia.com>

First pass at a vLLM config

96d6c5a

Signed-off-by: David Gardner <dagardner@nvidia.com>

WIP

f8d4629

Signed-off-by: David Gardner <dagardner@nvidia.com>

Lint fix

9723d67

Signed-off-by: David Gardner <dagardner@nvidia.com>

Add an api_key value

e19e1a1

Signed-off-by: David Gardner <dagardner@nvidia.com>

Document vLLM

b080296

Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv added the doc (Improvements or additions to documentation) and non-breaking (Non-breaking change) labels Apr 8, 2025

dagardner-nv self-assigned this Apr 8, 2025

dagardner-nv requested a review from a team as a code owner April 8, 2025 17:03

dagardner-nv marked this pull request as draft April 8, 2025 17:03

Add CR header

995db87

Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv marked this pull request as ready for review April 9, 2025 16:25

dagardner-nv marked this pull request as draft April 9, 2025 20:22

dagardner-nv added 2 commits April 9, 2025 15:24

Switch to using the microsoft/Phi-3-mini-4k-instruct LLM model as it was available for both NIM and vLLM

41f1a46

Signed-off-by: David Gardner <dagardner@nvidia.com>

Merge branch 'develop' of github.com:NVIDIA/AgentIQ into david-using-local-llms-35

f157cad

Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv marked this pull request as ready for review April 9, 2025 22:26

dagardner-nv marked this pull request as draft April 10, 2025 15:55

dagardner-nv marked this pull request as ready for review April 10, 2025 17:05

mdemoret-nv approved these changes Apr 15, 2025

rapids-bot bot merged commit 926d618 into NVIDIA:develop Apr 15, 2025
10 checks passed
