-
Notifications
You must be signed in to change notification settings - Fork 396
Add support for Weave evaluation #264
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: ayulockin <mein2work@gmail.com>
Signed-off-by: ayulockin <mein2work@gmail.com>
Signed-off-by: ayulockin <mein2work@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this super cool contribution @ayulockin
Signed-off-by: ayulockin <mein2work@gmail.com>
Signed-off-by: ayulockin <mein2work@gmail.com>
Signed-off-by: ayulockin <mein2work@gmail.com>
Signed-off-by: ayulockin <mein2work@gmail.com>
This is without weave turned on:
This is with weave enabled:
Note that this comparison is done only on 5 examples but definitely there is some performance overhead introduced when weave is turned on. |
Signed-off-by: ayulockin <mein2work@gmail.com>
Hey @AnuradhaKaruppiah the race conditions are gone. I have removed the hack to suppress the bad stdouts. The PR is ready from my end to be reviewed. I want to add support for profiler and add tests. I have started working on them but we can add those in a separate PR or if you want I can add them with this PR. |
Hey @AnuradhaKaruppiah I also ran the following checks locally.
![]()
|
/ok to test ef9f17b |
/merge |
This PR adds the ability to run evaluations such that the traces and evaluation scores are logged to W&B Weave. This allows for comparing evaluations, debugging evaluations, etc. ## By Submitting this PR I confirm: - I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/AIQToolkit/blob/develop/docs/source/resources/contributing.md). - We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license. - Any contribution which contains commits that are not Signed-Off will not be accepted. - When the PR is ready for review, new or existing tests cover these changes. - When the PR is ready for review, the documentation is up to date with these changes. Authors: - Ayush Thakur (https://github.com/ayulockin) Approvers: - Anuradha Karuppiah (https://github.com/AnuradhaKaruppiah) URL: NVIDIA#264
This PR adds the ability to run evaluations such that the traces and evaluation scores are logged to W&B Weave. This allows for comparing evaluations, debugging evaluations, etc. ## By Submitting this PR I confirm: - I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/AIQToolkit/blob/develop/docs/source/resources/contributing.md). - We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license. - Any contribution which contains commits that are not Signed-Off will not be accepted. - When the PR is ready for review, new or existing tests cover these changes. - When the PR is ready for review, the documentation is up to date with these changes. Authors: - Ayush Thakur (https://github.com/ayulockin) Approvers: - Anuradha Karuppiah (https://github.com/AnuradhaKaruppiah) URL: NVIDIA#264
Description
This PR adds the ability to run evaluations such that the traces and evaluation scores are logged to W&B Weave. This allows for comparing evaluations, debugging evaluations, etc.
By Submitting this PR I confirm: