Provide a config to pass the complete dataset entry as an EvalInputItem field to evaluators #355

AnuradhaKaruppiah · 2025-06-10T01:01:20Z

Description

Add support for passing the full dataset entry through to evaluators via a new full_dataset_entry field on EvalInputItem.
Add tests and update docs to demonstrate full_dataset_entry.
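
As a rough illustration of what this enables, the sketch below shows an evaluator helper reading an extra dataset column through `full_dataset_entry`. The `EvalInputItem` here is a stand-in dataclass, not the toolkit's model: only the `full_dataset_entry` name comes from this PR, while the other fields, the `difficulty` column, and the `difficulty_of` helper are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class EvalInputItem:
    """Stand-in for the real model; only full_dataset_entry is named in this PR."""
    id: Any                  # assumed field
    input_obj: Any           # assumed field
    expected_output_obj: Any # assumed field
    full_dataset_entry: Any  # the complete dataset row, including extra columns


def difficulty_of(item: EvalInputItem, default: str = "unknown") -> str:
    """Read a hypothetical extra column ('difficulty') from the full entry."""
    entry = item.full_dataset_entry or {}
    return entry.get("difficulty", default)


item = EvalInputItem(
    id=1,
    input_obj="What is 2 + 2?",
    expected_output_obj="4",
    full_dataset_entry={
        "id": 1,
        "question": "What is 2 + 2?",
        "answer": "4",
        "difficulty": "easy",  # extra column not mapped to any other field
    },
)
print(difficulty_of(item))
```

Falling back to a default when the entry is missing keeps such a helper usable even if an item was built without the full row.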

By Submitting this PR I confirm:

- I am familiar with the Contributing Guidelines.
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
  - Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

mdemoret-nv

There is a disconnect here between where this setting is specified and where it will be used. The setting is on the dataset specification, but it would only be used by evaluators. So it's possible to set pass_full_entry=False and then use a metric which requires the full entry.

Instead, why don't we just always include the full entry and then add options to filter it from the saved output? I don't see any reason why we shouldn't include the full entry by default to the evaluators.
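
One way to picture this suggestion is sketched below: the full entry is always attached to the item, and a filter is applied only when the item is written out. Everything here is hypothetical and for illustration only; `to_saved_record` and its `include_full_entry` parameter are not toolkit APIs, and the dataclass is a stand-in for the real `EvalInputItem`.

```python
import json
from dataclasses import asdict, dataclass
from typing import Any


@dataclass
class EvalInputItem:
    """Stand-in model; the evaluator always receives full_dataset_entry."""
    id: Any
    input_obj: Any
    full_dataset_entry: Any = None


def to_saved_record(item: EvalInputItem, include_full_entry: bool = False) -> dict:
    """Serialize an item for the saved output, optionally dropping the full entry.

    Hypothetical helper: filtering happens at save time, so evaluators are
    never affected by the output setting.
    """
    record = asdict(item)
    if not include_full_entry:
        record.pop("full_dataset_entry", None)
    return record


item = EvalInputItem(
    id=7,
    input_obj="Name the capital of France.",
    full_dataset_entry={"id": 7, "question": "Name the capital of France.", "source": "geo-set"},
)

# Evaluators still see the full entry on the item itself.
assert item.full_dataset_entry["source"] == "geo-set"

# The saved output omits it unless explicitly requested.
print(json.dumps(to_saved_record(item)))
print(json.dumps(to_saved_record(item, include_full_entry=True)))
```

Separating the two concerns this way means a metric that needs the full entry cannot be broken by a dataset-level flag.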

src/aiq/eval/dataset_handler/dataset_handler.py

Copilot

Pull Request Overview

Adds support for passing the full dataset entry through to evaluators via a new full_dataset_entry field on EvalInputItem, controlled by a pass_full_entry flag.

Always include full_dataset_entry in each EvalInputItem (intended to be conditional).
Extend EvalInputItem model and DatasetHandler to populate the new field.
Add tests and update docs to demonstrate full_dataset_entry.

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| tests/aiq/eval/dataset_handler/test_dataset_handler.py | Add fixtures and a test to verify `full_dataset_entry` contains all extra fields. |
| src/aiq/eval/evaluator/evaluator_model.py | Extend `EvalInputItem` model with `full_dataset_entry` field. |
| src/aiq/eval/dataset_handler/dataset_handler.py | Populate `full_dataset_entry` via `row.to_dict()` in `create_eval_item`. |
| docs/source/reference/evaluate.md | Document `full_dataset_entry` access in evaluators. |
| docs/source/extend/custom-evaluator.md | Describe `full_dataset_entry` in custom evaluator reference. |
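
The population step summarized above can be sketched roughly as follows. This is not the code from `dataset_handler.py`: the real change is described as calling `row.to_dict()` on a dataset row, whereas this self-contained version uses plain dicts and `dict(row)` so it runs without extra dependencies. The key names (`id`, `question`, `answer`), the function signature, and the stand-in dataclass are assumptions.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class EvalInputItem:
    """Stand-in for the real model."""
    id: Any
    input_obj: Any
    expected_output_obj: Any
    full_dataset_entry: Any


def create_eval_item(row: dict,
                     id_key: str = "id",
                     question_key: str = "question",
                     answer_key: str = "answer") -> EvalInputItem:
    """Build an item from one dataset row and keep a copy of the whole row."""
    return EvalInputItem(
        id=row.get(id_key),
        input_obj=row.get(question_key),
        expected_output_obj=row.get(answer_key),
        # Copy the row so later changes to the source data do not leak in.
        # The PR describes this step as row.to_dict().
        full_dataset_entry=dict(row),
    )


rows = [
    {"id": 1, "question": "What is 2 + 2?", "answer": "4", "category": "math"},
    {"id": 2, "question": "Name a primary color.", "answer": "red", "category": "art"},
]
items = [create_eval_item(row) for row in rows]

for item in items:
    print(item.id, item.full_dataset_entry["category"])
```

Keeping the mapped fields and the raw row side by side lets existing evaluators continue to use the mapped fields while custom ones reach for extra columns.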

Comments suppressed due to low confidence (3)

src/aiq/eval/evaluator/evaluator_model.py:30

[nitpick] Consider making full_dataset_entry optional (e.g., typing.Optional[dict]) or providing a default value so the model remains valid when the feature is disabled.

full_dataset_entry: typing.Any

tests/aiq/eval/dataset_handler/test_dataset_handler.py:156

Add a test for when pass_full_entry is set to false to confirm full_dataset_entry is omitted or empty in EvalInputItem.

dataset_config = EvalDatasetJsonConfig()

docs/source/reference/evaluate.md:90

Include an example YAML snippet under the dataset config showing how to enable pass_full_entry (e.g., dataset: pass_full_entry: true) so users know how to turn this feature on.

### Accessing Additional Dataset Fields in Evaluators

src/aiq/eval/dataset_handler/dataset_handler.py

AnuradhaKaruppiah · 2025-06-12T20:58:30Z

/merge

…em field to evaluators (NVIDIA#355)

- Add support for passing the full dataset entry through to evaluators via a new full_dataset_entry field on EvalInputItem.
- Add tests and update docs to demonstrate full_dataset_entry.

## By Submitting this PR I confirm:
- I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/AIQToolkit/blob/develop/docs/source/resources/contributing.md).
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
  - Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

Authors:
  - Anuradha Karuppiah (https://github.com/AnuradhaKaruppiah)

Approvers:
  - Yuchen Zhang (https://github.com/yczhang-nv)
  - Michael Demoret (https://github.com/mdemoret-nv)

URL: NVIDIA#355

AnuradhaKaruppiah added 3 commits June 9, 2025 17:22

Enable passing the full dataset entry to evaluators

34e120a

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

Update docs

761dd06

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

Unit tests

8279b85

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

AnuradhaKaruppiah requested a review from Copilot June 10, 2025 01:02

AnuradhaKaruppiah added the improvement (Improvement to existing functionality) and non-breaking (Non-breaking change) labels Jun 10, 2025

Make the full_dataset_entry field optional in EvalInputItem

06500e2

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

AnuradhaKaruppiah self-assigned this Jun 11, 2025

mdemoret-nv requested changes Jun 11, 2025

AnuradhaKaruppiah added 2 commits June 11, 2025 15:04

Drop the config to pass the full dataset entry

8436185

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

Remove whitespace changes

1f3fca4

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

AnuradhaKaruppiah requested a review from Copilot June 11, 2025 22:11

Copilot AI reviewed Jun 11, 2025

AnuradhaKaruppiah added 2 commits June 11, 2025 15:17

Merge remote-tracking branch 'upstream/develop' into ak-custom-eval-u…

55a8a6b

…pdate

Fix unit tests

88821a7

Signed-off-by: Anuradha Karuppiah <anuradhak@nvidia.com>

yczhang-nv approved these changes Jun 12, 2025

mdemoret-nv approved these changes Jun 12, 2025

rapids-bot bot merged commit 4f05208 into NVIDIA:develop Jun 12, 2025
12 checks passed

AnuradhaKaruppiah deleted the ak-custom-eval-update branch June 25, 2025 15:33
