NLLLoss: validate target is 0D when input is 1D #161412

mansiag05 · 2025-08-25T16:32:28Z

Add a shape check in nll_loss_forward to error out when both input and target are 1D. Added a unit test to cover the incompatible 1D/1D case.

Fixes #157420

cc @albanD @ngimel @peterbell10 @cyyever @kurtamohler

pytorch-bot · 2025-08-25T16:32:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161412

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1398d84 with merge base c321111 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2025-08-25T16:32:35Z

The committers listed above are authorized under a signed CLA.

✅ login: mansiag05 / name: Mansi Agarwal (1398d84, c9a84ae, 5041fdb)

aten/src/ATen/native/LossNLL.cpp

test/test_nn.py

ngimel

Please don't submit BC-breaking changes. When you think documentation contradicts current behavior, changes should be made to documentation, not behavior. In this case, there is logic in implementation that sets batch_size to 1 if input is 1d, and there was a specific check in the file you were changing https://github.com/pytorch/pytorch/pull/161412/files#diff-0b104a4612220d611b2c5d76272c28ddb21efbd8f9b9ae29d7f5b6c9e2a9be1fR53 to make sure inputs are valid

Let's think it's over

mansiag05 · 2025-08-26T06:59:51Z

Hello all, thanks for the reviews and clarifications.
I wanted to follow up with an example to clarify the edge case we are discussing.

Examples:

Valid case:

input = torch.randn(10)        # [C] -> Internally [1,C]
target = torch.tensor(3)       # scalar, class 3 is the correct label.
F.nll_loss(input, target)      # works

Conceptually, nll_loss expects input.shape = [N, C], target.shape = [N].
When input is 1D ([C]), PyTorch internally treats it as a batch of size 1 ([1, C]). This means the target should be scalar (or [1]) to match the single batch sample.

Invalid case:

# 1D input and 1D target of same length
input1 = torch.randn(10)                           # shape [10] -> internally [1,10]
target1 = torch.randint(0, 10, (10,), dtype=torch.long)  # shape [10]
F.nll_loss(input1, target1)

Here PyTorch currently does not raise an error, even though logically this is invalid: a batch of size 1 cannot have 10 labels.
This seems to be a quirk in the current shape-checking logic: the implementation treats the 1D input as batch size 1 internally but does not fully enforce that the target must be scalar. So it accidentally accepts [10] as a target, even though it doesn’t align with the “batch size =1” interpretation.

Next Step

Given this, what would be the recommended approach for the PR?

Update the documentation to clearly explain that for 1D input, only scalar targets are valid?
Or additionally introduce a deprecation warning / stricter error check for the invalid 1D input + 1D target case?

Happy to adjust the PR based on guidance.

ngimel · 2025-08-26T16:08:07Z

neither. We could make the 1d-1d shape check correct, but we won't limit target to 1d, and we won'd deprecate.

mansiag05 · 2025-09-02T15:55:50Z

I've updated the check as per the comment.

Also, one more observation, when both the input and target tensors are 1D, the NLL loss function only uses the value at index 0 of the target tensor. The remaining values in the target tensor are ignored.

ngimel · 2025-09-02T20:42:46Z

Right, that's why we need better shape checks (to make sure target has 1 element in this case)

aten/src/ATen/native/LossNLL.cpp

mansiag05 · 2025-09-04T17:57:24Z

I've updated the shape check. Could you please review.

mansiag05 · 2025-09-05T18:08:11Z

@pytorchbot merge

pytorchmergebot · 2025-09-05T18:10:24Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-09-05T18:10:39Z

Merge failed

Reason: 26 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

ngimel · 2025-09-05T18:12:45Z

@pytorchbot rebase -b main

pytorchmergebot · 2025-09-05T18:14:16Z

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

Updating the UT assert message with ValueError instead of RuntimeError.

pytorchmergebot · 2025-09-05T18:14:19Z

Successfully rebased fix-issue-157420 onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout fix-issue-157420 && git pull --rebase)

mansiag05 · 2025-09-06T18:17:39Z

@pytorchbot merge

pytorchmergebot · 2025-09-06T18:19:26Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Add a shape check in nll_loss_forward to error out when both input and target are 1D. Added a unit test to cover the incompatible 1D/1D case. Fixes pytorch#157420 Pull Request resolved: pytorch#161412 Approved by: https://github.com/ngimel

t-vi · 2025-09-09T07:05:18Z

PyTorch used to allow 1d size 0 target with 1d size 0 input, is it intentional that this PR blocks this? It's a corner case, just wondering. I guess part of it is that there is no 0 element 0 dim tensor, but that's just how dimensions work.

Add a shape check in nll_loss_forward to error out when both input and target are 1D. Added a unit test to cover the incompatible 1D/1D case. Fixes pytorch#157420 Pull Request resolved: pytorch#161412 Approved by: https://github.com/ngimel

pytorch-bot bot added the release notes: nn release notes category label Aug 25, 2025

pytorchbot added the open source label Aug 25, 2025

Skylion007 reviewed Aug 25, 2025

View reviewed changes

aten/src/ATen/native/LossNLL.cpp Outdated Show resolved Hide resolved

test/test_nn.py Outdated Show resolved Hide resolved

Skylion007 previously approved these changes Aug 25, 2025

View reviewed changes

ngimel requested changes Aug 25, 2025

View reviewed changes

soulitzer added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Aug 26, 2025

soulitzer requested a review from mikaylagawarecki August 26, 2025 13:22

ngimel reviewed Sep 2, 2025

View reviewed changes

aten/src/ATen/native/LossNLL.cpp Outdated Show resolved Hide resolved

ngimel approved these changes Sep 4, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 5, 2025

pytorchmergebot added the merging label Sep 5, 2025

pytorchmergebot removed the merging label Sep 5, 2025

mansiag05 added 3 commits September 5, 2025 18:14

NLLLoss: validate target is 0D when input is 1D

5041fdb

Updating the NLL loss check with TORCH_CHECK_VALUE.

c9a84ae

Updating the UT assert message with ValueError instead of RuntimeError.

Updating the 1D-1D shape check for NLL loss.

1398d84

pytorchmergebot force-pushed the fix-issue-157420 branch from 3534048 to 1398d84 Compare September 5, 2025 18:14

pytorch-bot bot removed the ciflow/trunk Trigger trunk jobs on your pull request label Sep 5, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 6, 2025

pytorchmergebot added the merging label Sep 6, 2025

pytorchmergebot added the Merged label Sep 6, 2025

pytorchmergebot closed this in 5927a70 Sep 6, 2025

pytorchmergebot removed the merging label Sep 6, 2025

NLLLoss: validate target is 0D when input is 1D #161412

NLLLoss: validate target is 0D when input is 1D #161412

Uh oh!

Conversation

mansiag05 commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161412

✅ No Failures

Uh oh!

linux-foundation-easycla bot commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngimel left a comment

Choose a reason for hiding this comment

Uh oh!

mansiag05 commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ngimel commented Aug 26, 2025

Uh oh!

mansiag05 commented Sep 2, 2025

Uh oh!

ngimel commented Sep 2, 2025

Uh oh!

Uh oh!

mansiag05 commented Sep 4, 2025

Uh oh!

mansiag05 commented Sep 5, 2025

Uh oh!

pytorchmergebot commented Sep 5, 2025

Merge started

Uh oh!

pytorchmergebot commented Sep 5, 2025

Merge failed

Uh oh!

ngimel commented Sep 5, 2025

Uh oh!

pytorchmergebot commented Sep 5, 2025

Uh oh!

pytorchmergebot commented Sep 5, 2025

Uh oh!

mansiag05 commented Sep 6, 2025

Uh oh!

pytorchmergebot commented Sep 6, 2025

Merge started

Uh oh!

t-vi commented Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

mansiag05 commented Aug 25, 2025 •

edited

Loading

pytorch-bot bot commented Aug 25, 2025 •

edited

Loading

linux-foundation-easycla bot commented Aug 25, 2025 •

edited

Loading

mansiag05 commented Aug 26, 2025 •

edited

Loading