[ONNX] Fix rotary_embedding_23 implementation #162865

justinchuby · 2025-09-13T00:48:01Z

The implementation of rotary_embedding_23 when input is 3D was incorrect.

Tested

Locally with

import onnx_ir as ir
import onnx
import torch
import os
import numpy as np

base_path = "/home/justinchu/dev/onnx/onnx/backend/test/data/node"
test_names = [
    "test_rotary_embedding",
    "test_rotary_embedding_3d_input",
    "test_rotary_embedding_interleaved",
    "test_rotary_embedding_no_position_ids",
    "test_rotary_embedding_no_position_ids_interleaved",
    "test_rotary_embedding_no_position_ids_rotary_dim",
    "test_rotary_embedding_with_interleaved_rotary_dim",
    "test_rotary_embedding_with_rotary_dim",
]
model_paths = [os.path.join(base_path, name) for name in test_names]


for path in model_paths:
    print(f"Checking {path} for issues...")

    model = onnx.load(os.path.join(path, "model.onnx"))
    input0 = ir.from_proto(
        onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_0.pb"))
    ).numpy()
    input1 = ir.from_proto(
        onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_1.pb"))
    ).numpy()
    input2 = ir.from_proto(
        onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_2.pb"))
    ).numpy()
    if os.path.exists(os.path.join(path, "test_data_set_0", "input_3.pb")):
        input3 = ir.from_proto(
            onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_3.pb"))
        ).numpy()
    else:
        input3 = None
    output0 = ir.from_proto(
        onnx.load_tensor(os.path.join(path, "test_data_set_0", "output_0.pb"))
    ).numpy()

    m = ir.from_proto(model)

    node = m.graph[-1]
    print(node)
    assert node.op_type == "RotaryEmbedding"

    interleaved = node.attributes.get_int("interleaved", 0)
    num_heads = node.attributes.get_int("num_heads", 0)
    rotary_embedding_dim = node.attributes.get_int("rotary_embedding_dim", 0)

    torch_out = torch.onnx.ops.rotary_embedding(
        torch.tensor(input0),
        torch.tensor(input1),
        torch.tensor(input2),
        position_ids=torch.tensor(input3) if input3 is not None else None,
        interleaved=bool(interleaved),
        num_heads=num_heads,
        rotary_embedding_dim=rotary_embedding_dim,
    )
    torch_out = torch_out.detach().cpu().numpy()
    np.testing.assert_allclose(torch_out, output0)

Fix #162848

cc @titaiwangms @kunal-vaishnavi

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

pytorch-bot · 2025-09-13T00:48:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/162865

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3cc0463 with merge base a94ddd9 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

justinchuby · 2025-09-13T01:30:48Z

Cross reference

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

torch/onnx/ops/_impl.py

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

test/onnx/ops/test_ops.py

torch/onnx/ops/_impl.py

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

justinchuby · 2025-09-16T00:45:24Z

@pytorchbot merge

pytorchmergebot · 2025-09-16T00:47:08Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

justinchuby · 2025-09-16T00:48:06Z

@pytorchbot merge

pytorchmergebot · 2025-09-16T00:48:27Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

pytorchmergebot · 2025-09-16T00:50:16Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-09-16T01:17:06Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Lint / Test collect_env (with_torch, linux.24_04.4x)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

justinchuby · 2025-09-16T01:53:01Z

@pytorchbot merge -i

pytorchmergebot · 2025-09-16T01:54:46Z

Merge started

Your change will be merged while ignoring the following 0 checks:

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

justinchuby · 2025-09-16T04:37:21Z

@pytorchbot cherry-pick --onto release/2.9 --fixes "ONNX operator fix for the new dynamo export feature" -c critical

The implementation of rotary_embedding_23 when input is 3D was incorrect. ## Tested Locally with ```py import onnx_ir as ir import onnx import torch import os import numpy as np base_path = "/home/justinchu/dev/onnx/onnx/backend/test/data/node" test_names = [ "test_rotary_embedding", "test_rotary_embedding_3d_input", "test_rotary_embedding_interleaved", "test_rotary_embedding_no_position_ids", "test_rotary_embedding_no_position_ids_interleaved", "test_rotary_embedding_no_position_ids_rotary_dim", "test_rotary_embedding_with_interleaved_rotary_dim", "test_rotary_embedding_with_rotary_dim", ] model_paths = [os.path.join(base_path, name) for name in test_names] for path in model_paths: print(f"Checking {path} for issues...") model = onnx.load(os.path.join(path, "model.onnx")) input0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_0.pb")) ).numpy() input1 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_1.pb")) ).numpy() input2 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_2.pb")) ).numpy() if os.path.exists(os.path.join(path, "test_data_set_0", "input_3.pb")): input3 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_3.pb")) ).numpy() else: input3 = None output0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "output_0.pb")) ).numpy() m = ir.from_proto(model) node = m.graph[-1] print(node) assert node.op_type == "RotaryEmbedding" interleaved = node.attributes.get_int("interleaved", 0) num_heads = node.attributes.get_int("num_heads", 0) rotary_embedding_dim = node.attributes.get_int("rotary_embedding_dim", 0) torch_out = torch.onnx.ops.rotary_embedding( torch.tensor(input0), torch.tensor(input1), torch.tensor(input2), position_ids=torch.tensor(input3) if input3 is not None else None, interleaved=bool(interleaved), num_heads=num_heads, rotary_embedding_dim=rotary_embedding_dim, ) torch_out = torch_out.detach().cpu().numpy() np.testing.assert_allclose(torch_out, output0) ``` Fix #162848 Pull Request resolved: #162865 Approved by: https://github.com/kunal-vaishnavi, https://github.com/titaiwangms (cherry picked from commit fdf68fa)

pytorchbot · 2025-09-16T04:42:24Z

Cherry picking #162865

The cherry pick PR is at #163041 and it is linked with issue ONNX operator fix for the new dynamo export feature. The following tracker issues are updated:

[v.2.9.0] Release Tracker #162497 (comment)

Details for Dev Infra team

Raised by workflow job

[ONNX] Fix rotary_embedding_23 implementation (#162865) The implementation of rotary_embedding_23 when input is 3D was incorrect. ## Tested Locally with ```py import onnx_ir as ir import onnx import torch import os import numpy as np base_path = "/home/justinchu/dev/onnx/onnx/backend/test/data/node" test_names = [ "test_rotary_embedding", "test_rotary_embedding_3d_input", "test_rotary_embedding_interleaved", "test_rotary_embedding_no_position_ids", "test_rotary_embedding_no_position_ids_interleaved", "test_rotary_embedding_no_position_ids_rotary_dim", "test_rotary_embedding_with_interleaved_rotary_dim", "test_rotary_embedding_with_rotary_dim", ] model_paths = [os.path.join(base_path, name) for name in test_names] for path in model_paths: print(f"Checking {path} for issues...") model = onnx.load(os.path.join(path, "model.onnx")) input0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_0.pb")) ).numpy() input1 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_1.pb")) ).numpy() input2 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_2.pb")) ).numpy() if os.path.exists(os.path.join(path, "test_data_set_0", "input_3.pb")): input3 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_3.pb")) ).numpy() else: input3 = None output0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "output_0.pb")) ).numpy() m = ir.from_proto(model) node = m.graph[-1] print(node) assert node.op_type == "RotaryEmbedding" interleaved = node.attributes.get_int("interleaved", 0) num_heads = node.attributes.get_int("num_heads", 0) rotary_embedding_dim = node.attributes.get_int("rotary_embedding_dim", 0) torch_out = torch.onnx.ops.rotary_embedding( torch.tensor(input0), torch.tensor(input1), torch.tensor(input2), position_ids=torch.tensor(input3) if input3 is not None else None, interleaved=bool(interleaved), num_heads=num_heads, rotary_embedding_dim=rotary_embedding_dim, ) torch_out = torch_out.detach().cpu().numpy() np.testing.assert_allclose(torch_out, output0) ``` Fix #162848 Pull Request resolved: #162865 Approved by: https://github.com/kunal-vaishnavi, https://github.com/titaiwangms (cherry picked from commit fdf68fa) Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com>

The implementation of rotary_embedding_23 when input is 3D was incorrect. ## Tested Locally with ```py import onnx_ir as ir import onnx import torch import os import numpy as np base_path = "/home/justinchu/dev/onnx/onnx/backend/test/data/node" test_names = [ "test_rotary_embedding", "test_rotary_embedding_3d_input", "test_rotary_embedding_interleaved", "test_rotary_embedding_no_position_ids", "test_rotary_embedding_no_position_ids_interleaved", "test_rotary_embedding_no_position_ids_rotary_dim", "test_rotary_embedding_with_interleaved_rotary_dim", "test_rotary_embedding_with_rotary_dim", ] model_paths = [os.path.join(base_path, name) for name in test_names] for path in model_paths: print(f"Checking {path} for issues...") model = onnx.load(os.path.join(path, "model.onnx")) input0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_0.pb")) ).numpy() input1 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_1.pb")) ).numpy() input2 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_2.pb")) ).numpy() if os.path.exists(os.path.join(path, "test_data_set_0", "input_3.pb")): input3 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "input_3.pb")) ).numpy() else: input3 = None output0 = ir.from_proto( onnx.load_tensor(os.path.join(path, "test_data_set_0", "output_0.pb")) ).numpy() m = ir.from_proto(model) node = m.graph[-1] print(node) assert node.op_type == "RotaryEmbedding" interleaved = node.attributes.get_int("interleaved", 0) num_heads = node.attributes.get_int("num_heads", 0) rotary_embedding_dim = node.attributes.get_int("rotary_embedding_dim", 0) torch_out = torch.onnx.ops.rotary_embedding( torch.tensor(input0), torch.tensor(input1), torch.tensor(input2), position_ids=torch.tensor(input3) if input3 is not None else None, interleaved=bool(interleaved), num_heads=num_heads, rotary_embedding_dim=rotary_embedding_dim, ) torch_out = torch_out.detach().cpu().numpy() np.testing.assert_allclose(torch_out, output0) ``` Fix pytorch#162848 Pull Request resolved: pytorch#162865 Approved by: https://github.com/kunal-vaishnavi, https://github.com/titaiwangms

justinchuby added 4 commits September 12, 2025 17:22

[ONNX] Fix rotary_embedding_23 implementation

0cf0e9e

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

Add test

c3e6005

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

test

bdc1931

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

More test

54b4682

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

justinchuby requested review from jambayk and xadupre September 13, 2025 00:48

justinchuby requested a review from titaiwangms as a code owner September 13, 2025 00:48

pytorch-bot bot added the release notes: onnx torch.onnx related changes that should show up in the release notes label Sep 13, 2025

justinchuby added module: onnx Related to torch.onnx topic: bug fixes topic category labels Sep 13, 2025

justinchuby added this to the 2.9.0 milestone Sep 13, 2025

justinchuby marked this pull request as draft September 13, 2025 00:50

justinchuby marked this pull request as ready for review September 13, 2025 00:50

pytorchbot added the open source label Sep 13, 2025

justinchuby added 2 commits September 12, 2025 18:14

Fix comment

78312b8

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

checks

b5112bb

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

justinchuby added 2 commits September 12, 2025 20:40

checks

fc9df99

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

Fix tests

a65bf87

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

titaiwangms reviewed Sep 15, 2025

View reviewed changes

torch/onnx/ops/_impl.py Show resolved Hide resolved

torch/onnx/ops/_impl.py Outdated Show resolved Hide resolved

Fix test

7dd8f7c

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

xadupre reviewed Sep 15, 2025

View reviewed changes

test/onnx/ops/test_ops.py Show resolved Hide resolved

jambayk reviewed Sep 15, 2025

View reviewed changes

torch/onnx/ops/_impl.py Outdated Show resolved Hide resolved

msg

1a98570

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

kunal-vaishnavi approved these changes Sep 15, 2025

View reviewed changes

titaiwangms approved these changes Sep 15, 2025

View reviewed changes

Check sin/cos size

d089942

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 16, 2025

pytorchmergebot added the merging label Sep 16, 2025

error message

3cc0463

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

pytorchmergebot removed the merging label Sep 16, 2025

pytorchmergebot added the merging label Sep 16, 2025

pytorchmergebot added the Merged label Sep 16, 2025

pytorchmergebot closed this in fdf68fa Sep 16, 2025

pytorchmergebot removed the merging label Sep 16, 2025

pytorchbot mentioned this pull request Sep 16, 2025

[v.2.9.0] Release Tracker #162497

Closed

[ONNX] Fix rotary_embedding_23 implementation #162865

[ONNX] Fix rotary_embedding_23 implementation #162865

Uh oh!

Conversation

justinchuby commented Sep 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tested

Uh oh!

pytorch-bot bot commented Sep 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/162865

✅ No Failures

Uh oh!

justinchuby commented Sep 13, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

justinchuby commented Sep 16, 2025

Uh oh!

pytorchmergebot commented Sep 16, 2025

Merge started

Uh oh!

justinchuby commented Sep 16, 2025

Uh oh!

pytorchmergebot commented Sep 16, 2025

Uh oh!

pytorchmergebot commented Sep 16, 2025

Merge started

Uh oh!

pytorchmergebot commented Sep 16, 2025

Merge failed

Uh oh!

justinchuby commented Sep 16, 2025

Uh oh!

pytorchmergebot commented Sep 16, 2025

Merge started

Uh oh!

justinchuby commented Sep 16, 2025

Uh oh!

pytorchbot commented Sep 16, 2025

Cherry picking #162865

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

justinchuby commented Sep 13, 2025 •

edited

Loading

pytorch-bot bot commented Sep 13, 2025 •

edited

Loading