Add ScalarType -> shim conversion, add stable::Tensor.scalar_type #160557
Conversation
[ghstack-poisoned]
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160557
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit f12f9d1 with merge base a44a0d3. This comment was automatically generated by Dr. CI and updates every 15 minutes.
stack[0] = from(t);
- stack[1] = from(std::optional(t_dtype)); // dtype
+ stack[1] = from(std::optional(t.scalar_type())); // dtype
For testing
The to/from logic of this file got moved to utils.h
This file got split into tensor-struct.h, which has everything before the PR, and tensor-inl.h, which implements scalar_type since it relies on from/to. The reason for the split is to allow the code to build without circular dependencies. Without it, tensor.h would depend on library.h (for to/from) and library.h would depend on tensor.h (because to/from for Tensor needs a Tensor definition).
Now, utils.h (which has to/from) depends on tensor-struct.h, tensor-inl.h depends on both utils.h and tensor-struct.h, and users still depend on tensor.h, which depends on all of the above.
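A minimal single-file sketch of that layering (all names here are simplified stand-ins for the real headers; `dtype_code` is a hypothetical field standing in for the opaque handle) shows why the cycle disappears: the struct, the conversions, and the inline method bodies appear in the same order the includes resolve.

```cpp
#include <cstdint>

// "tensor-struct.h": only the Tensor definition; no to/from needed here.
struct Tensor {
  int32_t dtype_code;           // hypothetical stand-in for the opaque handle
  int32_t scalar_type() const;  // declared here, defined below ("tensor-inl.h")
};

// "stableivalue_conversions.h": to/from depends only on the struct above.
inline int32_t from(const Tensor& t) { return t.dtype_code; }

// "tensor-inl.h": the method body can now use from/to without a cycle.
inline int32_t Tensor::scalar_type() const { return from(*this); }
```

Each "file" depends only on what came before it, so no header ever needs to include one that includes it back.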
case ScalarType::UInt64:
  return from(aoti_torch_dtype_uint64());
default:
  throw std::runtime_error(
NOTE!! THIS IS WHERE I WANT REVIEW!! cc @albanD
Prior to this, if we had an IValue dtype that was qint8, from_ivalue would call from(ScalarType::QInt8), and the code would just reinterpret the enum and spit out the corresponding int32_t. This was okay because ScalarType wasn't exposed to the end user; all they had to work with was an abstracted int32_t that they would get from the C shim.
However, with this change, from(ScalarType::QInt8) would error! Now that ScalarType is allowed to be used by the end user, who can call this function directly, naively reinterpreting the enum is no longer okay when the extension's ScalarType differs from libtorch's ScalarType. I think erroring is acceptable because these other types are infrequently used anyway, but maybe I am wrong about that. e.g., @swolchok, are the Bits ScalarTypes used in ET?
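To illustrate the hazard in isolation (all names and numeric codes below are hypothetical, not the real shim constants): an enum baked into the extension binary may not share values with the enum in libtorch, so an explicit switch that errors on unmapped dtypes is the safe translation, whereas a reinterpret-cast of the raw value would silently mean the wrong dtype.

```cpp
#include <cstdint>
#include <stdexcept>

// Hypothetical extension-side enum whose numeric values need not match
// libtorch's ScalarType across binaries.
enum class ExtScalarType : int8_t { Float = 0, Long = 1, QInt8 = 2 };

// Assumed stable shim codes (illustrative values only).
constexpr int32_t kShimFloat = 6;
constexpr int32_t kShimLong = 4;

// Explicit translation: only dtypes in the stable set convert; anything
// else errors instead of being silently reinterpreted.
int32_t to_shim_dtype(ExtScalarType t) {
  switch (t) {
    case ExtScalarType::Float: return kShimFloat;
    case ExtScalarType::Long:  return kShimLong;
    default:
      throw std::runtime_error("dtype not supported by the stable ABI");
  }
}
```

Under this scheme a call with QInt8, which previously "worked" via reinterpretation, now throws, which is the BC break being discussed.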
Answering your specific question:
I haven't (yet?) made any attempt to use PyTorch's ScalarType in ExecuTorch. ExecuTorch has https://github.com/pytorch/executorch/blob/52b45e2d2ac244b13a36ddf5d21a9ebe8d8aa17e/runtime/core/portable_type/scalar_type.h#L132
PyTorch's ScalarType will get used in ExecuTorch's ATen mode, though. https://github.com/pytorch/executorch/blob/52b45e2d2ac244b13a36ddf5d21a9ebe8d8aa17e/runtime/core/exec_aten/exec_aten.h#L82
I don't know what the Bits ScalarTypes even are, but ExecuTorch seems to have its own versions of them that it uses: https://github.com/search?q=repo%3Apytorch%2Fexecutorch+ScalarType%3A%3ABits+language%3AC%2B%2B&type=code&l=C%2B%2B
In general: it is not backward compatible to change functionality such that a call that previously succeeded (and really did work fine) is now an error.
I've concluded it is okay to BC-break here, given that GitHub search yields 0 users of the narrow use case this code would break. Updated the PR body accordingly.
this code is copy-pasted EXCEPT for the specializations for torch::headeronly::ScalarType
…ar_type" This change _modifies_ the from/to behavior between ScalarType and StableValue! Then we add a Tensor scalar_type API which reuses the from/to logic to return to the user a nice ScalarType (vs an abstracted int32_t). I then changed the test to test the scalar_type API. This code change required some refactoring because of circular dependencies. [ghstack-poisoned]
@@ -0,0 +1,342 @@
#pragma once |
I firmly dislike naming things "utils" because it is synonymous with "stuff" and helps neither predict their current contents nor limit their future contents. Instead I would consider a specific name, like say StableIValueConversions.h .
renamed!!!!!! stableivalue_conversions.h
struct FromImpl<ScalarType> {
  static StableIValue call(ScalarType val) {
    switch (val) {
      case ScalarType::Byte:
This feels like it could benefit from a #define of sorts that iterates over known dtypes (if dtypes are part of the stable API...)
I was trying to figure out a way, but it did not seem worth it for this PR. (Also there's no "is dtype part of stable API" function we can call yet).
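For reference, the kind of define the reviewer is suggesting is an X-macro: one list of (enum case, shim getter) pairs that generates the switch, so adding a dtype touches a single line. This is a hedged sketch with made-up getters and values, not the real shim API.

```cpp
#include <cstdint>
#include <stdexcept>

enum class ScalarType : int8_t { Byte, Char, Short };

// Hypothetical shim getters; the real ones query libtorch through the C shim.
inline int32_t shim_uint8() { return 0; }
inline int32_t shim_int8()  { return 1; }
inline int32_t shim_int16() { return 2; }

// The single list that drives the switch below.
#define FORALL_STABLE_DTYPES(_) \
  _(Byte, shim_uint8)           \
  _(Char, shim_int8)            \
  _(Short, shim_int16)

inline int32_t to_shim(ScalarType t) {
  switch (t) {
#define DEFINE_CASE(name, getter) \
  case ScalarType::name:          \
    return getter();
    FORALL_STABLE_DTYPES(DEFINE_CASE)
#undef DEFINE_CASE
  }
  throw std::runtime_error("unsupported dtype");
}
```

The same FORALL list could also generate the reverse mapping, which is the main appeal over two hand-maintained switches.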
auto inner_val = to<T>(*sivp);

// free the memory associated with StableIValue* sivp
delete sivp;
This feels very suspicious.. Why not pass the value as unique_ptr?
I'm not sure about passing it around as std::unique_ptr, but you could certainly declare std::unique_ptr<StableIValue> sivp = to<StableIValue*>(val); above and insulate yourself against the inner to<T> throwing an exception.
will address in a followup
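The exception-safety point can be shown in miniature (types and the throwing converter here are simplified stand-ins, not the real to<T>): taking ownership in a unique_ptr before converting frees the pointee on every path, whereas a raw `delete` after the conversion is skipped if the converter throws.

```cpp
#include <memory>
#include <stdexcept>

struct StableIValue { int payload; };

// A converter that may throw, as to<T> can.
inline int to_int(const StableIValue& v) {
  if (v.payload < 0) throw std::runtime_error("bad value");
  return v.payload;
}

// Ownership is taken before conversion, so the pointee is freed even on the
// throwing path; with a trailing raw `delete`, a throw here would leak.
inline int unwrap(StableIValue* raw) {
  std::unique_ptr<StableIValue> sivp(raw);
  return to_int(*sivp);
}
```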
@@ -0,0 +1,342 @@
#pragma once |
nit: is "stableivalue_conversions.h" really the conventional formatting here? I would've expected "StableIValueConversions.h", as with torch/headeronly/core/ScalarType.h
I think in general it would be good to document the header format, as I've seen CamelCase.h, snake_case.h, and something-weird.h (for example tensor-inl.h in this PR). For example, aoti follows snake_case.
idk the conventional format, but I was operating under the convention of using Caps when there was a struct definition of the same name, and lowercase for more thematic naming (like "everything in this file relates to ____"). The -dashes I'm copying from c10's Half.h and Half-inl.h, where the dash means the file is a continuation: these files would be together if possible but are broken apart for some other reason.
I'm happy to follow an existing header notation though, if there is one.
return from(aoti_torch_dtype_uint8());
case ScalarType::Char:
  return from(aoti_torch_dtype_int8());
case ScalarType::Short:
Q: Why not add unsigned types here?
I was following the convention of the actual enum order
int32_t shim_scalartype = to<int32_t>(val);
if (shim_scalartype == aoti_torch_dtype_uint8()) {
Is it possible to cast it to some enum and use a switch statement? (That would force devs to add options there when a new dtype is added.) Otherwise this statement will be the result of a never-ending series of "Added missing XYZ" fixes.
Well, the whole reason to have this is that the ScalarType enum baked into the user binary isn't necessarily the same as the libtorch binary's ScalarType, which is why I'm passing ints through the shim.
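There is also a mechanical obstacle to the suggested switch: case labels must be compile-time constants, while the shim dtype codes arrive as runtime values from calls across the ABI boundary. A hedged sketch with made-up getters (not the real aoti_torch_dtype_* functions):

```cpp
#include <cstdint>
#include <stdexcept>

enum class ScalarType : int8_t { Byte, Char };

// Hypothetical shim getters returning the stable integer codes; the real
// ones are C functions resolved against whatever libtorch is loaded.
inline int32_t shim_uint8() { return 0; }
inline int32_t shim_int8()  { return 1; }

// The incoming value is a runtime int32_t from another binary; getter calls
// are not constant expressions, so an if/else chain is used rather than a
// switch over this binary's enum.
inline ScalarType to_scalar_type(int32_t shim) {
  if (shim == shim_uint8()) return ScalarType::Byte;
  if (shim == shim_int8())  return ScalarType::Char;
  throw std::runtime_error("unsupported shim dtype");
}
```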
return ScalarType::Float8_e5m2fnuz;
} else if (shim_scalartype == aoti_torch_dtype_float8_e4m3fnuz()) {
  return ScalarType::Float8_e4m3fnuz;
} else if (shim_scalartype == aoti_torch_dtype_uint16()) {
Why do you support unsigned dtypes here but not in the previous switch statement?
They're supported above too
@@ -0,0 +1,24 @@
#pragma once |
See my above comment. Use either CamelCase or snake_case
okay, I'll switch to snake_case for these
void* data_ptr() const {
  void* data_ptr;
  TORCH_ERROR_CODE_CHECK(aoti_torch_get_data_ptr(ath_.get(), &data_ptr));
  return data_ptr;
}

int64_t dim() const {
  int64_t dim;
  TORCH_ERROR_CODE_CHECK(aoti_torch_get_dim(ath_.get(), &dim));
  return dim;
}

int64_t numel() const {
  int64_t numel;
  TORCH_ERROR_CODE_CHECK(aoti_torch_get_numel(ath_.get(), &numel));
  return numel;
}
Please, please use a macro to get rid of the copy-pasta:
Suggested change:

#define _DEF_CONST_ACCESSOR_METHOD(NAME, DTYPE) \
  DTYPE NAME() const { \
    DTYPE rc; \
    TORCH_ERROR_CODE_CHECK(aoti_torch_get_##NAME(ath_.get(), &rc)); \
    return rc; \
  }

_DEF_CONST_ACCESSOR_METHOD(data_ptr, void*);
_DEF_CONST_ACCESSOR_METHOD(dim, int64_t);
_DEF_CONST_ACCESSOR_METHOD(numel, int64_t);
#undef _DEF_CONST_ACCESSOR_METHOD
It is more readable currently to see these APIs as is. If we need to add more, I will consider using a preprocessor macro.
…ar_type"

TL;DR: Moving to ScalarType in user extensions and removing deprecated dtypes.

This change _modifies_ the from/to behavior between ScalarType and StableIValue! Whereas before, user extensions could only pass around obfuscated dtypes appearing as abstract int32_ts, now users can confidently use torch::headeronly::ScalarType in their extensions for major scalar types. This PR enables ABI stability by adding a translation layer through the shim, so that even if the ScalarType enum values change in the future, user extensions need not fear.

Then we add a Tensor scalar_type API which reuses the from/to logic to return to the user a nice ScalarType (vs an abstracted int32_t). I then changed the test to test the scalar_type API.

This code change required some refactoring because of circular dependencies.

## BC Breaking note

This commit is (narrowly) BC-breaking for unpopular dtypes: `quint*`s, `qint*`s, `Bits*`, `dummy_uint*`s, `dummy_int*`s, `Float8_e8m0fnu`, and `Float4_e2m1fn_x2`, in the narrow use case where an extension retrieves a Tensor dtype of the above and passes it into `aoti_torch_call_dispatcher`. As of now, I believe there are 0 users of this use case, so the benefits of this change significantly justify BC-breaking this API.

[ghstack-poisoned]
@pytorchbot merge |
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Pull Request resolved: #159508 Approved by: https://github.com/janeyx99 ghstack dependencies: #160557
…torch#160557) TL;DR: Moving to ScalarType in user extensions and removing deprecated dtypes. This change _modifies_ the from/to behavior between ScalarType and StableValue! Whereas before, user extensions could only in abstract pass around obfuscated dtypes appearing as int32_ts, now, users can confidently use torch::headeronly::ScalarType in their extensions for major scalar types. This PR enables ABI stability by adding a translation layer through the shim, so that even if the ScalarType enum values change in the future, user extensions need not fear. Then we add a Tensor scalar_type API which reuses the from/to logic to return to the user a nice ScalarType (vs an abstracted int32_t). I then changed the test to test the scalar_type API. This code change required some refactoring because of circular dependencies. ## BC Breaking note This commit is (narrowly) BC-breaking for unpopular dtypes: `quint*`s, `qint*`s, `Bits*`, `dummy_uint*`s, `dummy_int*`s, `Float8_e8m0fnu`, and `Float4_e2m1fn_x2` in the narrow use case where an extension retrieves a Tensor dtype of the above and passes it into `aoti_torch_call_dispatcher`. As of now, I believe there are 0 users of this use case, so the benefits of this change significantly justify BC-breaking this API. Pull Request resolved: pytorch#160557 Approved by: https://github.com/mikaylagawarecki, https://github.com/malfet
…9508) Pull Request resolved: pytorch#159508 Approved by: https://github.com/janeyx99 ghstack dependencies: pytorch#160557
TL;DR: Moving to ScalarType in user extensions and removing deprecated dtypes.
This change modifies the from/to behavior between ScalarType and StableIValue! Whereas before, user extensions could only pass around obfuscated dtypes appearing as abstract int32_ts, now users can confidently use torch::headeronly::ScalarType in their extensions for major scalar types. This PR enables ABI stability by adding a translation layer through the shim, so that even if the ScalarType enum values change in the future, user extensions need not fear.
Then we add a Tensor scalar_type API which reuses the from/to logic to return to the user a nice ScalarType (vs an abstracted int32_t).
I then changed the test to test the scalar_type API.
This code change required some refactoring because of circular dependencies.
## BC Breaking note

This commit is (narrowly) BC-breaking for unpopular dtypes: `quint*`s, `qint*`s, `Bits*`, `dummy_uint*`s, `dummy_int*`s, `Float8_e8m0fnu`, and `Float4_e2m1fn_x2`, in the narrow use case where an extension retrieves a Tensor dtype of the above and passes it into `aoti_torch_call_dispatcher`. As of now, I believe there are 0 users of this use case, so the benefits of this change significantly justify BC-breaking this API.

Stack from ghstack (oldest at bottom):