[ROCm] revamp HIPCachingAllocatorMasqueradingAsCUDA #161221

naromero77amd · 2025-08-21T23:21:19Z

HIPAllocatorMasqueradingAsCUDA and HIPCachingAllocatorMasqueradingAsCUDA are now proper complete wrappers of HIPAllocator and HIPCachingAllocator, respectively. HIPAllocatorMasqueradingAsCUDA now subclasses HIPAllocator instead of Allocator. This fixes usability of hipify replacing c10::cuda::CUDACachingAllocator::get() where callers expect a CUDAAllocator to be returned but instead were getting a very thin Allocator shim instead.

This also fixes using cudagraph trees with torch compile. The hip:0 device was not being replaced by the cuda:0 device in all methods.

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang

HIPAllocatorMasqueradingAsCUDA and HIPCachingAllocatorMasqueradingAsCUDA are now proper complete wrappers of HIPAllocator and HIPCachingAllocator, respectively. HIPAllocatorMasqueradingAsCUDA now subclasses HIPAllocator instead of Allocator. This fixes usability of hipify replacing c10::cuda::CUDACachingAllocator::get() where callers expect a CUDAAllocator to be returned but instead were getting a very thin Allocator shim instead. This also fixes using cudagraph trees with torch compile. The hip:0 device was not being replaced by the cuda:0 device in all methods.

pytorch-bot · 2025-08-21T23:21:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161221

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 1b79a2e with merge base 16ada80 ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

inductor-perf-nightly-rocm / rocm-py3_10-inductor-benchmark-test / test (inductor_huggingface_perf_rocm, 1, 4, linux.rocm.gpu.gfx942.1) (gh) (similar failure)
Process completed with exit code 134.
inductor-perf-nightly-rocm / rocm-py3_10-inductor-benchmark-test / test (inductor_huggingface_perf_rocm, 3, 4, linux.rocm.gpu.gfx942.1) (gh) (similar failure)
Process completed with exit code 134.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jeffdaily · 2025-08-22T14:51:27Z

@pytorchbot merge

pytorchmergebot · 2025-08-22T14:53:32Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

HIPAllocatorMasqueradingAsCUDA and HIPCachingAllocatorMasqueradingAsCUDA are now proper complete wrappers of HIPAllocator and HIPCachingAllocator, respectively. HIPAllocatorMasqueradingAsCUDA now subclasses HIPAllocator instead of Allocator. This fixes usability of hipify replacing c10::cuda::CUDACachingAllocator::get() where callers expect a CUDAAllocator to be returned but instead were getting a very thin Allocator shim instead. This also fixes using cudagraph trees with torch compile. The hip:0 device was not being replaced by the cuda:0 device in all methods. Pull Request resolved: pytorch#161221 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>

cherry pick of upstream pytorch#161221.

cherry pick of pytorch#161221

HIPAllocatorMasqueradingAsCUDA and HIPCachingAllocatorMasqueradingAsCUDA are now proper complete wrappers of HIPAllocator and HIPCachingAllocator, respectively. HIPAllocatorMasqueradingAsCUDA now subclasses HIPAllocator instead of Allocator. This fixes usability of hipify replacing c10::cuda::CUDACachingAllocator::get() where callers expect a CUDAAllocator to be returned but instead were getting a very thin Allocator shim instead. This also fixes using cudagraph trees with torch compile. The hip:0 device was not being replaced by the cuda:0 device in all methods. Pull Request resolved: pytorch#161221 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>

naromero77amd requested review from jeffdaily and jithunnair-amd as code owners August 21, 2025 23:21

pytorch-bot bot added the module: rocm AMD GPU support for Pytorch label Aug 21, 2025

jeffdaily added ciflow/rocm Trigger "default" config CI on ROCm ciflow/inductor-perf-test-nightly-rocm-mi300 Trigger inductor perf tests on ROCm MI300 release notes: rocm mandatorylabel labels Aug 21, 2025

jeffdaily approved these changes Aug 21, 2025

View reviewed changes

pytorchbot added the open source label Aug 21, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 22, 2025

pytorchmergebot added the merging label Aug 22, 2025

pytorchmergebot added the Merged label Aug 22, 2025

pytorchmergebot closed this in 25df65a Aug 22, 2025

pytorchmergebot removed the merging label Aug 22, 2025

jeffdaily mentioned this pull request Sep 2, 2025

[release/2.8] revamp HIPCachingAllocatorMasqueradingAsCUDA ROCm/pytorch#2592

Merged

jeffdaily mentioned this pull request Sep 2, 2025

[release/2.7] revamp HIPCachingAllocatorMasqueradingAsCUDA ROCm/pytorch#2593

Merged

jeffdaily mentioned this pull request Sep 2, 2025

[release/2.6] revamp HIPCachingAllocatorMasqueradingAsCUDA ROCm/pytorch#2594

Merged

pruthvistony pushed a commit to ROCm/pytorch that referenced this pull request Sep 3, 2025

[release/2.8] revamp HIPCachingAllocatorMasqueradingAsCUDA (#2592)

9596b8b

cherry pick of upstream pytorch#161221.

pruthvistony pushed a commit to ROCm/pytorch that referenced this pull request Sep 3, 2025

[release/2.7] revamp HIPCachingAllocatorMasqueradingAsCUDA (#2593)

e3cca5e

cherry pick of pytorch#161221

pruthvistony pushed a commit to ROCm/pytorch that referenced this pull request Sep 3, 2025

[release/2.6] revamp HIPCachingAllocatorMasqueradingAsCUDA (#2594)

0568140

cherry pick of pytorch#161221

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ROCm] revamp HIPCachingAllocatorMasqueradingAsCUDA #161221

[ROCm] revamp HIPCachingAllocatorMasqueradingAsCUDA #161221

Uh oh!

naromero77amd commented Aug 21, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Aug 21, 2025 •

edited

Loading

Uh oh!

jeffdaily commented Aug 22, 2025

Uh oh!

pytorchmergebot commented Aug 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[ROCm] revamp HIPCachingAllocatorMasqueradingAsCUDA #161221

[ROCm] revamp HIPCachingAllocatorMasqueradingAsCUDA #161221

Uh oh!

Conversation

naromero77amd commented Aug 21, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161221

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

jeffdaily commented Aug 22, 2025

Uh oh!

pytorchmergebot commented Aug 22, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

naromero77amd commented Aug 21, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Aug 21, 2025 •

edited

Loading