Optimize mutable torch.library.custom_op overhead by zou3519 · Pull Request #139513 · pytorch/pytorch · GitHub

Conversation

Contributor

@zou3519 zou3519 commented Nov 1, 2024

Stack from ghstack (oldest at bottom):


We don't need to loop over all the args and kwargs in the
ADInplaceOrView key; we just need to bump the version counter on the args and
kwargs that are mutable.

On the benchmark mentioned in
#139494
this change took the times from
```
mutate2 = 61.72943878173828
no_mutate2 = 36.89440155029297
mutate = 236.3092498779297
no_mutate = 59.31964874267578
```
to
```
mutate2 = 47.976478576660156
no_mutate2 = 38.37468719482422
mutate = 71.21315002441406
no_mutate = 59.7432975769043
```

Test Plan:
- existing tests
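
For readers skimming the change, here is a rough sketch of the idea, not the PR's actual diff: the mutable argument positions and keyword names are assumed to be precomputed once from the op's schema, so the per-call hot path only touches tensors that can actually change (the `mutated_idxs` / `mutated_keys` names mirror the hunk discussed further down).

```
import torch
from torch.autograd.graph import increment_version
from torch.utils._pytree import tree_flatten

# Old behavior (sketch): bump the version counter of every tensor argument,
# even read-only inputs, on every call.
def bump_all(args, kwargs):
    flat, _ = tree_flatten((args, kwargs))
    for x in flat:
        if isinstance(x, torch.Tensor):
            increment_version(x)

# New behavior (sketch): mutated_idxs / mutated_keys are precomputed once from
# the op's schema, so only the declared-mutable tensors are touched per call.
def bump_mutated(args, kwargs, mutated_idxs, mutated_keys):
    for idx in mutated_idxs:
        increment_version(args[idx])
    for key in mutated_keys:
        increment_version(kwargs[key])
```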


pytorch-bot bot commented Nov 1, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139513

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 56c2229 with merge base 5e4c8b6:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

zou3519 added a commit that referenced this pull request Nov 1, 2024
```
for idx in mutated_idxs:
    increment_version(args[idx])
for key in mutated_keys:
    increment_version(kwargs[key])
```
Contributor


If we're worried about Python overhead here, increment_version() should (as of recently) support Iterable[Tensor] as an argument: https://github.com/pytorch/pytorch/blob/main/torch/autograd/graph.py#L226
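
With that overload, the loop in the hunk above could collapse into a single call, roughly like the sketch below (assumes the Iterable[Tensor] support mentioned in the linked graph.py; `mutated_idxs` / `mutated_keys` are the same precomputed indices and keys as in the diff):

```
from torch.autograd.graph import increment_version

def bump_mutated(args, kwargs, mutated_idxs, mutated_keys):
    # One call over an iterable instead of N separate Python-level calls
    # (relies on increment_version accepting Iterable[Tensor]).
    tensors = [args[i] for i in mutated_idxs] + [kwargs[k] for k in mutated_keys]
    increment_version(tensors)
```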

Contributor Author


nice, thanks for pointing that out

Contributor Author


It's not clear to me if building an iterator to pass to increment_version is less expensive than calling increment_version in a loop, but I'll try it out
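
A throwaway microbenchmark along these lines could answer that; the sketch below is hypothetical, assumes the Iterable[Tensor] overload from the comment above, and its numbers will vary by build and machine:

```
import timeit
import torch
from torch.autograd.graph import increment_version

tensors = [torch.randn(2) for _ in range(4)]

def loop_calls():
    # one Python call per tensor
    for t in tensors:
        increment_version(t)

def single_call():
    # one call over the whole iterable
    increment_version(tensors)

print("loop  :", timeit.timeit(loop_calls, number=100_000))
print("single:", timeit.timeit(single_call, number=100_000))
```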

Contributor

@bdhirsh bdhirsh left a comment


Just a side note, but for compile we probably shouldn't be running the ADInplaceOrView kernel at runtime (if we are, then we should make sure that key is disabled when inductor runs), since AOTAutograd handles bumping version counters in its epilogue.
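
For illustration only, excluding that dispatch key for a region of code might look roughly like the sketch below; this uses private torch._C bindings, is not part of this PR, and `run_compiled_region` is a hypothetical stand-in for the inductor-generated call:

```
import torch

def call_without_adinplaceorview(run_compiled_region, *args, **kwargs):
    # Skip ADInplaceOrView for the duration of the call, on the assumption
    # that AOTAutograd's epilogue already bumps the version counters.
    keyset = torch._C.DispatchKeySet(torch._C.DispatchKey.ADInplaceOrView)
    with torch._C._ExcludeDispatchKeyGuard(keyset):
        return run_compiled_region(*args, **kwargs)
```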

Contributor Author

zou3519 commented Nov 4, 2024

Just a side note, but for compile we probably shouldn't be running the ADInplaceOrView kernel at runtime (if we are, then we should make sure that key is disabled when inductor runs), since AOTAutograd handles bumping version counters in its epilogue.

Makes sense, let me file another issue

@zou3519 zou3519 added the ciflow/trunk and release notes: composability labels Nov 5, 2024
Contributor Author

zou3519 commented Nov 5, 2024

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
Pull Request resolved: pytorch#139513
Approved by: https://github.com/bdhirsh
ghstack dependencies: pytorch#139509
@github-actions github-actions bot deleted the gh/zou3519/1086/head branch December 6, 2024 02:12