[inductor] Minor refactor of hip compile_meta #143815
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/143815
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 1e13b0a with merge base a8ac3a6.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```python
compile_meta["constants"][k] = v
cfg_kwargs = cfg.kwargs
if self.device_props.type == "hip":
    cfg_kwargs = {**cfg_kwargs}
```
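For context, the hunk above makes a shallow copy of `cfg.kwargs` on HIP, presumably so that later HIP-specific adjustments don't mutate the shared config. A minimal sketch of that copy-before-mutate pattern; the function name, `is_hip` flag, and option key below are made up for illustration, not the PR's actual fields:

```python
# Hypothetical illustration of the copy-before-mutate pattern in the hunk above;
# `cfg`, `is_hip`, and the option key are stand-ins, not the PR's actual fields.
def build_cfg_kwargs(cfg, is_hip):
    cfg_kwargs = cfg.kwargs
    if is_hip:
        # Shallow copy so the shared cfg.kwargs dict is left untouched
        cfg_kwargs = {**cfg_kwargs}
        cfg_kwargs["hip_only_option"] = True  # made-up key for illustration
    return cfg_kwargs
```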
Suggested change:

```diff
- cfg_kwargs = {**cfg_kwargs}
+ cfg_kwargs = dict(cfg_kwargs)
```
Any reason why not just the above? Or just explicitly make a shallow copy?
@Skylion007 the version I have is 1.5x faster:
```python
>>> import timeit
>>> other = {1: 2, 3: 4}
>>> timeit.timeit(lambda: dict(other))
0.09024659299757332
>>> timeit.timeit(lambda: {**other})
0.05995289294514805
```
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And what about `other.copy()`? Also @jansel, what about as the dict grows? My understanding is that `dict()` may have more initial overhead due to the function call lookup, but better scaling with larger dicts.
Suggested change:

```diff
- cfg_kwargs = {**cfg_kwargs}
+ cfg_kwargs = cfg_kwargs.copy()
```
@Skylion007 `other.copy()` is also slower:

```python
>>> timeit.timeit(lambda: other.copy())
0.075550084002316
```

For large dicts all 3 become the same (though `{**other}` is slightly faster):

```python
>>> large = dict.fromkeys(range(1000))
>>> timeit.timeit(lambda: dict(large))
2.926349258981645
>>> timeit.timeit(lambda: {**large})
2.9151299450313672
>>> timeit.timeit(lambda: large.copy())
2.917799219954759
```
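For a more systematic view of the scaling question raised above, a rough script like the following could be used; it is a sketch, not part of the PR, and the absolute numbers will vary by machine and CPython version:

```python
# Rough benchmark sketch (not part of the PR) for how the three copy spellings
# scale with dict size; results depend on machine and CPython version.
import timeit

for n in (2, 100, 10_000):
    d = dict.fromkeys(range(n))
    for label, fn in (("dict(d)", lambda: dict(d)),
                      ("{**d}", lambda: {**d}),
                      ("d.copy()", lambda: d.copy())):
        t = timeit.timeit(fn, number=10_000)
        print(f"n={n:<7} {label:<10} {t:.4f}s")
```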
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
None of this should be surprising if you look at the output bytecode:
- `dict` becomes `LOAD_GLOBAL`, which involves 2 module lookups (one for `globals()` and one for `builtins` -- both of which are hashtables). You can write `dict = MyDict` in global scope, so this is very hard to optimize.
- `copy` becomes `LOAD_METHOD`, which involves a method lookup on an object (also a hashtable lookup). The object might not be a `dict`, so this is very hard to optimize.
- `{**x}` involves zero string lookups, since you get dict-specific bytecodes with no dynamic behavior.
The exact same logic follows for `[*x]` being faster than `list(x)`.
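A quick way to see those bytecode differences for yourself is the `dis` module; the helper function names below are made up for illustration, and the exact opcode names vary across CPython versions (e.g. `CALL_FUNCTION` vs. `PRECALL`/`CALL`):

```python
# Sketch for inspecting the bytecode of the three copy spellings; the function
# names here are illustrative only.
import dis

def via_dict(d):
    return dict(d)    # LOAD_GLOBAL dict, then a call

def via_copy(d):
    return d.copy()   # LOAD_METHOD/LOAD_ATTR copy, then a call

def via_unpack(d):
    return {**d}      # BUILD_MAP + DICT_UPDATE on recent CPython; no name lookup

for fn in (via_dict, via_copy, via_unpack):
    print(f"--- {fn.__name__} ---")
    dis.dis(fn)
```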
Pull Request resolved: #143817
Approved by: https://github.com/eellison
ghstack dependencies: #143813, #143814, #143815
Stack from ghstack (oldest at bottom):
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov