gh-92777: Add LOAD_METHOD_LAZY_DICT by Fidget-Spinner · Pull Request #92778 · python/cpython · GitHub

Conversation

@Fidget-Spinner
Member

@Fidget-Spinner Fidget-Spinner commented May 13, 2022

Fixes #92777. Specialize LOAD_METHOD for lazy dictionaries; these account for 40% of the current LOAD_METHOD specialization misses.

I'm sad that I missed the 3.11 beta freeze for this specialization. It's straightforward and is likely to account for the majority of LOAD_METHOD executions in real-world code, since lazy __dict__ is now commonplace.
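Not from the PR itself, but a rough pure-Python sketch of why this fast path is sound: when an instance's __dict__ has never been materialized, an attribute used as a method can only live on the type, so the per-instance dict probe can be skipped (the `lookup_method` helper and `Point` class are illustrative, not CPython code):

```python
def lookup_method(obj, name):
    # Mirrors the fast path LOAD_METHOD_LAZY_DICT relies on: with no
    # materialized instance __dict__, only the type's MRO needs probing.
    for klass in type(obj).__mro__:
        if name in klass.__dict__:
            attr = klass.__dict__[name]
            # Bind descriptors (plain functions included) to the instance.
            return attr.__get__(obj, type(obj)) if hasattr(attr, "__get__") else attr
    raise AttributeError(name)

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y

    def norm(self):
        return (self.x ** 2 + self.y ** 2) ** 0.5
```

The real instruction additionally guards on the instance's values pointer being NULL and deoptimizes once a dict is materialized; this sketch only shows the type-only lookup.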

@Fidget-Spinner
Member Author

Hah, looks like I was wrong; it wasn't that straightforward after all :).

@AlexWaygood AlexWaygood added type-feature A feature request or enhancement performance Performance or resource usage labels May 13, 2022
Member

@markshannon markshannon left a comment


A few minor issues, but generally looks good.
What are the stats for the LOAD_METHOD_LAZY_DICT instruction?

@Fidget-Spinner
Member Author

Fidget-Spinner commented May 13, 2022

How do you collect stats for pyperformance and create that nice table on faster-cpython? I'm frankly clueless (I only know how to use the version that dumps stats to the terminal or a file). Sorry.

On test-suite code, I get a 0.3% improvement on hits and 0.6% more misses. But I want to point out that typing loves messing around with __dict__, so test_typing may not be representative. Pyperformance will likely see a noticeable bump in hits with fewer misses.

./python -m test test_typing test_re test_dis test_zlib


Before:
opcode[160].specializable : 1
    opcode[160].specialization.success : 1395
    opcode[160].specialization.failure : 1146
    opcode[160].specialization.hit : 1026699
    opcode[160].specialization.deferred : 78612
    opcode[160].specialization.miss : 8664
    opcode[160].specialization.deopt : 156
    opcode[160].execution_count : 18435
    opcode[160].specialization.failure_kinds[0] : 63
    opcode[160].specialization.failure_kinds[1] : 39
    opcode[160].specialization.failure_kinds[2] : 505
    opcode[160].specialization.failure_kinds[4] : 406
    opcode[160].specialization.failure_kinds[9] : 4
    opcode[160].specialization.failure_kinds[10] : 3
    opcode[160].specialization.failure_kinds[17] : 33
    opcode[160].specialization.failure_kinds[18] : 23
    opcode[160].specialization.failure_kinds[19] : 1
    opcode[160].specialization.failure_kinds[22] : 69


After:
opcode[160].specializable : 1
    opcode[160].specialization.success : 1399
    opcode[160].specialization.failure : 1107
    opcode[160].specialization.hit : 1029041
    opcode[160].specialization.deferred : 76217
    opcode[160].specialization.miss : 8717
    opcode[160].specialization.deopt : 157
    opcode[160].execution_count : 18488
    opcode[160].specialization.failure_kinds[0] : 63
    opcode[160].specialization.failure_kinds[2] : 505
    opcode[160].specialization.failure_kinds[4] : 406
    opcode[160].specialization.failure_kinds[9] : 4
    opcode[160].specialization.failure_kinds[10] : 3
    opcode[160].specialization.failure_kinds[17] : 33
    opcode[160].specialization.failure_kinds[18] : 23
    opcode[160].specialization.failure_kinds[19] : 1
    opcode[160].specialization.failure_kinds[22] : 69

@Fidget-Spinner
Member Author

Fidget-Spinner commented May 13, 2022

Wow, looks like my expectations were proven wrong by the stats again: after removing test_typing, I get a 0.18% increase in hits at the expense of 0.45% more misses. So test_typing was actually bolstering the numbers!

The part of the stdlib I've found that frequently uses this instruction is the _io module's objects. But I don't know how to get stats on those, as their tests run in subprocesses.

I'm not feeling too confident about this optimization now. It seems like something that would boost our pyperformance numbers but maybe not in the real world?

./python -m test test_re test_dis test_zlib


Before:
opcode[160].specializable : 1
    opcode[160].specialization.success : 2113
    opcode[160].specialization.failure : 4126
    opcode[160].specialization.hit : 5506338
    opcode[160].specialization.deferred : 306030
    opcode[160].specialization.miss : 44980
    opcode[160].specialization.deopt : 745
    opcode[160].execution_count : 59474
    opcode[160].specialization.failure_kinds[0] : 365
    opcode[160].specialization.failure_kinds[1] : 172
    opcode[160].specialization.failure_kinds[2] : 770
    opcode[160].specialization.failure_kinds[4] : 2417
    opcode[160].specialization.failure_kinds[9] : 12
    opcode[160].specialization.failure_kinds[10] : 4
    opcode[160].specialization.failure_kinds[17] : 150
    opcode[160].specialization.failure_kinds[18] : 58
    opcode[160].specialization.failure_kinds[19] : 2
    opcode[160].specialization.failure_kinds[22] : 139
    opcode[160].specialization.failure_kinds[23] : 37



After:
opcode[160].specializable : 1
    opcode[160].specialization.success : 2127
    opcode[160].specialization.failure : 3954
    opcode[160].specialization.hit : 5516541
    opcode[160].specialization.deferred : 295625
    opcode[160].specialization.miss : 45182
    opcode[160].specialization.deopt : 748
    opcode[160].execution_count : 59676
    opcode[160].specialization.failure_kinds[0] : 365
    opcode[160].specialization.failure_kinds[2] : 770
    opcode[160].specialization.failure_kinds[4] : 2417
    opcode[160].specialization.failure_kinds[9] : 12
    opcode[160].specialization.failure_kinds[10] : 4
    opcode[160].specialization.failure_kinds[17] : 150
    opcode[160].specialization.failure_kinds[18] : 58
    opcode[160].specialization.failure_kinds[19] : 2
    opcode[160].specialization.failure_kinds[22] : 139
    opcode[160].specialization.failure_kinds[23] : 37
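For reference, the 0.18% / 0.45% figures quoted above can be reproduced from the raw hit and miss counters in these dumps (a quick sanity-check script, not part of the PR):

```python
# Hit and miss counters copied from the before/after dumps above.
before = {"hit": 5_506_338, "miss": 44_980}
after = {"hit": 5_516_541, "miss": 45_182}

def pct_change(a, b):
    """Relative change from a to b, in percent."""
    return (b - a) / a * 100

hit_delta = pct_change(before["hit"], after["hit"])     # ~0.185% more hits
miss_delta = pct_change(before["miss"], after["miss"])  # ~0.449% more misses
print(f"hits: +{hit_delta:.2f}%, misses: +{miss_delta:.2f}%")
```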

@markshannon
Member

Generating the table is somewhat manual and hacky. I mean to automate it, but for now here's the procedure:

  1. Create a new branch and cherry-pick this commit: faster-cpython@a9c92c0
  2. Run pyperformance compile on that branch. I use this config.ini file: https://gist.github.com/markshannon/26f4e8db2b715c991eee1508f430f6b2 You will need to modify it for your machine and repo.
  3. While it is in the installing phase, create /tmp/py_stats and clear it out: rm -r /tmp/py_stats/*
  4. About the time the installing phase finishes and the benchmarks start, run one final rm -r /tmp/py_stats/*

The table is then created by running ./python Tools/scripts/summarize_stats.py
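The steps above, condensed into a sketch; the commit hash and config file come from the comment, and the steps that depend on the local checkout are left as comments:

```shell
# 1. New branch with the stats-gathering commit from faster-cpython:
#      git cherry-pick a9c92c0
# 2. Kick off the run, with config.ini adapted from the linked gist:
#      pyperformance compile config.ini
# 3. While pyperformance is in its installing phase, make sure the
#    stats directory exists and is empty:
mkdir -p /tmp/py_stats
rm -rf /tmp/py_stats/*
# 4. Repeat the cleanup right as the benchmarks start, so that only
#    benchmark runs contribute stats; afterwards, summarize:
#      ./python Tools/scripts/summarize_stats.py
```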

@Fidget-Spinner
Member Author

Fidget-Spinner commented May 24, 2022

I have the stats here https://gist.github.com/Fidget-Spinner/4dbc2d002c30e36587939c4bdfd9840c.

LOAD_METHOD specialization hits are now 83.5%, versus 78.7% on the faster-cpython repo. So that's roughly a 5 percentage point increase.

@markshannon
Member

Stats look good. Code looks good.
I'm going to run the benchmarks before merging.

@markshannon
Member

No real difference in performance, but that's in line with what we would expect.

@markshannon markshannon merged commit 5e6e5b9 into python:main May 25, 2022
@Fidget-Spinner Fidget-Spinner deleted the load_method_lazy_dict branch May 29, 2022 08:18


Development

Successfully merging this pull request may close these issues.

More LOAD_METHOD specializations
