KEMBAR78

Per-thread flamegraph option in JFR heatmap converter by fandreuz · Pull Request #1414 · async-profiler/async-profiler · GitHub

Per-thread flamegraph option in JFR heatmap converter #1414

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

apangin merged 81 commits into async-profiler:master from fandreuz:per-thread-flamegraph

Aug 13, 2025

Contributor

fandreuz commented Jul 25, 2025

Description

In this PR I introduce a new feature in the JFR heatmap converter to have an artificial frame at the base of the flamegraph containing the thread name where the stacktrace happened.

Related issues

Motivation and context

Motivation in the original issue.

How has this been tested?

Manual testing.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

fandreuz added 30 commits

July 11, 2025 12:29


          rename

dd6d3bd


          methodName to name

b13ca56


          split freq and newMethodId

63c7179

cc

0e380c5


          rename

171b133


          rename

da04b73


          new frame type

46622be


          always prefix and suffix

257457e


          sort on copy

29e7d28

nn

6f032b6


          remove next from FrameDesc

5339d45

nn

06fc418


          revert rename

8c882d7

nn

707c48a

cc

ebcffd9

nn

cc

20aa001

cc

facc46e

cc

3c1b504

cc

cf94370


          impl

7c29b1e


          missing ;

fe13daf


          revert

4e3416e


          nope

3d1bc1c


          revert

0b1565e


          arraylist

45eb972


          revert

981fb3a

for

c2edc56

+1

e8fd27f


          array

a7ff3f0

fandreuz added 5 commits

August 8, 2025 13:58


          args

e526863


          extra to classId

590f60e


          bit shift nn

9d947e0


          move out

eaa733a


          rename

3aa3ba6

apangin reviewed

View reviewed changes

src/converter/one/heatmap/Heatmap.java Outdated Show resolved Hide resolved

fandreuz added 2 commits

August 8, 2025 14:53

cc

4804b5d

nn

644d23e

fandreuz mentioned this pull request

Smoke tests for JFR converter #1434

Merged

fandreuz added 4 commits

August 11, 2025 09:48

wip

167bbc6

cc

52a9689

nn

cc

c4569a0

apangin reviewed

View reviewed changes

src/converter/one/heatmap/Heatmap.java Outdated

    
                      }

                      private Integer getMethodIndex(MethodKey key) {

                          return methodCache.computeIfAbsent(key, this::makeMethod);

Member

apangin Aug 12, 2025

Good. Now comes the trick: instead of this::makeMethod pass a lambda/method reference cached in final Function<MethodKey, Integer> field, and you'll get 5% performance improvement out of thin air.

Member

apangin Aug 12, 2025

Alternatively, you can replace computeIfAbsent with get+put pair without lambda and get the same effect.

Contributor Author

fandreuz Aug 12, 2025 •

edited

Loading

I had a quick look at the flamegraph and it looks a tiny bit faster, but why can't the JVM make this optimization on its own? I don't see the difference between a final Function<MethodKey, Integer> and the method reference. Is it just the additional reference to the class?

Member

apangin Aug 12, 2025

but why can't the JVM make this optimization on its own

Function object created by this::someMethod depends on the instance. JVM does do per-instance caching. I can understand why: let's say there are 10 method references in the code - this would require JVM to add 10 extra synthetic fields, which may or may not be used, for every instance of an otherwise small object.

So, it's up to developer to decide whether to cache non-static lambdas or not.

fandreuz force-pushed the per-thread-flamegraph branch from c4569a0 to cb506dd Compare

August 12, 2025 08:10

5%

041cfb7

fandreuz force-pushed the per-thread-flamegraph branch from cb506dd to 041cfb7 Compare

August 12, 2025 11:03

fandreuz added 2 commits

August 12, 2025 11:04

fix

ea75fc8

fix

d60a0f0

apangin reviewed

View reviewed changes

src/converter/one/heatmap/Heatmap.java Outdated Show resolved Hide resolved


          fill methodcache

06fc30e

apangin reviewed

View reviewed changes

src/converter/one/heatmap/Heatmap.java Outdated

    
                          stackTracesCache.put(id, stackTracesRemap.index(cachedStackTrace, size));

                      }

                      private Integer getMethodIndex(MethodKey key) {

Member

apangin Aug 12, 2025

Why Integer? Return value is always used as int.

Contributor Author

fandreuz Aug 12, 2025

src/converter/one/heatmap/Heatmap.java Outdated

    
                          return methodIdx;

                      }

                      private Integer makeMethod(MethodKey key) {

Member

apangin Aug 12, 2025

int. Or just inline this function in the above method.

Contributor Author

fandreuz Aug 12, 2025

src/converter/one/heatmap/Heatmap.java Outdated

    
                          private final MethodKeyType keyType;

                          public MethodKey(MethodKeyType keyType, long methodId, int location, byte type, boolean firstInStack) {

                              if (type < 0) throw new IllegalArgumentException("Unexpected type: " + type);

Member

apangin Aug 12, 2025

Unnecessary limitation. Just replace (long) type with (type & 0xffL).

Contributor Author

fandreuz Aug 12, 2025


          review comments

568425b

apangin reviewed

View reviewed changes

src/converter/one/heatmap/Heatmap.java

    
                  }

                  public void beforeChunk() {

                      state.methodsCache.clear();

Member

apangin Aug 12, 2025

Seems like this corrupted frames in multi-chunk recordings.
Cleaning method cache between chunks is essential.

Contributor Author

fandreuz Aug 12, 2025

How did you find this out? I converted a 4gb JFR and the output looked ok. I'd assume it contains more than one chunk, will check

Contributor Author

fandreuz Aug 13, 2025

Member

apangin Aug 13, 2025

Works like a charm now, thanks!


          clear

14c4a2a

fandreuz force-pushed the per-thread-flamegraph branch from 0b9e7a4 to 568425b Compare

August 13, 2025 07:51

apangin reviewed

View reviewed changes

src/converter/one/convert/JfrToHeatmap.java Outdated

Comment on lines 55 to 56

    
                              jfr.stackTraces.forEach(new Dictionary.Visitor<StackTrace>() {

                                  @Override

                                  public void visit(long key, StackTrace trace) {

                                      heatmap.beforeChunk();

Member

apangin Aug 13, 2025

for each stacktrace??

Contributor Author

fandreuz Aug 13, 2025

Dumb mistake, sorry. 9d970af

fandreuz and others added 2 commits

August 13, 2025 13:07

ops

9d970af


          Style/format

fc31589

Signed-off-by: Andrei Pangin <1749416+apangin@users.noreply.github.com>

apangin merged commit 89ead82 into async-profiler:master

19 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet