[Inductor] [bc-breaking] Node Level provenance tracking #144277

yushangdi · 2025-01-06T19:52:29Z

Summary:

- use GraphTransformObserver + replace_node hooks to track node sources when they are replaced
- add pre_grad_graph tracking to tlparse
- add the node provenance information to post_grad_graph tlparse. This is for the frontend to create a mapping between pre_grad and post_grad graph. See an example frontend (this is just a prototype) here: https://drive.google.com/file/d/1cMHH_0y4FJUSS9tATwGQvA72O0Lth8eh/view?usp=sharing
- change "action" of NodeSource from a single action to a list of actions.
It's BC-breaking because we removed GraphTransformObserver's class methods on_node_creation and on_node_erase.
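
The hook-based tracking described in the summary above can be illustrated with a small stand-alone sketch. It does not use the real `torch.fx` classes: `ToyNode`, `ToyNodeSource`, `ToyGraph`, `ToyObserver`, and `root_origins` are hypothetical stand-ins, and only the idea (a replace hook appends a provenance record that carries a list of actions) is taken from the summary.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class ToyNodeSource:
    """Hypothetical provenance record: which node this came from, and in which pass."""
    from_name: str
    pass_name: str
    actions: List[str]  # a list of actions rather than a single action
    parents: List["ToyNodeSource"] = field(default_factory=list)


@dataclass
class ToyNode:
    name: str
    sources: List[ToyNodeSource] = field(default_factory=list)


class ToyGraph:
    """Minimal graph that calls registered hooks whenever a node is replaced."""

    def __init__(self) -> None:
        self.replace_hooks: List[Callable[[ToyNode, ToyNode], None]] = []

    def replace_node(self, old: ToyNode, new: ToyNode) -> None:
        for hook in self.replace_hooks:
            hook(old, new)


class ToyObserver:
    """Context manager that registers a replace hook for the duration of one pass."""

    def __init__(self, graph: ToyGraph, pass_name: str) -> None:
        self.graph = graph
        self.pass_name = pass_name

    def _on_replace(self, old: ToyNode, new: ToyNode) -> None:
        # Record that `new` replaced `old`, keeping the history of `old` as parents.
        new.sources.append(
            ToyNodeSource(old.name, self.pass_name, ["replace"], list(old.sources))
        )

    def __enter__(self) -> "ToyObserver":
        self.graph.replace_hooks.append(self._on_replace)
        return self

    def __exit__(self, *exc) -> None:
        self.graph.replace_hooks.remove(self._on_replace)


def root_origins(node: ToyNode) -> Set[str]:
    """Walk the provenance chain back to the earliest recorded node names."""
    roots: Set[str] = set()
    stack = list(node.sources)
    while stack:
        src = stack.pop()
        if src.parents:
            stack.extend(src.parents)
        else:
            roots.add(src.from_name)
    return roots


graph = ToyGraph()
add, fused, final = ToyNode("add"), ToyNode("fused_add_relu"), ToyNode("kernel_0")
with ToyObserver(graph, "fuse_pass"):
    graph.replace_node(add, fused)
with ToyObserver(graph, "lowering"):
    graph.replace_node(fused, final)
```

After the two passes, `root_origins(final)` walks back through both replacements to the original `add` node, which is the kind of chain a front end needs in order to link a late-stage node to an early-stage one.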

https://docs.google.com/document/d/1dGh9myqNhywmbfP0Quzx_f04bghDFlj8cawj8MopiO8/edit?tab=t.0

The front-end code that takes in the tlparse result is in https://github.com/yushangdi/compiler_explorer.
ghstack-source-id: 260390519

Test Plan:

buck2 run mode/dev-nosan fbcode//caffe2/test:fx -- -r test_graph_transform_observer
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r node_source
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r graph_provenance

Front-end example screenshots on a real model, 93% coverage rate between pre_grad_graph and post_grad_graph

{F1973584210}{F1973584209}
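
As a rough illustration of how a front end might turn per-node provenance into a pre_grad to post_grad mapping and derive a coverage figure, consider the sketch below. The input format (a dict from post_grad node name to a list of pre_grad node names) and the definition of coverage (the fraction of pre_grad nodes that appear in at least one post_grad node's provenance) are assumptions made for illustration; the PR does not spell out either, and the data is made up.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def invert_provenance(post_to_pre: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Build a pre_grad -> post_grad mapping from post_grad -> pre_grad provenance."""
    pre_to_post: Dict[str, List[str]] = defaultdict(list)
    for post_name, pre_names in post_to_pre.items():
        for pre_name in pre_names:
            pre_to_post[pre_name].append(post_name)
    return dict(pre_to_post)


def coverage_rate(pre_nodes: Iterable[str], pre_to_post: Dict[str, List[str]]) -> float:
    """Fraction of pre_grad nodes mapped to at least one post_grad node (assumed definition)."""
    names = list(pre_nodes)
    if not names:
        return 0.0
    covered = sum(1 for name in names if pre_to_post.get(name))
    return covered / len(names)


# Made-up example data, not taken from the real model mentioned in the PR.
post_to_pre = {
    "mm_default": ["linear"],
    "relu_default": ["relu"],
    "fused_kernel": ["linear", "relu"],
}
pre_nodes = ["linear", "relu", "dropout", "sigmoid"]

pre_to_post = invert_provenance(post_to_pre)
rate = coverage_rate(pre_nodes, pre_to_post)
```

In this toy data, `dropout` and `sigmoid` have no post_grad counterpart, so they lower the rate; a real front end would presumably highlight such unmapped nodes.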

buck2 build --show-output mode/opt -c=python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true -c fbcode.nvcc_arch=a100,h100 caffe2/torch/fb/model_transform/experimental/benchmark:mts_gpu_benchmark

MODEL_ENTITY_ID=644688112
SNAPSHOT_ID=32
MODULE=merge

TORCH_COMPILE_DEBUG=1 CUDA_VISIBLE_DEVICES=7 TORCH_LOGS="+inductor,+schedule,output_code,graph_code" TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 ../buck-out/v2/gen/fbcode/ec86b05dd59e84db/caffe2/torch/fb/model_transform/experimental/benchmark/__mts_gpu_benchmark__/mts_gpu_benchmark.par --local-model /home/bahuang/models/${MODEL_ENTITY_ID}/${SNAPSHOT_ID}/gpu_lowering/input.predictor.disagg.gpu.merge --lower-backend AOT_INDUCTOR_EP --gpu-trace --aot-inductor-config="{'max_autotune': True}"

buck2 run mode/dev-nosan fbcode//caffe2/test/inductor:auto_functionalize

Differential Revision: D65006709

cc @ezyang @SherlockNoMad @EikanWang @jgong5 @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

pytorch-bot · 2025-01-06T19:52:33Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/144277

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Failures because of deprecated version of actions/download-artifact: v3

✅ You can merge normally! (1 Unrelated Failure)

As of commit f40b2d3 with merge base a5164a2:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

pull / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, lf.linux.2xlarge) (gh) (#144480)
backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-01-06T19:52:45Z

This pull request was exported from Phabricator. Differential Revision: D65006709

Summary:

- use GraphTransformObserver + replace_node hooks to track node sources when they are replaced
- add pre_grad_graph tracking to tlparse
- add the node provenance information to tlparse artifact. This is for the frontend to create a mapping between pre_grad and post_grad graph. See an example frontend (this is just a prototype) here: https://drive.google.com/file/d/1cMHH_0y4FJUSS9tATwGQvA72O0Lth8eh/view?usp=sharing
- change "action" of NodeSource from a single action to a list of actions.
- Guard provenance tracking behind `_inductor.config.trace.enabled` flag to avoid slowing down compilation

https://docs.google.com/document/d/1dGh9myqNhywmbfP0Quzx_f04bghDFlj8cawj8MopiO8/edit?tab=t.0

The front-end code that takes in the tlparse result is in https://github.com/yushangdi/compiler_explorer.
ghstack-source-id: 260390519

Test Plan:

```
buck2 run mode/dev-nosan fbcode//caffe2/test:fx -- -r test_graph_transform_observer
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r node_source
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r graph_provenance
buck2 run mode/dev-nosan fbcode//caffe2/test/inductor:auto_functionalize
buck2 run mode/dev-nosan fbcode//caffe2/test/inductor:provenance_tracing
```

Front-end example screenshots on a real model, 93% coverage rate between pre_grad_graph and post_grad_graph

{F1973584210}{F1973584209}

```
buck2 build --show-output mode/opt -c=python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true -c fbcode.nvcc_arch=a100,h100 caffe2/torch/fb/model_transform/experimental/benchmark:mts_gpu_benchmark

MODEL_ENTITY_ID=644688112
SNAPSHOT_ID=32
MODULE=merge

TORCH_COMPILE_DEBUG=1 CUDA_VISIBLE_DEVICES=7 TORCH_LOGS="+inductor,+schedule,output_code,graph_code" TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 ../buck-out/v2/gen/fbcode/ec86b05dd59e84db/caffe2/torch/fb/model_transform/experimental/benchmark/__mts_gpu_benchmark__/mts_gpu_benchmark.par --local-model /home/bahuang/models/${MODEL_ENTITY_ID}/${SNAPSHOT_ID}/gpu_lowering/input.predictor.disagg.gpu.merge --lower-backend AOT_INDUCTOR_EP --gpu-trace --aot-inductor-config="{'max_autotune': True}"
```

Differential Revision: D65006709

facebook-github-bot · 2025-01-08T19:50:29Z

This pull request was exported from Phabricator. Differential Revision: D65006709

Summary:

- use GraphTransformObserver + replace_node hooks to track node sources when they are replaced
- add pre_grad_graph tracking to tlparse
- add the node provenance information to tlparse artifact. This is for the frontend to create a mapping between pre_grad and post_grad graph. See an example frontend (this is just a prototype) here: https://drive.google.com/file/d/1cMHH_0y4FJUSS9tATwGQvA72O0Lth8eh/view?usp=sharing
- change "action" of NodeSource from a single action to a list of actions.
- Guard provenance tracking behind `_inductor.config.trace.enabled` flag to avoid slowing down compilation

https://docs.google.com/document/d/1dGh9myqNhywmbfP0Quzx_f04bghDFlj8cawj8MopiO8/edit?tab=t.0

The front-end code that takes in the tlparse result is in https://github.com/yushangdi/compiler_explorer.
ghstack-source-id: 260390519

Test Plan:

```
buck2 run mode/dev-nosan fbcode//caffe2/test:fx -- -r test_graph_transform_observer
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r node_source
buck run mode/dev-nosan fbcode//caffe2/test:fx -- -r graph_provenance
buck2 run mode/dev-nosan fbcode//caffe2/test/inductor:auto_functionalize
buck2 run mode/dev-nosan fbcode//caffe2/test/inductor:provenance_tracing
python benchmarks/basic_modules_benchmarks.py
```

Front-end example screenshots on a real model, 93% coverage rate between pre_grad_graph and post_grad_graph

{F1973584210}{F1973584209}

```
buck2 build --show-output mode/opt -c=python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true -c fbcode.nvcc_arch=a100,h100 caffe2/torch/fb/model_transform/experimental/benchmark:mts_gpu_benchmark

MODEL_ENTITY_ID=644688112
SNAPSHOT_ID=32
MODULE=merge

TORCH_COMPILE_DEBUG=1 CUDA_VISIBLE_DEVICES=7 TORCH_LOGS="+inductor,+schedule,output_code,graph_code" TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 ../buck-out/v2/gen/fbcode/ec86b05dd59e84db/caffe2/torch/fb/model_transform/experimental/benchmark/__mts_gpu_benchmark__/mts_gpu_benchmark.par --local-model /home/bahuang/models/${MODEL_ENTITY_ID}/${SNAPSHOT_ID}/gpu_lowering/input.predictor.disagg.gpu.merge --lower-backend AOT_INDUCTOR_EP --gpu-trace --aot-inductor-config="{'max_autotune': True}"
```

Differential Revision: D65006709

facebook-github-bot · 2025-01-08T21:41:55Z

This pull request was exported from Phabricator. Differential Revision: D65006709

desertfire

Please update the title (remove 142739).

desertfire · 2025-01-09T18:36:24Z

torch/_inductor/compile_fx.py

                        print_output=False, include_stride=True, include_device=True
                    ),
                )
+                if config.trace.enabled:


There was a discussion on deprecating TORCH_COMPILE_DEBUG. Ok for now, but we may want to switch to TORCH_LOGS in future.
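
For readers unfamiliar with the pattern, the guard in the diff above keeps provenance bookkeeping out of the normal compile path. A rough stand-alone illustration follows; `TraceConfig`, `CompileResult`, and `compile_step` are hypothetical names, and deriving the flag from `TORCH_COMPILE_DEBUG` is an assumption based on the environment variable mentioned in this comment, not a reproduction of Inductor's config code.

```python
import os
from dataclasses import dataclass, field
from typing import Dict, List, Mapping


@dataclass
class TraceConfig:
    """Hypothetical stand-in for a debug-trace config section."""
    enabled: bool = False

    @classmethod
    def from_env(cls, env: Mapping[str, str] = os.environ) -> "TraceConfig":
        # Assumption: tracing is switched on when TORCH_COMPILE_DEBUG is "1".
        return cls(enabled=env.get("TORCH_COMPILE_DEBUG", "0") == "1")


@dataclass
class CompileResult:
    output: List[str]
    provenance: Dict[str, str] = field(default_factory=dict)


def compile_step(nodes: List[str], trace: TraceConfig) -> CompileResult:
    """Toy lowering step that records provenance only when tracing is enabled."""
    result = CompileResult(output=[f"lowered_{n}" for n in nodes])
    if trace.enabled:
        # Only pay for the extra bookkeeping when tracing was requested.
        result.provenance = {f"lowered_{n}": n for n in nodes}
    return result


fast = compile_step(["add", "mul"], TraceConfig.from_env({}))
debug = compile_step(["add", "mul"], TraceConfig.from_env({"TORCH_COMPILE_DEBUG": "1"}))
```

Both calls produce the same lowered output; only the `debug` result carries the provenance dict. Moving the switch to a different mechanism such as `TORCH_LOGS` would, in this sketch, only change `from_env`.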

facebook-github-bot · 2025-01-09T21:59:33Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2025-01-09T22:01:09Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorch-bot bot added ciflow/inductor module: inductor release notes: fx release notes category labels Jan 6, 2025

facebook-github-bot added the fx label Jan 6, 2025

facebook-github-bot added the fb-exported label Jan 6, 2025

yushangdi changed the title ~~[Inductor] Node Level provenance tracking (#142739)~~ [Inductor] [bc-breaking] Node Level provenance tracking (#142739) Jan 6, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jan 6, 2025

yushangdi added the suppress-bc-linter Suppresses the failures of API backward-compatibility linter (Lint/bc_linter) label Jan 6, 2025

facebook-github-bot force-pushed the export-D65006709 branch from 83409b4 to ef7925a Compare January 8, 2025 19:49

facebook-github-bot force-pushed the export-D65006709 branch from ef7925a to f40b2d3 Compare January 8, 2025 21:41

yushangdi mentioned this pull request Jan 9, 2025

[Inductor] Node Level provenance tracking #142739

Closed

desertfire approved these changes Jan 9, 2025

View reviewed changes

yushangdi changed the title ~~[Inductor] [bc-breaking] Node Level provenance tracking (#142739)~~ [Inductor] [bc-breaking] Node Level provenance tracking Jan 9, 2025

pytorchmergebot added the merging label Jan 9, 2025

pytorchmergebot added the Merged label Jan 9, 2025

pytorchmergebot closed this in 379b546 Jan 9, 2025

pytorchmergebot removed the merging label Jan 9, 2025

github-actions bot deleted the export-D65006709 branch February 12, 2025 02:06
