[quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

andrewor14 · 2023-11-06T03:53:25Z

Stack from ghstack (oldest at bottom):

-> [quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

Summary: This commit significantly simplifies the QAT fusion
code for the conv-bn pattern by removing add and relu nodes
from the match and replacement patterns. This does not reduce
functionality; patterns like conv-bn-relu, conv-bn-add,
and conv-bn-add-relu are still supported. We simply do not
match these extra nodes, since there is actually no need to
replace them.

This has the additional benefit of reducing the number of
patterns being matched by 16x, since for each add and relu
variant of the conv-bn pattern there is also an in-place
variant. This also enables more flexible conv-bn pattern
matching in the future and keeps the number of patterns
more scalable.

One important change needed in this commit was to remove
the match filter that requires the input and output
activations to be quantized. This was necessary because
otherwise we would always expect q-dq nodes immediately
after the getitem node, instead of after the add or relu
nodes for example. This has another side benefit of
keeping QAT fusion flexible enough to support weight
only quantization.

Test Plan:
python test/test_quantization.py TestQuantizePT2EQAT

Reviewers: jerryzh168, kimishpatel

Subscribers: jerryzh168, kimishpatel

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel [ghstack-poisoned]

pytorch-bot · 2023-11-06T03:53:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113006

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit b5e3cbe with merge base e1c872e ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 1, 5, linux.g5.4xlarge.nvidia.gpu) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel ghstack-source-id: 251cf2e Pull Request resolved: #113006

andrewor14 · 2023-11-06T03:54:16Z

cc @leslie-fang-intel

leslie-fang-intel · 2023-11-06T05:00:59Z

Thanks for the simplification.

One important change needed in this commit was to remove
the match filter that requires the input and output
activations to be quantized. This was necessary because
otherwise we would always expect q-dq nodes immediately
after the getitem node, instead of after the add or relu
nodes for example.

This also fixes one of issues reported in #112833

jerryzh168

LG, thanks!

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel [ghstack-poisoned]

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel ghstack-source-id: ef30c6f Pull Request resolved: #113006

andrewor14 · 2023-11-07T20:09:10Z

@pytorchbot merge

pytorchmergebot · 2023-11-07T20:11:04Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-11-07T22:53:32Z

Merge failed

Reason: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x f5615cbfd33a6e13ac260a956598aed98d54e110 returned non-zero exit code 1

Auto-merging torch/ao/quantization/pt2e/qat_utils.py
CONFLICT (content): Merge conflict in torch/ao/quantization/pt2e/qat_utils.py
error: could not apply f5615cbfd33... [quant][pt2][be] Remove add/relu from conv-bn QAT pattern
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".

Details for Dev Infra team

Raised by workflow job

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel [ghstack-poisoned]

Summary: This commit significantly simplifies the QAT fusion code for the `conv-bn` pattern by removing add and relu nodes from the match and replacement patterns. This does not reduce functionality; patterns like `conv-bn-relu`, `conv-bn-add`, and `conv-bn-add-relu` are still supported. We simply do not match these extra nodes, since there is actually no need to replace them. This has the additional benefit of reducing the number of patterns being matched by 16x, since for each add and relu variant of the `conv-bn` pattern there is also an in-place variant. This also enables more flexible `conv-bn` pattern matching in the future and keeps the number of patterns more scalable. One important change needed in this commit was to remove the match filter that requires the input and output activations to be quantized. This was necessary because otherwise we would always expect q-dq nodes immediately after the getitem node, instead of after the add or relu nodes for example. This has another side benefit of keeping QAT fusion flexible enough to support weight only quantization. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel ghstack-source-id: 9724061 Pull Request resolved: #113006

andrewor14 · 2023-11-14T16:06:07Z

@pytorchbot merge

pytorchmergebot · 2023-11-14T16:08:18Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

This was referenced Nov 6, 2023

[quant][pt2] Support quantized conv bias in QAT fusion #112528

Closed

[quant][pt2] Fix custom dtype per channel weight in QAT #112612

Closed

pytorch-bot bot added the release notes: AO frontend label Nov 6, 2023

github-actions bot added the release notes: quantization release notes category label Nov 6, 2023

andrewor14 requested review from jerryzh168 and kimishpatel November 6, 2023 03:53

jerryzh168 approved these changes Nov 6, 2023

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 7, 2023

pytorchmergebot added the merging label Nov 7, 2023

pytorchmergebot removed the merging label Nov 7, 2023

pytorchmergebot added the merging label Nov 14, 2023

pytorchmergebot added Merged and removed merging labels Nov 14, 2023

pytorchmergebot closed this in 14eb92c Nov 14, 2023

facebook-github-bot deleted the gh/andrewor14/40/head branch November 18, 2023 15:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

[quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

Uh oh!

andrewor14 commented Nov 6, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 6, 2023 •

edited

Loading

Uh oh!

andrewor14 commented Nov 6, 2023

Uh oh!

leslie-fang-intel commented Nov 6, 2023 •

edited

Loading

Uh oh!

jerryzh168 left a comment

Uh oh!

andrewor14 commented Nov 7, 2023

Uh oh!

pytorchmergebot commented Nov 7, 2023

Uh oh!

pytorchmergebot commented Nov 7, 2023

Uh oh!

andrewor14 commented Nov 14, 2023

Uh oh!

pytorchmergebot commented Nov 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

[quant][pt2][be] Remove add/relu from conv-bn QAT pattern #113006

Uh oh!

Conversation

andrewor14 commented Nov 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/113006

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

andrewor14 commented Nov 6, 2023

Uh oh!

leslie-fang-intel commented Nov 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jerryzh168 left a comment

Choose a reason for hiding this comment

Uh oh!

andrewor14 commented Nov 7, 2023

Uh oh!

pytorchmergebot commented Nov 7, 2023

Merge started

Uh oh!

pytorchmergebot commented Nov 7, 2023

Merge failed

Uh oh!

andrewor14 commented Nov 14, 2023

Uh oh!

pytorchmergebot commented Nov 14, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

andrewor14 commented Nov 6, 2023 •

edited

Loading

pytorch-bot bot commented Nov 6, 2023 •

edited

Loading

leslie-fang-intel commented Nov 6, 2023 •

edited

Loading