[quant][pt2] Support quantized conv bias in QAT fusion #112528
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112528
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit 6d2800e with merge base 46a34e8. This comment was automatically generated by Dr. CI and updates every 15 minutes.
@andrewor14 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Reviewed code (a sketch of what the arg-copying helper does follows the review thread below):

    # Step (3): Fold BN weights into conv
    # Step (3): Copy over args for weight (and optionally bias) q - dq nodes
    _copy_over_q_dq_args(*node_map["conv_weight_q"])
I'm wondering if these can be simplified if we use these? https://github.com/pytorch/pytorch/blob/main/torch/ao/quantization/pt2e/utils.py#L291
Yeah, we can explore that as a BE task along with the conv arg copying. I prefer to do that separately.
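As a reader's aid for the `_copy_over_q_dq_args(*node_map["conv_weight_q"])` call reviewed above, here is a minimal, hypothetical sketch of what "copying over q-dq args" amounts to. The function signature, the `node_map` keys, and the pair layout are assumptions for illustration only, not the actual implementation in `torch/ao/quantization/pt2e`.

```python
# Hypothetical sketch, not PyTorch's implementation: carry the quantization
# parameters (scale, zero_point, quant_min/max, dtype) from each matched
# original quantize/dequantize node over to its replacement, for every q-dq
# pair in the match (conv weight and, with this PR, optionally conv bias).
from torch.fx import Node

def copy_over_q_dq_args(original_node: Node, replacement_node: Node) -> None:
    # args[0] is the tensor being (de)quantized; the remaining args are the
    # quantization parameters, which must be copied from the original node.
    assert original_node.target == replacement_node.target
    replacement_node.args = (
        replacement_node.args[0],
        *original_node.args[1:],
    )

def copy_all_q_dq_args(node_map: dict) -> None:
    # The matched pattern may now contain more than one q-dq pair, so iterate
    # over every pair that was matched instead of assuming exactly one.
    for key in ("conv_weight_q", "conv_weight_dq", "conv_bias_q", "conv_bias_dq"):
        if key in node_map:
            original_node, replacement_node = node_map[key]
            copy_over_q_dq_args(original_node, replacement_node)
```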
@pytorchbot merge
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: New commits were pushed while merging. Please rerun the merge command. (Details for Dev Infra team: raised by workflow job.)
@pytorchbot merge
Merge started: your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Pull Request resolved: pytorch#112528. Approved by: https://github.com/jerryzh168
Stack from ghstack (oldest at bottom):
Summary: Previously, QAT fusion assumed the bias was not quantized. This works for the existing XNNPACKQuantizer, but not for custom quantizers that wish to quantize the bias. This commit adds the necessary patterns to support that case. Supporting it requires refactoring the code, however, since the code previously assumed there would be only one q-dq pair (from the conv weight) in the matched pattern, and this is no longer true. (An illustrative sketch of such a bias annotation appears after this description.)
Test Plan:
python test/test_quantization.py TestQuantizePT2EQAT.test_qat_conv_bn_bias_derived_qspec
Reviewers: jerryzh168, kimishpatel
Subscribers: jerryzh168, kimishpatel, supriyar
Differential Revision: D50856377
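The summary above mentions custom quantizers that quantize the conv bias; the test in the Test Plan exercises this with a derived quantization spec. Below is a minimal sketch, under assumptions, of how such an annotation could look: the bias input of a conv node is given a `DerivedQuantizationSpec` whose qparams are derived from the input-activation and weight observers, so a bias q-dq pair appears in the prepared graph for QAT fusion to match. The helper names (`annotate_conv_bias`, `_derive_bias_qparams`) and the specific qparam choices are illustrative, not taken from the actual test.

```python
# Illustrative sketch of annotating a conv bias with a derived qspec.
# Assumed names: annotate_conv_bias, _derive_bias_qparams. The aten conv node
# is assumed to take (input, weight, bias, ...) as its first three args, and
# the weight is assumed to be quantized per-tensor to keep the example simple.
import torch
from torch.ao.quantization.quantizer import (
    DerivedQuantizationSpec,
    QuantizationAnnotation,
)

def _derive_bias_qparams(obs_or_fqs):
    # A common convention: bias_scale = act_scale * weight_scale, zero_point = 0.
    act_scale, _ = obs_or_fqs[0].calculate_qparams()
    weight_scale, _ = obs_or_fqs[1].calculate_qparams()
    bias_scale = (act_scale * weight_scale).reshape(-1)
    bias_zero_point = torch.zeros_like(bias_scale, dtype=torch.int32)
    return bias_scale, bias_zero_point

def annotate_conv_bias(conv_node):
    act_node, weight_node, bias_node = conv_node.args[0], conv_node.args[1], conv_node.args[2]
    bias_qspec = DerivedQuantizationSpec(
        derived_from=[(act_node, conv_node), (weight_node, conv_node)],
        derive_qparams_fn=_derive_bias_qparams,
        dtype=torch.int32,
        quant_min=-(2**31),
        quant_max=2**31 - 1,
        qscheme=torch.per_tensor_affine,
    )
    # Attach (or extend) the node's quantization annotation with the bias spec.
    annotation = conv_node.meta.get(
        "quantization_annotation", QuantizationAnnotation(_annotated=True)
    )
    annotation.input_qspec_map[bias_node] = bias_qspec
    conv_node.meta["quantization_annotation"] = annotation
```

With an annotation along these lines, prepare_qat_pt2e inserts a q-dq pair on the conv bias as well, which is the extra pattern this PR teaches QAT fusion to match.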