[Inductor] [Quant] Enable QConv2d Unary int8-mixed-bf16 Lowering #112550

leslie-fang-intel · 2023-11-01T02:13:55Z

Stack from ghstack (oldest at bottom):

Summary

PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor [RFC] Enable Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640.
Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor.

TestPlan

python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler

[ghstack-poisoned]

pytorch-bot · 2023-11-01T02:13:59Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112550

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c6718ec with merge base 8bdce9b ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/_inductor/fx_passes/quantization.py

ghstack-source-id: f5bbb4a Pull Request resolved: pytorch#112550

…wering" **Summary** - PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

ghstack-source-id: 1838155 Pull Request resolved: pytorch#112550

…wering" **Summary** - PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

ghstack-source-id: 8e8e9d1 Pull Request resolved: pytorch#112550

…wering" **Summary** - PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

leslie-fang-intel · 2023-11-06T06:08:21Z

Hi @eellison @jerryzh168, Could you kindly help to review these 3 PRs in this ghstack for the Inductor lowering of int8-mixed-bf16 quantization case?

eellison · 2023-11-07T23:55:58Z

@jerryzh168 want to review this one ?

torch/_inductor/fx_passes/quantization.py

test/inductor/test_mkldnn_pattern_matcher.py

jerryzh168

LG overall, please add more some docs

…wering" **Summary** - PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler [ghstack-poisoned]

leslie-fang-intel · 2023-11-10T08:57:03Z

@pytorchbot merge

pytorchmergebot · 2023-11-10T08:59:06Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…12551) **Summary** - PR 6 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor #111640. - Enable the QConv2d Binary int8-mixed-bf16 post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` Pull Request resolved: #112551 Approved by: https://github.com/jgong5, https://github.com/eellison, https://github.com/jerryzh168 ghstack dependencies: #112550

…orch#112550) **Summary** - PR 5 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor pytorch#111640. - Enable the QConv2d Unary int8-mixed-bf16 weight prepack and post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` Pull Request resolved: pytorch#112550 Approved by: https://github.com/jgong5, https://github.com/jerryzh168

…torch#112551) **Summary** - PR 6 for enabling Int8-Mixed-BF16 PT2E PTQ Quantization with Inductor pytorch#111640. - Enable the QConv2d Binary int8-mixed-bf16 post grad lowering inside inductor. **TestPlan** ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qconv2d ``` Pull Request resolved: pytorch#112551 Approved by: https://github.com/jgong5, https://github.com/eellison, https://github.com/jerryzh168 ghstack dependencies: pytorch#112550

* Most part of testcases work properly on Navi48(gfx1201) with TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1, in this commit enable it for this arch. No support of AOTriton currently for Navci44(gfx1200), so these testcases just skipped. * test_qconv2d_int8_mixed_bf16 skipped because it was originally skipped in pytorch#112550 but later lost. * test_sac_ilp_case1 skipped as per SWDEV-509011 * test_distributed_checkpoint_state_dict_type[0-1]_cuda fixed bug with arguments.

… SDPA or Navi4x (#2213) [release/2.6][SWDEV-523736] Skip some testcases for archs without SDPA or Navi4x [SWDEV-523736] Fix some unittests for Navi4x * Most part of testcases work properly on Navi48(gfx1201) with TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1, in this commit enable it for this arch. No support of AOTriton currently for Navci44(gfx1200), so these testcases just skipped. * test_qconv2d_int8_mixed_bf16 skipped because it was originally skipped in pytorch#112550 but later lost. * test_sac_ilp_case1 skipped as per SWDEV-509011 * test_distributed_checkpoint_state_dict_type[0-1]_cuda fixed bug with arguments. Fixes #SWDEV-523736

[Inductor] [Quant] Enable QConv2d Unary int8-mixed-bf16 Lowering

2ed7db4

[ghstack-poisoned]

This was referenced Nov 1, 2023

Enable oneDNN QConv FP32/BF16 output #112010

Closed

Enable oneDNN QLinear FP32/BF16 output #112126

Closed

github-actions bot added module: inductor ciflow/inductor labels Nov 1, 2023

leslie-fang-intel requested review from Xia-Weiwen and jgong5 November 1, 2023 02:16

pytorchbot added the open source label Nov 1, 2023

leslie-fang-intel added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 1, 2023

jgong5 reviewed Nov 1, 2023

View reviewed changes

torch/_inductor/fx_passes/quantization.py Outdated Show resolved Hide resolved

leslie-fang-intel added a commit to leslie-fang-intel/pytorch that referenced this pull request Nov 2, 2023

[Inductor] [Quant] Enable QConv2d Unary int8-mixed-bf16 Lowering

56444ec

ghstack-source-id: f5bbb4a Pull Request resolved: pytorch#112550

leslie-fang-intel added a commit to leslie-fang-intel/pytorch that referenced this pull request Nov 3, 2023

[Inductor] [Quant] Enable QConv2d Unary int8-mixed-bf16 Lowering

bc4ef78

ghstack-source-id: 1838155 Pull Request resolved: pytorch#112550

leslie-fang-intel added a commit to leslie-fang-intel/pytorch that referenced this pull request Nov 3, 2023

[Inductor] [Quant] Enable QConv2d Unary int8-mixed-bf16 Lowering

16b5da1

ghstack-source-id: 8e8e9d1 Pull Request resolved: pytorch#112550

leslie-fang-intel mentioned this pull request Nov 3, 2023

[Inductor] [Quant] Re-structure Quantization testcase pattern matcher check #112570

Closed

leslie-fang-intel added 2 commits November 3, 2023 16:35

leslie-fang-intel requested a review from jgong5 November 4, 2023 23:56

leslie-fang-intel added the topic: not user facing topic category label Nov 4, 2023

jgong5 approved these changes Nov 5, 2023

View reviewed changes

leslie-fang-intel requested review from eellison and jerryzh168 November 6, 2023 06:06