[AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight #158535

hl475 · 2025-07-17T04:07:19Z

Test Plan:

Rollback Plan:

Differential Revision: D78458214

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

pytorch-bot · 2025-07-17T04:07:23Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158535

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 919db33 with merge base 393377d ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-07-17T04:07:29Z

This pull request was exported from Phabricator. Differential Revision: D78458214

github-actions · 2025-07-17T04:11:17Z

Attention! native_functions.yaml was changed

If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs, one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.

Caused by:

aten/src/ATen/native/native_functions.yaml

github-actions · 2025-07-17T04:11:17Z

Attention! PyTorch one of the C-stable API file was changed

You MUST NOT change existing function declarations in this, as this header defines a stable C ABI. If you need to change the signature for a function, introduce a new v2 version of the function and modify code generation to target the new version of the function.

Caused by:

torch/csrc/inductor/aoti_torch/c/shim.h

Summary: Pull Request resolved: pytorch#158535 Test Plan: ``` buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/897908634_3.input.predictor --preset foa_early_stage_ranking ``` this will reduce the number of `aten::add` from 184 to 146 Rollback Plan: Differential Revision: D78458214

facebook-github-bot · 2025-07-17T04:25:45Z

This pull request was exported from Phabricator. Differential Revision: D78458214

Summary: Pull Request resolved: pytorch#158535 Test Plan: ``` buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/897908634_3.input.predictor --preset foa_early_stage_ranking ``` this will reduce the number of `aten::add` from 184 to 146 Rollback Plan: Differential Revision: D78458214

facebook-github-bot · 2025-07-17T06:28:20Z

This pull request was exported from Phabricator. Differential Revision: D78458214

Summary: Pull Request resolved: #158535 Test Plan: # e2e ``` buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/897908634_3.input.predictor --preset foa_early_stage_ranking ``` this will reduce the number of `aten::add` from 184 to 146 # BC & FC ## pick two models where one has bias None case and one doesn't ## publish model with diff and run predictor without diff ## publish model without diff and run predictor with diff ``` manifold get ads_storage_fblearner/tree/user/facebook/fblearner/predictor/752748048/0/lowering/.predictor.local/input_model ~/testing/752748048_0.input.predictor.local ``` ``` rm -rf /tmp/pt2_archive_* && buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/752748048_0.input.predictor.local --submodule merge --preset ads_second_stage_ranking_type_2 --lowering-backend aotinductor_ep ``` ``` buck2 run mode/opt caffe2/torch/fb/model_transform/fx2trt/packaging:load_net_predictor -- --loadMode=Benchmark --inputNetFile /tmp/pt2_archive_merge/package.zip --moduleName=merge --submodToDevice "" --using_aoti_lowering_allowlist=false --benchmarkDontRebatchSamples ``` **failed** - publish model has bias None with diff, and run predictor without diff - P1872435999 succeed - publish model has bias None without diff, and run predictor with diff - P1872444989 ``` rm -rf /tmp/pt2_archive_mix && buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/742055223_194.input.predictor.local --submodule mix --preset ads_first_stage_ranking_dsnn --lowering-backend aotinductor_ep ``` ``` buck2 run mode/opt caffe2/torch/fb/model_transform/fx2trt/packaging:load_net_predictor -- --loadMode=Benchmark --inputNetFile /tmp/pt2_archive_mix/package.zip --moduleName=mix --submodToDevice "" --using_aoti_lowering_allowlist=false --benchmarkDontRebatchSamples ``` succeed - publish model hasn't bias None with diff, and run predictor without diff - P1872468850 succeed - publish model hasn't bias None without diff, and run predictor with diff - P1872474897 Rollback Plan: Differential Revision: D78458214

facebook-github-bot · 2025-07-21T07:50:33Z

This pull request was exported from Phabricator. Differential Revision: D78458214

Summary: Pull Request resolved: pytorch#158535 Test Plan: # e2e ``` buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/897908634_3.input.predictor --preset foa_early_stage_ranking ``` this will reduce the number of `aten::add` from 184 to 146 # BC & FC ## pick two models where one has bias None case and one doesn't ## publish model with diff and run predictor without diff ## publish model without diff and run predictor with diff ``` manifold get ads_storage_fblearner/tree/user/facebook/fblearner/predictor/752748048/0/lowering/.predictor.local/input_model ~/testing/752748048_0.input.predictor.local ``` ``` rm -rf /tmp/pt2_archive_* && buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/752748048_0.input.predictor.local --submodule merge --preset ads_second_stage_ranking_type_2 --lowering-backend aotinductor_ep ``` ``` buck2 run mode/opt caffe2/torch/fb/model_transform/fx2trt/packaging:load_net_predictor -- --loadMode=Benchmark --inputNetFile /tmp/pt2_archive_merge/package.zip --moduleName=merge --submodToDevice "" --using_aoti_lowering_allowlist=false --benchmarkDontRebatchSamples ``` **failed** - publish model has bias None with diff, and run predictor without diff - P1872435999 succeed - publish model has bias None without diff, and run predictor with diff - P1872444989 ``` rm -rf /tmp/pt2_archive_mix && buck2 run mode/opt deeplearning/aot_inductor/cpu:cli -- --local-model-path ~/testing/742055223_194.input.predictor.local --submodule mix --preset ads_first_stage_ranking_dsnn --lowering-backend aotinductor_ep ``` ``` buck2 run mode/opt caffe2/torch/fb/model_transform/fx2trt/packaging:load_net_predictor -- --loadMode=Benchmark --inputNetFile /tmp/pt2_archive_mix/package.zip --moduleName=mix --submodToDevice "" --using_aoti_lowering_allowlist=false --benchmarkDontRebatchSamples ``` succeed - publish model hasn't bias None with diff, and run predictor without diff - P1872468850 succeed - publish model hasn't bias None without diff, and run predictor with diff - P1872474897 Rollback Plan: Differential Revision: D78458214

facebook-github-bot · 2025-07-21T14:36:46Z

This pull request was exported from Phabricator. Differential Revision: D78458214

facebook-github-bot · 2025-07-21T23:24:41Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2025-07-21T23:26:33Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…torch#158535) Test Plan: Rollback Plan: Differential Revision: D78458214 Pull Request resolved: pytorch#158535 Approved by: https://github.com/houseroad, https://github.com/henryoier, https://github.com/jingsh

hl475 requested review from digantdesai, jerryzh168, jianyuh, kimishpatel and salilsdesai as code owners July 17, 2025 04:07

pytorch-bot bot added ciflow/inductor module: cpu CPU specific problem (e.g., perf, algorithm) module: inductor release notes: quantization release notes category release notes: inductor (aoti) labels Jul 17, 2025

facebook-github-bot added the fb-exported label Jul 17, 2025

hl475 force-pushed the export-D78458214 branch from 21fc4d7 to 93b5591 Compare July 17, 2025 04:25

hl475 force-pushed the export-D78458214 branch from 93b5591 to ab6867d Compare July 17, 2025 06:28

hl475 force-pushed the export-D78458214 branch from ab6867d to 434eb6e Compare July 21, 2025 07:50

hl475 force-pushed the export-D78458214 branch from 434eb6e to 919db33 Compare July 21, 2025 14:36

hl475 changed the title ~~[WIP] bias None case~~ [AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight Jul 21, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 21, 2025

hl475 requested a review from desertfire July 21, 2025 20:52

hl475 requested review from houseroad and muchulee8 July 21, 2025 20:52

houseroad approved these changes Jul 21, 2025

View reviewed changes

henryoier approved these changes Jul 21, 2025

View reviewed changes

jingsh approved these changes Jul 21, 2025

View reviewed changes

pytorchmergebot added the merging label Jul 21, 2025

pytorchmergebot added the Merged label Jul 21, 2025

pytorchmergebot closed this in 2c37acf Jul 21, 2025

pytorchmergebot removed the merging label Jul 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight #158535

[AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight #158535

Uh oh!

hl475 commented Jul 17, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Jul 17, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

github-actions bot commented Jul 17, 2025

Uh oh!

github-actions bot commented Jul 17, 2025

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

pytorchmergebot commented Jul 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight #158535

[AOTI][CPU] Consider bias=None case for fbgemm_linear_fp16_weight #158535

Uh oh!

Conversation

hl475 commented Jul 17, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158535

✅ No Failures

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

github-actions bot commented Jul 17, 2025

Attention! native_functions.yaml was changed

Uh oh!

github-actions bot commented Jul 17, 2025

Attention! PyTorch one of the C-stable API file was changed

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

facebook-github-bot commented Jul 17, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

facebook-github-bot commented Jul 21, 2025

Uh oh!

pytorchmergebot commented Jul 21, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

hl475 commented Jul 17, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jul 17, 2025 •

edited

Loading