[QNN EP] Fuse scale into softmax #24809

qti-yuduo · 2025-05-19T18:33:24Z

QNN Softmax op defines pre-scale (beta) that we can fold constant scalar multiply into it.

qti-yuduo · 2025-05-19T18:34:03Z

@microsoft-github-policy-service agree [company=Qualcomm]

qti-yuduo · 2025-05-19T18:37:11Z

@microsoft-github-policy-service agree

yuslepukhin · 2025-05-19T19:26:13Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,ONNX Runtime Web CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline

yuslepukhin · 2025-05-19T19:26:15Z

/azp run Linux QNN CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2025-05-19T19:26:33Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2025-05-19T19:26:33Z

Azure Pipelines successfully started running 4 pipeline(s).

onnxruntime/core/providers/qnn/builder/qnn_node_group/scale_softmax_fusion.h

onnxruntime/test/providers/qnn/qnn_node_group/scale_softmax_fusion_test.cc

HectorSVC · 2025-05-20T17:19:46Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-05-20T17:20:08Z

Azure Pipelines successfully started running 5 pipeline(s).

HectorSVC · 2025-05-20T21:23:47Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-05-20T21:24:07Z

Azure Pipelines successfully started running 5 pipeline(s).

HectorSVC · 2025-05-22T15:33:53Z

You only need to check the QNN related build. You should have access to the build pipeline, so is the build log.
For Linux:
/mnt/vss/_work/_temp/bfb99205-f415-44a0-be47-77d7f5a6877e.sh: line 3: 118621 Segmentation fault (core dumped) ./build/Release/onnx_test_runner -e qnn -j 1 -i "backend_path|/mnt/vss/_work/_temp/qnn-v2.33.2.250410/lib/x86_64-linux-clang/libQnnCpu.so" cmake/external/onnx/onnx/backend/test/data/node
You can run onnx_test_runner to reproduce it.

For Windows:
1: [ FAILED ] 5 tests, listed below:
1: [ FAILED ] QnnHTPBackendTests.ScaleSoftmaxFusionScalarInitializer
1: [ FAILED ] QnnHTPBackendTests.ScaleSoftmaxFusionScalarConstant
1: [ FAILED ] QnnHTPBackendTests.ScaleSoftmaxFusionScalarInitializerReversed
1: [ FAILED ] QnnHTPBackendTests.ScaleSoftmaxFusionScalarConstantReversed
1: [ FAILED ] QnnHTPBackendTests.ScaleSoftmaxFusionSoftmaxNegativeAxis
1:
1: 5 FAILED TESTS
1: YOU HAVE 13 DISABLED TESTS
1:
1/9 Test #1: onnxruntime_test_all ....................***Failed 248.51 sec
You can run command to repro:
onnxruntime_test_all.exe --gtest_filter=QnnHTPBackendTests.ScaleSoftmaxFusionScalarInitializer

qti-yuduo · 2025-05-22T21:27:09Z

@HectorSVC mind help trigger CI again? Thank you!!

HectorSVC · 2025-05-22T23:40:41Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-05-22T23:41:00Z

Azure Pipelines successfully started running 5 pipeline(s).

qti-yuduo · 2025-05-23T17:34:34Z

I can repro the Linux error, It should be fixed now.

HectorSVC · 2025-05-23T21:15:06Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

azure-pipelines · 2025-05-23T21:15:27Z

Azure Pipelines successfully started running 5 pipeline(s).

qti-yuduo · 2025-05-28T17:06:35Z

ping.

HectorSVC

QNN [Softmax op defines pre-scale (`beta`)](https://docs.qualcomm.com/bundle/publicresource/topics/80-63442-50/MasterOpDef.html#softmax) that we can fold constant scalar multiply into it.

### Description - #24265 - #24616 - #24640 - #24707 - #24646 - #24750 - #24809 - #24895 - #24820 - #25002 - #25171 - #25283 - #24818 - #25351 - #25361 - #25388 - #25520 - #25158 ### Motivation and Context  --------- Co-authored-by: quic-zhaoxul <quic_zhaoxul@quicinc.com> Co-authored-by: Yuduo Wu <6426433+1duo@users.noreply.github.com> Co-authored-by: Hector Li <hecli@microsoft.com> Co-authored-by: chenweng-quic <168707118+chenweng-quic@users.noreply.github.com> Co-authored-by: qti-yuduo <yuduow@qti.qualcomm.com> Co-authored-by: Akupadhye <aupadhye@qti.qualcomm.com> Co-authored-by: Jeff Kilpatrick <jkilpatrick@qti.qualcomm.com> Co-authored-by: Jeff Kilpatrick <jkilpat@qti.qualcomm.com> Co-authored-by: George Wu <jywu@microsoft.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: quic-calvnguy <quic_calvnguy@quicinc.com> Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>

qti-yuduo force-pushed the dev/yuduow/scale-softmax-fusion branch from 3b33063 to 2e8583a Compare May 19, 2025 18:36

yuslepukhin reviewed May 19, 2025

View reviewed changes

onnxruntime/core/providers/qnn/builder/qnn_node_group/scale_softmax_fusion.h Outdated Show resolved Hide resolved

github-advanced-security bot found potential problems May 19, 2025

View reviewed changes

onnxruntime/test/providers/qnn/qnn_node_group/scale_softmax_fusion_test.cc Fixed Show fixed Hide fixed

edgchen1 added the ep:QNN issues related to QNN exeution provider label May 20, 2025

qti-yuduo force-pushed the dev/yuduow/scale-softmax-fusion branch from f1b7014 to ecbbcf7 Compare May 20, 2025 21:14

qti-yuduo added 5 commits May 23, 2025 11:31

[QNN-EP] Fuse pre-scale (multiply) into Softmax op

d0f94ba

Address review feedback

8372bfe

Address review feedback

7129899

Fix CI runs on Windows tests

c7ad7df

Fix CI failure

09a403d

qti-yuduo force-pushed the dev/yuduow/scale-softmax-fusion branch from 5a40bde to 09a403d Compare May 23, 2025 20:06

HectorSVC approved these changes May 28, 2025

View reviewed changes

HectorSVC merged commit f9739c2 into microsoft:main May 28, 2025
82 checks passed

qti-yuduo deleted the dev/yuduow/scale-softmax-fusion branch July 18, 2025 20:14

adrianlizarraga mentioned this pull request Aug 1, 2025

rel-1.22.2 cherry-pick 1 #25633

Merged

[QNN EP] Fuse scale into softmax #24809

[QNN EP] Fuse scale into softmax #24809

Uh oh!

Conversation

qti-yuduo commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qti-yuduo commented May 19, 2025

Uh oh!

qti-yuduo commented May 19, 2025

Uh oh!

yuslepukhin commented May 19, 2025

Uh oh!

yuslepukhin commented May 19, 2025

Uh oh!

azure-pipelines bot commented May 19, 2025

Uh oh!

azure-pipelines bot commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!

HectorSVC commented May 20, 2025

Uh oh!

azure-pipelines bot commented May 20, 2025

Uh oh!

HectorSVC commented May 20, 2025

Uh oh!

azure-pipelines bot commented May 20, 2025

Uh oh!

HectorSVC commented May 22, 2025

Uh oh!

qti-yuduo commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HectorSVC commented May 22, 2025

Uh oh!

azure-pipelines bot commented May 22, 2025

Uh oh!

qti-yuduo commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HectorSVC commented May 23, 2025

Uh oh!

azure-pipelines bot commented May 23, 2025

Uh oh!

qti-yuduo commented May 28, 2025

Uh oh!

HectorSVC left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

qti-yuduo commented May 19, 2025 •

edited

Loading

qti-yuduo commented May 22, 2025 •

edited

Loading

qti-yuduo commented May 23, 2025 •

edited

Loading