[QNN EP] QNN SDK 2.28.2 by adrianlizarraga · Pull Request #22844 · microsoft/onnxruntime · GitHub

@adrianlizarraga
Contributor

@adrianlizarraga adrianlizarraga commented Nov 14, 2024

Description

  • Updates pipelines to use QNN SDK 2.28.2.241116.
  • Re-enables LayerNormalization unit tests that failed with accuracy errors under the previous QNN SDK (2.28.0).
  • Updates the QNN EP to no longer provide a dummy bias for LayerNorm when the QNN SDK version is >= 2.28.0.
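
The version gate in the last bullet can be sketched as follows. This is a minimal illustration, not the QNN EP's code: the EP itself is written in C++ and may well make this decision at compile time. The helper names `parse_sdk_version` and `needs_dummy_bias` are hypothetical, and the sketch assumes the dummy bias was only ever supplied when the LayerNorm node had no bias input of its own.

```python
from typing import Tuple

# Threshold taken from the PR description: from QNN SDK 2.28.0 onward,
# the dummy bias is no longer provided.
MIN_SDK_WITHOUT_DUMMY_BIAS = (2, 28, 0)


def parse_sdk_version(version: str) -> Tuple[int, ...]:
    """Split a dotted version such as '2.28.2.241116' into integers."""
    return tuple(int(part) for part in version.split("."))


def needs_dummy_bias(sdk_version: str, has_bias: bool) -> bool:
    """Return True if a placeholder bias should be supplied (hypothetical helper).

    Assumption: a node that already has a bias never needs a dummy one.
    """
    if has_bias:
        return False
    # Compare only major.minor.patch; the trailing build stamp is ignored.
    return parse_sdk_version(sdk_version)[:3] < MIN_SDK_WITHOUT_DUMMY_BIAS
```

With the SDK version from this PR, `needs_dummy_bias("2.28.2.241116", has_bias=False)` evaluates to `False`, while an older version string such as `"2.27.0"` would still request the placeholder.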

Motivation and Context

Use the latest QNN SDK. This version improves inference latency for certain customer models.

@sophies927 sophies927 added the release:1.20.1 and triage:approved (Approved for cherrypicks for release) labels Nov 18, 2024
@adrianlizarraga adrianlizarraga marked this pull request as ready for review November 19, 2024 00:18
@adrianlizarraga adrianlizarraga requested a review from a team November 19, 2024 00:18
@adrianlizarraga adrianlizarraga changed the title from "[QNN EP] [DRAFT] QNN SDK 2.28.2" to "[QNN EP] QNN SDK 2.28.2" Nov 19, 2024
@yf711 yf711 merged commit 497b06f into main Nov 19, 2024
93 checks passed
@yf711 yf711 deleted the adrianl/qnn-sdk-2.28.2 branch November 19, 2024 04:10
yf711 pushed a commit that referenced this pull request Nov 19, 2024
yf711 added a commit that referenced this pull request Nov 19, 2024
### Description
All three PRs are cherry-picked in this round:
1. Refactor SkipLayerNorm and handle beta properly (#22862)
2. [TensorRT EP] Exclude DDS ops from running on TRT (#22875)
3. [QNN EP] QNN SDK 2.28.2 (#22844)
### Motivation and Context

---------

Signed-off-by: Liqun Fu <liqfu@microsoft.com>
Signed-off-by: Liqun Fu <liqun.fu@microsoft.com>
Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com>
Co-authored-by: liqun Fu <liqfu@microsoft.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>
mszhanyi pushed a commit that referenced this pull request Nov 22, 2024
guschmue pushed a commit that referenced this pull request Dec 2, 2024
ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024
ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024
ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request Dec 11, 2024
@snnn
Member

snnn commented Sep 5, 2025

This PR has been cherry-picked into the rel-1.20.1 branch in PR #22845. Removing the release:1.20.1 label.


Labels

triage:approved Approved for cherrypicks for release

6 participants