KEMBAR78
Add FP16 support to batchedNMSPlugin by pbridger · Pull Request #1002 · NVIDIA/TensorRT · GitHub
Skip to content

Conversation

@pbridger
Copy link
Contributor

Add support for float16 boxes and scores (but not mixed precision between boxes and scores).

Gives inference results within expected numerical tolerances using SSD300 and COCO2017 validation set. (As detailed here https://paulbridger.com/posts/tensorrt-object-detection-quantized/).

Despite the "#if CUDA_ARCH >= 800" this has not been tested on Ampere, only Turing arch.

@rajeevsrao
Copy link
Collaborator

Thanks @pbridger. Can you please sign your commits with git commit --amend -s

@rajeevsrao rajeevsrao self-requested a review January 12, 2021 21:49
@rajeevsrao rajeevsrao self-assigned this Jan 12, 2021
@rajeevsrao rajeevsrao added Module:Plugins Issues when using TensorRT plugins Feature Request Request for new functionality Precision: INT8 Module:Performance General performance issues labels Jan 13, 2021
@pranavm-nvidia
Copy link
Collaborator

@rajeevsrao Does this need to be integrated into master as well?

@rajeevsrao
Copy link
Collaborator

@rajeevsrao Does this need to be integrated into master as well?

Yes, but we will cherry-pick it internally first, test and release to master via 21.0x and then merge this change.

Tyler-D and others added 5 commits January 14, 2021 01:36
Signed-off-by: Tyler Zhu <tylerz@nvidia.com>
Signed-off-by: Paul Bridger <paul@paulbridger.com>
Signed-off-by: Paul Bridger <paul@paulbridger.com>
Signed-off-by: Paul Bridger <paul@paulbridger.com>
Signed-off-by: Paul Bridger <paul@paulbridger.com>
Signed-off-by: Rajeev Rao <rajeevrao@nvidia.com>
Signed-off-by: Paul Bridger <paul@paulbridger.com>
@rajeevsrao
Copy link
Collaborator

Thanks @pbridger. Will merge this commit once the corresponding cherry-pick for master is posted alongwith the 21.02 container update.

@rajeevsrao rajeevsrao merged commit 7ca28ec into NVIDIA:release/7.2 Jan 14, 2021
@pbridger
Copy link
Contributor Author

Cool! Many thanks @rajeevsrao and @pranavm-nvidia for all your work to include this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Feature Request Request for new functionality Module:Performance General performance issues Module:Plugins Issues when using TensorRT plugins

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants