Content area

Abstract

Deep learning significantly advances object detection. Post processes, a critical component of this process, select valid bounding boxes to represent the true targets during inference and assign boxes and labels to these objects during training to optimize the loss function. However, post processes constitute a substantial portion of the total processing time for a single image. This inefficiency primarily arises from the extensive Intersection over Union (IoU) calculations required between numerous redundant bounding boxes in post processing algorithms. To reduce these redundant IoU calculations, we introduce a classification prioritization strategy during both training and inference post processes. Additionally, post processes involve sorting operations that contribute to their inefficiency. To minimize unnecessary comparisons in Top-K sorting, we have improved the bitonic sorter by developing a hybrid bitonic algorithm. These improvements have effectively accelerated the post processing. Given the similarities between the training and inference post processes, we unify four typical post processing algorithms and design a hardware accelerator based on this framework. Our accelerator achieves at least 7.55 times the speed in inference post processing compared to that of recent accelerators. When compared to the RTX 2080 Ti system, our proposed accelerator offers at least 21.93 times the speed for the training post process and 19.89 times for the inference post process, thereby significantly enhancing the efficiency of loss function minimization.

Details

1009240
Title
Object Detection Post Processing Accelerator Based on Co-Design of Hardware and Software
Author
Yang, Dengtian 1 ; Chen, Lan 2   VIAFID ORCID Logo  ; Hao, Xiaoran 2 ; Zhang, Yiheng 2 

 Institute of Microelectronics of the Chinese Academy of Sciences, Beijing 100029, China; [email protected] (D.Y.); [email protected] (X.H.); [email protected] (Y.Z.); University of Chinese Academy of Sciences, Beijing 100049, China 
 Institute of Microelectronics of the Chinese Academy of Sciences, Beijing 100029, China; [email protected] (D.Y.); [email protected] (X.H.); [email protected] (Y.Z.) 
Publication title
Volume
16
Issue
1
First page
63
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
e-ISSN
20782489
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-01-17
Milestone dates
2024-12-03 (Received); 2025-01-13 (Accepted)
Publication history
 
 
   First posting date
17 Jan 2025
ProQuest document ID
3159490312
Document URL
https://www.proquest.com/scholarly-journals/object-detection-post-processing-accelerator/docview/3159490312/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-01-25
Database
ProQuest One Academic