Abstract

As unmanned vehicle technology advances rapidly, obstacle recognition and target detection are crucial links, which directly affect the driving safety and efficiency of unmanned vehicles. In response to the inaccurate localization of small targets such as pedestrians in current object detection tasks and the problem of losing local features in the PointPillars, this paper proposes a three-dimensional object detection method based on improved PointPillars. Firstly, addressing the issue of lost spatial and local information in the PointPillars, the feature encoding part of the PointPillars is improved, and a new pillar feature enhancement extraction module, CSM-Module, is proposed. Channel encoding and spatial encoding are introduced in the new pillar feature enhancement extraction module, fully considering the spatial information and local detailed geometric information of each pillar, thereby enhancing the feature representation capability of each pillar. Secondly, based on the fusion of CSPDarknet and SENet, a new backbone network CSE-Net is designed in this paper, enabling the extraction of rich contextual semantic information and multi-scale global features, thereby enhancing the feature extraction capability. Our method achieves higher detection accuracy when validated on the KITTI dataset. Compared to the original network, the improved algorithm’s average detection accuracy is increased by 3.42%, it shows that the method is reasonable and valuable.

Details

Title
EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion
Author
Zhang, Wenbiao 1 ; Chen, Gang 2 ; Wang, Hongyan 1 ; Yang, Lina 3 ; Sun, Tao 1 

 Zhejiang Sci-Tech University, College of Information Science and Engineering, Hangzhou, China (GRID:grid.413273.0) (ISNI:0000 0001 0574 8737) 
 Jiaxing University, College of Information Science and Engineering, Jiaxing, China (GRID:grid.411870.b) (ISNI:0000 0001 0063 8301); Jiaxing Soy Intelligent Co. Ltd., Jiaxing, China (GRID:grid.411870.b) 
 Jiaxing University, College of Information Science and Engineering, Jiaxing, China (GRID:grid.411870.b) (ISNI:0000 0001 0063 8301) 
Pages
90
Publication year
2024
Publication date
Dec 2024
Publisher
Springer Nature B.V.
ISSN
16876172
e-ISSN
16876180
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3113189977
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.