EFMF-pillars: 3D object detection based on

Abstract

As unmanned vehicle technology advances rapidly, obstacle recognition and target detection are crucial links, which directly affect the driving safety and efficiency of unmanned vehicles. In response to the inaccurate localization of small targets such as pedestrians in current object detection tasks and the problem of losing local features in the PointPillars, this paper proposes a three-dimensional object detection method based on improved PointPillars. Firstly, addressing the issue of lost spatial and local information in the PointPillars, the feature encoding part of the PointPillars is improved, and a new pillar feature enhancement extraction module, CSM-Module, is proposed. Channel encoding and spatial encoding are introduced in the new pillar feature enhancement extraction module, fully considering the spatial information and local detailed geometric information of each pillar, thereby enhancing the feature representation capability of each pillar. Secondly, based on the fusion of CSPDarknet and SENet, a new backbone network CSE-Net is designed in this paper, enabling the extraction of rich contextual semantic information and multi-scale global features, thereby enhancing the feature extraction capability. Our method achieves higher detection accuracy when validated on the KITTI dataset. Compared to the original network, the improved algorithm’s average detection accuracy is increased by 3.42%, it shows that the method is reasonable and valuable.

Details

Title

EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

Author

Zhang, Wenbiao¹; Chen, Gang²; Wang, Hongyan¹; Yang, Lina³; Sun, Tao¹

¹ Zhejiang Sci-Tech University, College of Information Science and Engineering, Hangzhou, China (GRID:grid.413273.0) (ISNI:0000 0001 0574 8737)
² Jiaxing University, College of Information Science and Engineering, Jiaxing, China (GRID:grid.411870.b) (ISNI:0000 0001 0063 8301); Jiaxing Soy Intelligent Co. Ltd., Jiaxing, China (GRID:grid.411870.b)
³ Jiaxing University, College of Information Science and Engineering, Jiaxing, China (GRID:grid.411870.b) (ISNI:0000 0001 0063 8301)

Pages

Publication year

2024

Publication date

Dec 2024

Publisher

Springer Nature B.V.

ISSN

16876172

e-ISSN

16876180

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1186/s13634-024-01186-4

ProQuest document ID

3113189977

© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

Jump to:

Abstract

Details

Suggested sources