Abstract

Deep learning-based object detection methods have achieved substantial performance improvements. However, most of these methods rely on small-kernel convolutions, whose limited receptive fields make semantic features difficult to capture and prevent key information from being highlighted, leading to problems such as false detections, missed detections, and repeated detections. To overcome these problems, we propose a large kernel convolution object detection network based on feature capture enhancement and vast receptive field attention, called LKC-Net. Firstly, a feature capture enhancement block based on large kernel convolution is proposed to improve the ability to capture semantic features, and depthwise convolution is used to reduce the number of parameters. Then, a vast receptive field attention mechanism is constructed to strengthen channel-wise information extraction; it is more compatible with the proposed backbone than existing attention mechanisms. Finally, the loss function is improved by introducing the SIoU loss, which overcomes the angle mismatch between the ground-truth and predicted boxes. Experiments are conducted on the Pascal VOC and MS COCO datasets to demonstrate the performance of LKC-Net.
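As a rough illustration of the idea described in the abstract, below is a minimal PyTorch sketch of a large-kernel depthwise convolution block: the depthwise convolution widens the receptive field while keeping the parameter count low, and a pointwise (1x1) convolution mixes information across channels. The kernel size, normalization, activation, and residual connection here are assumptions chosen for illustration, not the authors' exact LKC-Net design.

```python
# Illustrative sketch only: NOT the authors' exact feature capture
# enhancement block; kernel size and layer ordering are assumptions.
import torch
import torch.nn as nn


class LargeKernelBlock(nn.Module):
    def __init__(self, channels: int, kernel_size: int = 7):
        super().__init__()
        # Depthwise convolution (groups == channels): a large spatial kernel
        # enlarges the receptive field while adding few parameters.
        self.depthwise = nn.Conv2d(
            channels, channels, kernel_size,
            padding=kernel_size // 2, groups=channels,
        )
        # Pointwise convolution mixes information across channels.
        self.pointwise = nn.Conv2d(channels, channels, kernel_size=1)
        self.norm = nn.BatchNorm2d(channels)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection preserves the original features.
        return x + self.act(self.norm(self.pointwise(self.depthwise(x))))


if __name__ == "__main__":
    block = LargeKernelBlock(channels=64)
    y = block(torch.randn(1, 64, 32, 32))
    print(y.shape)  # torch.Size([1, 64, 32, 32])
```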

Details

Title
LKC-Net: large kernel convolution object detection network
Author
Wang, Weina 1; Li, Shuangyong 1; Shao, Jiapeng 1; Jumahong, Huxidan 2

1 Jilin Institute of Chemical Technology, College of Information and Control Engineering, Jilin, China (GRID:grid.443416.0) (ISNI:0000 0000 9865 0124)
2 YiLi Normal University, School of Network Security and Information Technology, Yining, China (GRID:grid.440770.0) (ISNI:0000 0004 1757 2996)
Pages
9535
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
2045-2322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2825583864
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.