Full text

Turn on search term navigation

Copyright © 2022 Liming Zhou et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

Due to the multiscale characteristics of ship targets in ORSIs (optical remote sensing images), ship target detection in ORSIs based on depth learning is still facing great challenges. Aiming at the low accuracy of multiscale ship target detection in ORSIs, this paper proposes a ship target detection algorithm based on multiscale feature enhancement based on YOLO v4. Firstly, an improved mixed convolution is introduced into the IRes (inverted residual block) to form an MIRes (mixed inverted residual block). The MIRes are used to replace the Res (residual block) in the deep CSP module of the backbone network to enhance the multiscale feature extraction capability of the backbone network. Secondly, for different scale feature maps’ perception fields, feature information, and the scale of the detected objects, the multiscale feature enhancement modules—SFEM (small scale feature enhancement module) and MFEM (middle scale feature enhancement module)—are proposed to enhance the feature information of the middle- and low-level feature maps, respectively, and then the enhanced feature maps are sent to the detection head for detection. Finally, experiments were implemented on the LEVIR-ship dataset and the NWPU VHR-10 dataset. The accuracy of the proposed algorithm in ship target detection reached 79.55% and 90.70%, respectively, which is improved by 3.25% and 3.56% compared with YOLO v4.

Details

Title
Ship Target Detection in Optical Remote Sensing Images Based on Multiscale Feature Enhancement
Author
Zhou, Liming 1   VIAFID ORCID Logo  ; Li, Yahui 2   VIAFID ORCID Logo  ; Rao, Xiaohan 1   VIAFID ORCID Logo  ; Liu, Cheng 1   VIAFID ORCID Logo  ; Zuo, Xianyu 1   VIAFID ORCID Logo  ; Liu, Yang 2   VIAFID ORCID Logo 

 Henan Key Laboratory of Big Data Analysis and Processing, Henan University, Kaifeng, Henan, China; School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China 
 Henan Key Laboratory of Big Data Analysis and Processing, Henan University, Kaifeng, Henan, China; School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China; Henan Province Engineering Research Center of Spatial Information Processing and Shenzhen Research Institute, Henan University, Kaifeng 475004, China 
Editor
Wenming Cao
Publication year
2022
Publication date
2022
Publisher
John Wiley & Sons, Inc.
ISSN
16875265
e-ISSN
16875273
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2725126289
Copyright
Copyright © 2022 Liming Zhou et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/