Full Text

Turn on search term navigation

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Arbitrary-oriented object detection (AOOD) is a crucial task in aerial image analysis but is also faced with significant challenges. In current AOOD detectors, commonly used multi-scale feature fusion modules fall short in spatial and semantic information complement between scales. Additionally, fixed feature extraction structures are usually used following a fusion model, resulting in the inability of detectors to self-adjust. At the same time, feature fusion and extraction modules are designed in isolation and the internal synergy between them is ignored. The above problems result in feature representation deficiency, thus affecting the overall detection precision. To solve these problems, we first create a fine-grained feature pyramid network (FG-FPN) that not only provides richer spatial and semantic features, but also completes neighbor scale features in a self-learning mode. Subsequently, we propose a novel feature enhancement module (FEM) to fit FG-FPN. FEM authorizes the detection unit to automatically adjust the sensing area and adaptively suppress background interference, thereby generating stronger feature representations. Our proposed solution was tested through extensive experiments on challenging datasets, including DOTA (77.44% mAP), HRSC2016 (97.82% mAP), UCAS-AOD (91.34% mAP), as well as ICDAR2015 (86.27% F-score) and its effectiveness and high applicability are verified on all the above datasets.

Details

Title
AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
Author
Zhang, Tianwei 1   VIAFID ORCID Logo  ; Xu, Sun 2   VIAFID ORCID Logo  ; Zhuang, Lina 2 ; Dong, Xiaoyu 3 ; Sha, Jianjun 4 ; Zhang, Bing 5   VIAFID ORCID Logo  ; Zheng, Ke 6 

 Key Laboratory of Computational Optical Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China; [email protected]; Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China; International Research Center of Big Data for Sustainable Development Goals, Chinese Academy of Sciences, Beijing 100094, China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China 
 Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China 
 Department of Complexity Science and Engineering, The University of Tokyo, Tokyo 277-8561, Japan 
 College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China; [email protected]; Qingdao Innovation and Development Center, Harbin Engineering University, Qingdao 266000, China 
 College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China 
 College of Geography and Environment, Liaocheng University, Liaocheng 252059, China 
First page
4965
Publication year
2023
Publication date
2023
Publisher
MDPI AG
e-ISSN
20724292
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2882800154
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.