Full text

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Underwater sonar target recognition is crucial in fields such as national defense, navigation, and environmental monitoring. However, it is hampered by the complex characteristics of ship-radiated noise, imbalanced data distributions, signal non-stationarity, and the limitations of existing methods. This paper proposes the MultiFuseNet-AID network to address these challenges. The network comprises the TriFusion block module, the novel lightweight attention residual network (NLARN), the long- and short-term attention (LSTA) module, and the Mamba module. The TriFusion block processes the original, differential, and cumulative signals in parallel and fuses features such as MFCC, CQT, and Fbank, achieving deep multi-domain feature fusion and enhancing signal representation. The NLARN is an optimized ResNet-based architecture with an embedded SE attention mechanism. Combined with the LSTA and Mamba modules, it captures long-sequence dependencies with O(N) complexity, providing lightweight long-sequence modeling. In addition, feature fusion, together with the layer normalization and residual connections of the Mamba module, improves the model's adaptability in complex scenarios with imbalanced data and strong noise. On the DeepShip and ShipsEar datasets, the model reaches recognition rates of 98.39% and 99.77%, respectively. Its parameter count and number of floating-point operations are significantly lower than those of classical models, and it shows good stability and generalization under different sample label ratios. The results indicate that the MultiFuseNet-AID network effectively breaks through the bottlenecks of existing technologies and provides a new direction for the development of underwater sonar target recognition technology. However, there is still room for improvement in adaptability to extreme underwater environments, training efficiency, and deployment on ultra-small devices.
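
The three parallel signal views named in the abstract can be illustrated with a minimal sketch. The abstract does not define the differential and cumulative signals, so this example assumes a first-order difference and a running sum. The function names `signal_branches` and `fuse` are hypothetical. Extraction of MFCC, CQT, and Fbank features is omitted, and plain concatenation stands in for the learned fusion described by the authors.

```python
from itertools import accumulate
from typing import Dict, List


def signal_branches(x: List[float]) -> Dict[str, List[float]]:
    """Return three views of a 1-D signal (assumed definitions, not the paper's)."""
    if not x:
        raise ValueError("signal must be non-empty")
    # Assumed: first-order difference, with a leading 0.0 so every
    # branch has the same length as the input.
    differential = [0.0] + [b - a for a, b in zip(x, x[1:])]
    # Assumed: running sum of the samples.
    cumulative = list(accumulate(x))
    return {
        "original": list(x),
        "differential": differential,
        "cumulative": cumulative,
    }


def fuse(feature_sets: List[List[float]]) -> List[float]:
    """Concatenate per-branch feature vectors (a stand-in for learned fusion)."""
    fused: List[float] = []
    for features in feature_sets:
        fused.extend(features)
    return fused


branches = signal_branches([1.0, 3.0, 6.0, 10.0])
fused = fuse([branches["original"], branches["differential"], branches["cumulative"]])
```

In a real pipeline each branch would pass through a spectral feature extractor before fusion. The sketch only shows that the three views share one length and can be processed side by side.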

Details

Title
Optimization Strategy for Underwater Target Recognition Based on Multi-Domain Feature Fusion and Deep Learning
Author
Lu, Yanyang 1; Ding, Lichao 1; Chen, Ming 1; Shi, Danping 1; Xie, Guohao 2; Zhang, Yuxin 2; Jiang, Hongyan 2; Chen, Zhe 2

1 Hangzhou Applied Acoustics Research Institute, Hangzhou 310023, China; [email protected] (Y.L.); [email protected] (M.C.); [email protected] (D.S.)
2 School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China; [email protected] (G.X.); [email protected] (Y.Z.); [email protected] (H.J.)
First page
1311
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
2077-1312
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3233227517