Full Text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In recent years, weakly supervised semantic segmentation (WSSS) has garnered significant attention in remote sensing image analysis due to its low annotation cost. To address the issues of inaccurate and incomplete seed areas and unreliable pseudo masks in WSSS, we propose a novel WSSS method for remote sensing images based on the Siamese Affinity Network (SAN) and the Segment Anything Model (SAM). First, we design a seed enhancement module for semantic affinity, which strengthens contextual relevance in the feature map by enforcing a unified constraint principle of cross-pixel similarity, thereby capturing semantically similar regions within the image. Second, leveraging the prior notion of cross-view consistency, we employ a Siamese network to regularize the consistency of CAMs from different affine-transformed images, providing additional supervision for weakly supervised learning. Finally, we utilize the SAM segmentation model to generate semantic superpixels, expanding the original CAM seeds to more completely and accurately extract target edges, thereby improving the quality of segmentation pseudo masks. Experimental results on the large-scale remote sensing datasets DRLSD and ISPRS Vaihingen demonstrate that our method achieves segmentation performance close to that of fully supervised semantic segmentation (FSSS) methods on both datasets. Ablation studies further verify the positive optimization effect of each module on segmentation pseudo labels. Our approach exhibits superior localization accuracy and precise visualization effects across different backbone networks, achieving state-of-the-art localization performance.

Details

Title
Weakly Supervised Semantic Segmentation of Remote Sensing Images Using Siamese Affinity Network
Author
Chen, Zheng 1   VIAFID ORCID Logo  ; Lian, Yuheng 2   VIAFID ORCID Logo  ; Bai, Jing 2   VIAFID ORCID Logo  ; Zhang, Jingsen 2   VIAFID ORCID Logo  ; Zhu, Xiao 3   VIAFID ORCID Logo  ; Hou, Biao 2   VIAFID ORCID Logo 

 The Key Laboratory of Intelligent Perception and Image Understanding of the Ministry of Education, School of Artificial Intelligence, Xidian University, Xi’an 710071, China; [email protected] (Z.C.); [email protected] (Y.L.); [email protected] (J.Z.); [email protected] (B.H.); China Mobile Tietong Co., Ltd., Shanxi Branch, Taiyuan 030032, China 
 The Key Laboratory of Intelligent Perception and Image Understanding of the Ministry of Education, School of Artificial Intelligence, Xidian University, Xi’an 710071, China; [email protected] (Z.C.); [email protected] (Y.L.); [email protected] (J.Z.); [email protected] (B.H.) 
 The College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China; [email protected]; The Shenzhen Research Institute, Hunan University, Shenzhen 518055, China 
First page
808
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
20724292
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3176395027
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.