Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Image-level weakly supervised semantic segmentation is a challenging problem in computer vision and has gained a lot of attention in recent years. Most existing models utilize class activation mapping (CAM) to generate initial pseudo-labels for each image pixel. However, CAM usually focuses only on the most discriminating regions of target objects and treats each channel feature map independently, which may overlook some important regions due to the lack of accurate pixel-level labels, leading to the underactivation of the target objects. In this paper, we propose a dual attention equivariant network (DAEN) model to address this problem by considering both channel and spatial information of different feature maps. Specifically, we first design a channel–spatial attention module (CSM) for DAEN to extract accurately features of target objects by considering the correlation among feature maps in different channels, and then integrate the CSM with equivariant regularization and pixel-correlation modules to achieve more accurate and effective pixel-level semantic segmentation. Extensive experimental results show that the DAEN model achieved 2.1% and 1.3% higher mIoU scores than the existing weakly supervised semantic segmentation models on the PASCAL VOC 2012 and LUAD-HistoSeg datasets, respectively, validating the effectiveness and efficiency of the DAEN model.

Details

Title
Dual Attention Equivariant Network for Weakly Supervised Semantic Segmentation
Author
Huang Guanglun 1 ; Zheng Zhaohao 2 ; Li, Jun 2   VIAFID ORCID Logo  ; Zhang, Minghe 2 ; Liu, Jianming 2 ; Zhang, Li 2 

 School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China; [email protected] (G.H.); [email protected] (Z.Z.); [email protected] (J.L.); [email protected] (M.Z.), Nanning New Technology Entrepreneur Center, Nanning 530007, China 
 School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China; [email protected] (G.H.); [email protected] (Z.Z.); [email protected] (J.L.); [email protected] (M.Z.) 
First page
6474
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3223873149
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.