Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The encroachment of buildings into the waters of rivers and lakes can lead to increased safety hazards, but current semantic segmentation algorithms have difficulty accurately segmenting buildings in such environments. The specular reflection of the water and boats with similar features to the buildings in the environment can greatly affect the performance of the algorithm. Effectively eliminating their influence on the model and further improving the segmentation accuracy of buildings near water will be of great help to the management of river and lake waters. To address the above issues, the present study proposes the design of a U-shaped segmentation network of buildings called RDAU-Net that works through extraction and fuses a convolutional neural network and a transformer to segment buildings. First, we designed a residual dynamic short-cut down-sampling (RDSC) module to minimize the interference of complex building shapes and building scale differences on the segmentation results; second, we reduced the semantic and resolution gaps between multi-scale features using a multi-channel cross fusion transformer module (MCCT); finally, a double-feature channel-wise fusion attention (DCF) was designed to improve the model’s ability to depict building edge details and to reduce the influence of similar features on the model. Additionally, an HRI Building dataset was constructed, comprising water-edge buildings situated in a riverine and lacustrine regulatory context. This dataset encompasses a plethora of water-edge building sample scenarios, offering a comprehensive representation of the subject matter. The experimental results indicated that the statistical metrics achieved by RDAU-Net using the HRI and WHU Building datasets are better than those of others, and that it can effectively solve the building segmentation problems in the management of river and lake waters.

Details

Title
RDAU-Net: A U-Shaped Semantic Segmentation Network for Buildings near Rivers and Lakes Based on a Fusion Approach
Author
Wang, Yipeng 1 ; Wang, Dongmei 2 ; Xu, Teng 3   VIAFID ORCID Logo  ; Shi, Yifan 2   VIAFID ORCID Logo  ; Liang, Wenguang 2 ; Wang, Yihong 2 ; Petropoulos, George P 4 ; Bao, Yansong 5 

 College of Water Conservancy and Hydropower Engineering, Hohai University, Nanjing 210098, China; [email protected] (Y.W.); [email protected] (T.X.); Jiangsu Hydraulic Research Institute, Nanjing 210017, China; [email protected] (Y.S.); [email protected] (W.L.); [email protected] (Y.W.) 
 Jiangsu Hydraulic Research Institute, Nanjing 210017, China; [email protected] (Y.S.); [email protected] (W.L.); [email protected] (Y.W.) 
 College of Water Conservancy and Hydropower Engineering, Hohai University, Nanjing 210098, China; [email protected] (Y.W.); [email protected] (T.X.) 
 Department of Geography, Harokopio University of Athens, EI. Venizelou 70, 17671 Athens, Greece; [email protected] 
 School of Atmospheric Physics, Nanjing University of Information Science and Technology, Nanjing 210044, China; [email protected] 
First page
2
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
20724292
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3153688016
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.