Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The accuracy of data-driven landslide susceptibility prediction depends heavily on the quality of non-landslide samples and the selection of machine-learning algorithms. Current methods rely on artificial prior knowledge to obtain negative samples from landslide-free regions or outside the landslide buffer zones randomly and quickly but often ignore the reliability of non-landslide samples, which will pose a serious risk of including potential landslides and lead to erroneous outcomes in training data. Furthermore, diverse machine-learning models exhibit distinct classification capabilities, and applying a single model can readily result in over-fitting of the dataset and introduce potential uncertainties in predictions. To address these problems, taking Chenxi County, a hilly and mountainous area in southern China, as an example, this research proposes a strategy-coupling optimised sampling with heterogeneous ensemble machine learning to enhance the accuracy of landslide susceptibility prediction. Initially, 21 landslide impact factors were derived from five aspects: geology, hydrology, topography, meteorology, human activities, and geographical environment. Then, these factors were screened through a correlation analysis and collinearity diagnosis. Afterwards, an optimised sampling (OS) method was utilised to select negative samples by fusing the reliability of non-landslide samples and certainty factor values on the basis of the environmental similarity and statistical model. Subsequently, the adopted non-landslide samples and historical landslides were combined to create machine-learning datasets. Finally, baseline models (support vector machine, random forest, and back propagation neural network) and the stacking ensemble model were employed to predict susceptibility. The findings indicated that the OS method, considering the reliability of non-landslide samples, achieved higher-quality negative samples than currently widely used sampling methods. The stacking ensemble machine-learning model outperformed those three baseline models. Notably, the accuracy of the hybrid OS–Stacking model is most promising, up to 97.1%. The integrated strategy significantly improves the prediction of landslide susceptibility and makes it reliable and effective for assessing regional geohazard risk.

Details

Title
A Novel Strategy Coupling Optimised Sampling with Heterogeneous Ensemble Machine-Learning to Predict Landslide Susceptibility
Author
Lu, Yongxing 1 ; Xu, Honggen 1 ; Wang, Can 2 ; Guanxi Yan 3   VIAFID ORCID Logo  ; Huo, Zhitao 1 ; Peng, Zuwu 4 ; Liu, Bo 5 ; Xu, Chong 6   VIAFID ORCID Logo 

 Changsha General Survey of Natural Resources Center, China Geological Survey, Changsha 410600, China; [email protected] (Y.L.); [email protected] (H.X.); [email protected] (Z.H.) 
 Hunan Institute of Geological Disaster Investigation and Monitoring, Changsha 410004, China 
 School of Civil Engineering, University of Queensland, St. Lucia, QLD 4067, Australia; [email protected] (G.Y.); [email protected] (B.L.) 
 Geological Survey Institute of Hunan Province, Changsha 410014, China 
 School of Civil Engineering, University of Queensland, St. Lucia, QLD 4067, Australia; [email protected] (G.Y.); [email protected] (B.L.); College of Water Conservancy & Hydropower Engineering, Hohai University, Nanjing 210098, China 
 National Institute of Natural Hazards, Ministry of Emergency Management of China, Beijing 100085, China; [email protected] 
First page
3663
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
20724292
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3116659116
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.