GIS-Based Soft Computing Models for Landslide

Full text

Turn on search term navigation

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

Landslide occurrences in Himalayan region are 30% of the total landslides in the world. Landslides cause huge loss of economy, damage to property, and loss of life all over the world [1]. Every year casualties in the Himalayan region are about 200 people, and economic loss due to landslides is more than US$ 1 billion [2]. Causes of landslides are natural and anthropogenic. Landslides are affected by geological, topographical, meteorological, and geo-environmental factors. Land-use pattern changes due to anthropogenic activities such as road construction, power houses excavations, and open-pit/cast mining cause landslides [3]. Natural causes of landslides include earthquakes, glacier melting, and rainfall. Currently, impact of the climate change has drastically amplified the landslides phenomenon globally [4, 5].

Landslide susceptibility map is regarded as an integral part of landslide management of a particular region [6, 7]. Such maps find their applications in minimizing landslide risk to human life, man-made infrastructures, particularly network of roads in mountainous regions, and land-use planning [8]. Landslide susceptibility means that the area is susceptible to landslides and probability of occurrence of landslides is there on the basis of past experience under similar geo-environment conditions [9]. There are several statistical techniques aided by Geographic Information System (GIS) and Remote Sensing methods that have been widely employed by many researchers to formulate models in order to generate landslide susceptibility maps. In general, there are two main approaches used in modeling and mapping of the landslide susceptibility, namely, quantitative and qualitative methods [10]. In qualitative methods, expert’s opinion-based models are often used and applied such as Analytical Hierarchy Process (AHP) and Weighted Linear Combination (WLC) [11]. Whereas, quantitative methods can be classified as deterministic and probabilistic methods which involve the mathematical relationship between landslides’ occurrences and their associated affecting factors. Statistical and Machine Learning methods such as Binary Logistic Regression, Information Value, Likelihood Ratio, Logistic Regression (LR), and Multivariate Regression are some the quantitative methods applied for landslide susceptibility modeling and mapping [12–15]. Furthermore, few other algorithms with optimum performance and a higher degree of accuracy used in the landslide susceptibility mapping include Discriminant Analysis, Generalized Additive Models, Evidential Belief Functions, Weighted Linear Combinations, and Weights of Evidence [16–20]. Also, Certainty Factors, Probabilistic Models, Modified Bayesian Estimation, Information Values [21–24], Dempster–Shafer models, and Index of Entropy are also widely used as the statistical quantitative techniques to map the landslide susceptible areas [25–28].

Nowadays, researchers are extensively using Machine Learning (ML) algorithms in studies related to natural hazards including landslide studies, such as Multivariate Adaptive Regression Spline [29], Support Vector Machine (SVM) [30], Classification and Regression Trees [31], Boosted Regression [32], Quadratic Discriminant Analysis [33], Naïve Bayes Tree (NBT) [34], Maximum Entropy [35], Random Forest (RF) [36], Artificial Neural Networks (ANN) [37], and Generalized Linear Model [38]. In addition, Naïve Bayes (NB), Multilayer Perceptron (MLP) neural network classifier, and Alternating Decision Tree (ADT) are widely employed algorithms to investigate landslide susceptibility [39–41]. Recently, Senouci et al. [42] deployed a knowledge-driven approach alongside Analytical Hierarchy Process (AHP) in a GIS-based environment to evaluate the landslide susceptibility map in Algeria. Hong et al. [41] evaluated two-class Kernel Logistic Regression (KLR), SVM, and ADT for landslide susceptibility mapping at the Yihuang area (China). It was found that ADT has better capabilities concerning to the prediction compared to SVM and KLR. Sahana et al. [43] employed the MLP neural network to estimate rainfall induced landslide susceptibility. Chen and Wang [44] compared KLR, NBT, and ADT models for landslide susceptibility assessment at Taibai County, China.

In the present research, the main objective was to use three soft computing ML algorithms, namely, NB, MLP, and ADT-based models, which are popular ML models, to predict the landslide susceptibility of Pithoragarh District of Uttarakhand State, India, which is one of the landslide-prone areas in the Himalayas. In literature, there are several studies have been carried out in landslide susceptibility mapping in Himalayas areas [45–47]. However, the application of ML models is still limited. Moreover, during literature survey, it is noticed that, for the first time, these models have been applied in this area for landslide susceptibility mapping. Thus, the main objective of this study was to apply best ML models for the accurate landslide susceptibility mapping of the study area.

In this study, standard statistical measures including Receiver Operating Characteristic (ROC) curve techniques were used to evaluate performance of the models for the selection of the best model for the accurate landslide susceptibility mapping. The maps produced would be useful for regional planning and disaster management. Weka software was used for the models study and GIS software for the development of thematic maps and visualization.

2. Material and Methods

2.1. Study Area

In the present study, the Pithoragarh district of Uttarakhand, India, has been selected as it is one of the prominent landslide-prone areas of the Himalayas [48]. This area is mountainous, characterized by high hills and deep valleys having elevation ranges between 1500 m and 2500 m gradually increasing from south to north (Figure 1). Two main rivers Sarju and Ramganga flow in this area forming deep valleys/gorges. The region is susceptible to occurrence of landslides on the excavated slopes of road sections particularly on Bansura-Rameshwar Ghat Road, in the northeast part, in the suburbs of Muwani, Dewal Thal, northern Chera, and Gangolihat places. Geologically, the aforementioned area is complex due to presence of folded and faulted rocks, belonging to Almora crystalline (granitoid) group and Garhwal group (shale, slate, phyllite, quartzite, dolomite, limestone, magnesite, occasional calc slate, and metavolcanics). The region witnesses tectonic activities that ultimately lead to folding and faulting of the rocks [49]. Quaternary sediments are located in river valleys, on the hill slopes and also as glacial deposits.

[figures omitted; refer to PDF]

2.2. Methodology

The methodology followed in undertaking the formulation of models in this study is illustrated in the Figure 2, which is self-explanatory. Landslide conditioning factors and landslide inventory were utilized in order to model the data using three ML techniques. Models were validated in terms of degree of accuracy and performance using various indices levels. In this study, data of GSI (http://www.portal.gsi.gov.in), Google Earth images, and Aster DEM were used for the development of thematic maps.

[figure omitted; refer to PDF]

2.3. Data Used

2.3.1. Landslide Inventory

In order to undertake the landslide susceptibility mapping for the study area, it is necessary to address the landslide inventory which indicates the geospatial distribution of several factors including locations, types, and nature of landslides [50]. For the model development, the landslide polygon data represented on map by points was split randomly into ratio of 70 : 30 as the training dataset (70%) and validation dataset (30%), respectively [51, 52]. The training data (70%) was used for landslide susceptibility zoning whereas the remaining 30% testing dataset for the validation was used to check accuracy and robustness of models. This split ratio was selected based on the experience of other similar research studies [51, 52]. In this research, the inventory of 91 landslides marked as points in the map was prepared based on the record of (GSI) (https://www.gsi.gov.in/webcenter/portal) as well as interpretation of the Google Earth images for the modeling (Figures 1 and 3). Most of the identified landslides in the area are translational rock-cum-debris slides.

[figure omitted; refer to PDF]

2.3.2. Landslide Conditioning Factors

In this study, ten impacting parameters, namely, slope degree, slope aspect, slope curvature, elevation, geomorphology, land cover, distance to roads, distance to rivers, overburden depth, and Slope Forming Materials (SFM) were taken into consideration on the basis of the indigenous geo-environmental scenario. For the models’ study, thematic layers of the aforementioned parameters were generated using 30 m cell size with the aid of ArcGIS software. Slope degree, aspect, curvature, drainage, and elevation maps were generated from the Aster DEM collected from the United States Geological Survey (USGS) (https://earthexplorer.usgs.gov). Geological, geomorphological, and SFM maps were extracted from the reports of the GSI (https://www.gsi.gov.in/webcenter/portal). Land cover map was generated using GSI data and Google Earth images. The binary classification method (0’s and 1’s) was employed for modeling landslides that is “1” for the presence of landslide instances and “0” for the absence [53, 54]. Table 1 shows the original sources of the data used. Figure 4 shows the maps of the landslide conditioning factors.

Table 1

Information of data used in this study.

Data used	Resolution	Source
Land cover, geomorphology, overburden depth, and SFM	1: 50,000	GSI report (https://www.gsi.gov.in/webcenter/portal) and Google Earth Image (2020)
Elevation, slope angle, slope aspect, curvature, road, and rivers	30 m	DEM (https://earthexplorer.usgs.gov)

[figures omitted; refer to PDF]

(1) Slope Angle. It is an extremely vital parameter while evaluating the occurrence of landslides. Different types of landslides depend on the slope angle. Most of the landslides are reported between 25 and 45 degree slopes [55]. The slope angle also regulates the surface and subsurface water flow that has direct impact on the runoff and infiltration, thus on landslide occurrences [56]. Here, the slope angle was derived from Aster DEM using the slope tool. Based on natural breaks classification, the slope was divided into five classes (Figure 4(a)).

(2) Slope Aspect. Slope aspect is also deemed as a significant parameter to effectively evaluate the landslide susceptibility that is associated with the direction of the sun, air/wind, and rain/precipitation in the specified area [57, 58]. This factor also indirectly impacts the vegetation and amount of water held in the soil. Here, the slope aspect was derived from Aster DEM. Based on natural breaks classification method, the slope was divided into nine different categories, flat, north, northeast, east, southeast, south, southwest, west, and northwest, using ArcGIS (Figure 4(b)).

(3) Curvature. It is another important factor in the surface runoff and ground infiltration which ultimately impacts the erosion, accumulation, and flow of surface and groundwater [57]. Therefore, the curvature tends to affect the occurrence of landslides to a larger extent. In the current study, the curvature map was generated from DEM and categorized as the concave, convex, and flat surfaces using ArcGIS software (Figure 4(c)).

(4) Elevation. Elevation is considered as an imperative parameter in the occurrence of landslides [56]. Generally, landslides occur in mountainous regions with deep valleys and rugged surfaces. In the hilly area, at higher elevations, usually the intensity of rainfall is less; however, glaciers’ existence and prominence are relatively more [46]. At higher elevations, landslides occur due to melting of ice causing erosion. Moraines also block the passage of streams and rivers at higher elevations forming lakes which sometimes burst and cause sudden floods and landslides in the area.

Majority of the rains and vegetation occur mostly in the range of lower and middle elevations in the Himalayas [45]. Furthermore, the landslide events are hugely associated with the elevation where slopes are moderate to high, moreover in the less vegetation area affected by heavy rainfall [59]. In this study, the elevation map was divided into different classes based on the natural break method for landslide susceptibility analysis (Figure 4(d)).

(5) Geomorphology. Geomorphology is one more vital factor for the evaluation of landslide susceptibility [60]. Geomorphological features encompass undulating grounds, mountains, river terraces, valleys, ridges, and escarpments, which affect the occurrence of landslides alongside other topographic and geo-environmental parameters. The geomorphology map (Figure 4(e)) was extracted from the GSI report (https://www.gsi.gov.in/webcenter/portal).

(7) Land Cover. Generally, bare lands and nonvegetated areas are more susceptible to landslides [61]. In a densely vegetated area, roots of plants reinforce the ground, thereby preventing erosion and thus landslides. Incidences of landslides are also in the cultivated region because of saturation of soil as a result of irrigation and also due to removal of surface soil strata. The land cover map was extracted from the GSI report (https://www.gsi.gov.in/webcenter/portal) and verified from the Google Earth images. The land cover map of the study area comprises of eleven major groups of land cover (Figure 4(f)).

(8) Distance to Roads. Presence of the road network in a region highly impacts the landslide occurrences [62]. Roadside excavation and vegetation removal are actions that lead to landslides during pavement/road construction [63]. Excavation of roads create instability in ground mass resulting in slope failure and landslides adjacent to roads up to certain distances based on the nature of soil and geology. Thus, the distance from the roads is an immensely imperative parameter in the landslide study [64]. In this study, roads were identified and delineated using Google Earth images. The road buffer map at 100 m intervals was generated using ArcGIS tool (Figure 4(g)).

(9) Distance to Rivers. Distance to rivers is of prime importance for the development of landslide susceptibility maps. The hydrological network regime, soil saturation of water sources, and groundwater recharge, alongside the mounting water pressure on the groundmass and slopes, lead to landslides in areas adjacent to water sources, rivers, and streams. Distance to rivers maps (Figure 4(h)) was created from the rivers extracted from DEM using Flow Accumulation tool in GIS and buffered at an interval of 100 m for the landslide susceptibility mapping.

(10) Overburden Depth. Chances of slope failure are more likely in thick overburden areas depending on the characteristics of the overburden material. Major part of the study area is covered thin over burden (1–3 m) with occasional relatively thick pockets of greater than 5 m. Therefore, the possibility of landslides due to failure of the overburden material is very less. However, nature and thickness of the material affect the infiltration, thus ground water conditions in the area which may affect moisture conditions of the underlying rock mass creating instability and thus landslides. The overburden material map was extracted from the report of GSI (https://www.gsi.gov.in/webcenter/portal) (Figure 4(i)).

(11) Slope for Mining Material (SFM). SFM map was extracted from the GSI report (https://www.gsi.gov.in/webcenter/portal). The type of the slope forming material is very important in the landslide. Characteristics of the groundmass depend on the SFM, namely, its permeability, porosity, and geotechnical properties. Types of landslides depends on the above characteristics of the material and also on the binding/looseness of soil and rock mass, and the SFM map of the present study area was classified into seventeen main groups (Figure 4(j)): Group 1 (alluvium); Group 2 (chlorite, schist, and massive amphibolite); Group 3 (dolomite); Group 4 (garnet, mica, schist, and micaceous quartzite); Group 5 (granite, gneiss, garnetiferous schist, and amphibolite); Group 6 (in situ soil); Group 7 (limestone with intercalations of shale); Group 8 (limestone, dolomite, shale, and cherty quartzite); Group 9 (metabasite); Group 10 (older well-compacted debris); Group 11 (phyllite, stromatoliphyllite, and stromatolitic); Group 12 (phyllite, stromatolitic dolomite, limestone, and Cu and Mg min); Group 13 (schist, augen gneiss, quartzite, and amphibolites); Group 14 (slate, lenses of quartzite and dolomite, and epidorite dyke); Group 15 (slate, quartzite, and dolomite with epidiorite dyke); Group 16 (transported soil); Group 17 (transported soil and colluvium).

2.4. Methods Used

In this study, we have used three ML models, namely, NB, ADT, and MLP. Brief description of these models is mentioned in the following sections:

2.4.1. Naïve Bayes (NB)

This model (NB) is a classification model based on Bayes theorem [65]. A variable value on a given class is assumed as independent of another variable by the NB classifier, which is also called class conditional independence. The NB algorithm is simpler to use on various types of datasets [66]. Consider X to be a dataset with no class label. Now, assume H to be a hypothesis that X belongs to a specified class C. We aim to ascertain P(H|X), the probability that the hypothesis H relates with given the observed X. P(H|X) is the posterior probability indicating our confidence in the hypothesis after X is given. The Bayesian theorem offers a way of estimating the posterior probability P(H|X) using probability P(H), P(X), and P(X|H) [67]. The equation is presented as follows: $\begin{matrix} (1) & P H | X = \frac{p H | X . P H}{P X} . \end{matrix}$

Assume that there is a set comprising m samples S = {S1, S2, …, Ms.} where sample Si is denoting as an n-dimensional feature vector {X1, X2, …, XNA} as the training dataset. Values X_i relate to attributes A1, A2, … , An, respectively. Also, k classes c1, c2, …, ck, and every sample belong to one of these classes. In the current research approach, k value is set to two since only slide-prone and nonslide-prone classes exist. Given an additional data sample X whose class is unknown, it is possible to predict X using the highest conditional probability P(Ci|X), where i = 1, 2, …, k. Then, the NB theorem is formulated as $\begin{matrix} (2) & P C_{i} | X = \frac{p X | C_{i} . P C_{i}}{P X} . \end{matrix}$

2.4.2. Multilayer Perceptron (MLP) Neural Network Classifier

This MLP neural network is a classification technique also considered as one type of Artificial Neural network that has been widely in use, for instance, in remote sensing applications for the sake of land cover classification [59, 68]. It encompasses several merits, for instance, partitioning of training data points is done straightforwardly without any prior assumptions, no decision is deemed as fixed concerning the comparative significance of several input measurements, and most of these measurements are selected on the basis of weights adjustment during the training process [69].

The MLP neural network comprises three main layers: input layers, hidden layers, and output layers [70]. The input layers consist of factors that affect landslides, while the output layers yield the classified results that help in classifying landslide or nonlandslide classes. The hidden layers act as classifiers to convert the input into the output form.

Two main steps are involved in the MLP neural nets’ training: initially, the inputs are processed through the hidden layers for obtaining the desired outcomes. Then, the output results are associated to the target values to calculate the specified error, and ultimately, the biases/weights are adjusted to improve the results [71].

Suppose that x = x_i, where i = 1, 2, …, n is a vector comprising n factors affecting the landslides, and y = 1 (representing landslide class) or 0 (non-andslide class). Afterwards, the classification function for the MLP neural network function is illustrated in the form of equation (1): $\begin{matrix} (3) & Y = f x, \end{matrix}$ where f(x) refers to the hidden function which is yielded by the adjustment of biases during the training process of the model for a given architecture of the network.

2.4.3. Alternating Decision Tree (ADT)

ADT refers to an advance version of Decision Trees (DT) having boosting which is capable of generating classifiers with higher accuracy [34, 72]. In literature, it has been shown that ADT-based landslide susceptibility modeling shows superior modeling performance [41, 73]. ADT encompasses dual alternating layers; the layer containing decision nodes, also called splitter nodes, help to specify the condition of predicate, while the second layer contains optional nodes that are also known as the prediction nodes which contain only a single number. The resulting tree always exhibits the prediction nodes in the form of both roots and leaves assuming no interconnection with extradecision nodes. ADT has been employed to segregate the landslide or nonlandslide classes while modeling the landslide susceptibility, in this current study. Also, this tree is capable to generate accurate classifiers which can be interpreted simply in contrast to the variety of DT algorithms, for example, Classification and Regression Trees (CART), and Random Forest (RF) [41, 73].

The classification rules of ADT are based on defining a set of paths at a particular time. When a certain path approaches the decision node, it is accompanied by the off spring that represents the results of the decision linked to the particular node. It is necessary to mention that the path remains with the entire offspring of the node upon approaching the prediction node, thus leading the instance to track a “multipath.” The categorization for the recently produced instance refers to the landslide susceptibility of the individual pixel that can be determined from the sign of summation of the entire values present over predictor nodes along the aforementioned multipath [74, 75]. The basic algorithm of the ADT algorithm is stated below [72].

First of all, it is the weight of an individual training instance that produces a group of weak suppositions. Therefore, the initial estimation is done using equation (2), which also refers to entire weights comprising the positive and negative instances: $\begin{matrix} (4) & a = \frac{1}{2} \ln \frac{W_{+} T}{W_{-} T}, \end{matrix}$ where T represents the precondition and condition of an only base rule.

After that, the precondition c₁ and condition c₂ are used to define $Z_{t} c_{1}, c_{2}$ . In equation (3), ^ and ¬ represent conjunction (AND gate) and negation (NOT gate), respectively: $\begin{matrix} (5) & Z_{t} c_{1}, c_{2} = 2 \sqrt{W_{+} c_{1}^{c_{2}} W_{-} c_{1}^{c_{2}}} + \sqrt{W_{+} c_{1}^{￢ c_{2}} W_{-} c_{1}^{￢ c_{2}}} + W ￢ C_{2}, \end{matrix}$ where a and b are the estimated values which can be defined by equation (4), wherein the optimized c₁ and c₂ values are chosen by reducing $Z_{t} c_{1}, c_{2}$ : $\begin{matrix} (6) & a = \frac{1}{2} \ln \frac{W_{+} T c_{1}^{c_{2}}}{W_{-} T c_{1}^{c_{2}}}, b = \frac{1}{2} \ln \frac{W_{+} T c_{1}^{￢ c_{2}}}{W_{-} T c_{1}^{￢ c_{2}}} . \end{matrix}$

The weights of individual training instances are updated in accordance with equation (5), where x defines a set of instances and y refers to a set of prediction layers: $\begin{matrix} (7) & ω_{i, t + 1} = ω_{i, t} e^{- r_{t} x_{i} y_{t}}, \end{matrix}$ where r denotes a base rule, r(x) denotes the real value which the rule relates with the instance x, t represents boosting iteration, and $ω$ is the weight of examples [72]. Eventually, the classification rule can be achieved as illustrated in equation (8) that represents the sign of the summation of the entire base rules: $\begin{matrix} (8) & class x = sign \sum_{i = 1}^{T} r_{t} x . \end{matrix}$

2.4.4. Validation Methods

(1) Area under the Receiver Operating Characteristic Curve (AUC-ROC). The AUC-ROC curve is used as a prevalent validation technique for landslide susceptibility models [76, 77]. The current study also employed AUC-ROC for evaluating performance of the developed models. The values of sensitivity and 100% specificity were first plotted, and then, AUC was employed for quantitative evaluation [73]. The AUC value of 1 suggests a perfect model, whereas a value <0.5 indicates an incorrect model [78, 79].

(2) Statistical Indices. The common statistical indices for landslide susceptibility models, namely, Kappa (k), Specificity (SPF), Sensitivity (SST), Negative Predictive Value (NPV), Positive Predictive Value (PPV), Root Mean Square Error (RMSE), and Accuracy (ACC) were chosen for validation of the models [80, 81]. Out of these, the reliability of the models for landslide prediction was assessed using k value that varies between “0” and “1” [34]. If k is closer to “1,” it indicates that the landslide models are more reliable, and if the values of Kappa are closer to 0, it indicates that the landslide models are not corrected. Furthermore, SPF and NPV were used to evaluate the goodness of landslide models for classification of nonlandslide pixels, SST and PPV were used for landslide pixels, RMSE indicates the error analysis of the models, and ACC was used to validate the general accuracy of the model. Detailed description and calculation of these indices are presented in the published works [80, 82].

3. Results

3.1. Evaluation of Goodness of Fit of the Models

Using the training dataset, three landslide susceptibility models, namely, NB, MLP, and ADT were developed for spatial landslide prediction and evaluated for the goodness of fit of the models. The statistical evaluation of the developed models has been listed in Table 2 and Figures 5–8. The classification probability of the landslide class i.e., PPV for ADT and MLP is almost similar, which reflects the strong classifying capability of the landslide class as compared to NB. For the NPV, the MLP model indicates that the probability to correctly classify the pixels in the training data related to the nonlandslide class is 85.29%, followed by the ADT model (83.53%) and the NB model (80%).

Table 2

Accuracy analysis of the models for the training dataset.

Parameters		PPV (%)	NPV (%)	SST (%)	SPF (%)	ACC (%)	Kappa (k)	MAE	RMSE
Training dataset	NB	74.71	80	78.88	75.98	77.35	0.547	0.292	0.416
	MLP	79.41	85.29	84.38	80.56	82.35	0.647	0.271	0.364
	ADT	80	83.53	82.93	80.68	81.76	0.635	0.329	0.372

[figures omitted; refer to PDF]

The MLP model shows the highest SST (84.38%) demonstrating that the landslide points are accurately classified to the landslide class though in comparison to the ADT and NB models which are 82.93% and 78.88%, respectively. The maximum SPF is for the ADT model describing that 80.68% nonlandslide points are accurately classified to the nonlandslide class. MLP and NB models showed the SPF of 80.56% and 75.98%, respectively.

The MLP model manifests the highest values of the Kappa index of 0.647, followed by ADT (0.6353) and NB (0.547) models representing a substantial agreement between the predictions and reality (Figure 5). The minimum value of RMSE for the training data was observed as 0.364 for the MLP model, followed by 0.372 and 0.416 for ADT and NB models, respectively (Figure 6). The highest value of ACC is for the MLP model (82.35%), followed by ADT (81.76%), and NB (77.35%) (Table 2). From the ROC curve analysis, it can be seen that the ADT model received the highest value of AUC (0.902), followed by the MLP (0.886) and the NB (0.802) (Figure 8). From these results, it can be stated that all models have a high goodness of fit with the data used, but the MLP and ADT are slightly better than NB.

3.2. Validation of Predictive Capability of the Models

Using the testing dataset, the predictive capability of the landslide susceptibility models was tested. The results are illustrated in Tables 3 and Figures 5–8. The maximum ACC in the validation dataset is for the MLP model (83.55%), followed by the NB model (79.61%) and the ADT model (78.95%). The MLP model has the maximum PPV (81.58%), signifying the probability to accurately classify pixels to the landslide class at 81.58%. The value of PPV for ADT is similar to MLP, whereas it is 80.26% for the NB model (Table 2). The highest probability to classify pixels to the nonlandslide class is also for the MLP model (85.53%). The MLP model has the highest SST (84.93%) indicating that 84.93% of the landslide points are correctly classified into the landslide class whereas the NB and ADT can accurately classify 79.22% and 77.50%, respectively. SPF is 82.28% for the MLP model supporting 82.28% nonlandslide points which are correctly categorized to the nonlandslide class whereas this value is 80% and 80.56% for NB and ADT, respectively. Figure 7 also shows the RMSE values of the models. The Kappa indices of 0.647, 0.635, and 0.547 for the MLP, the ADT, and the NB models, respectively, reveal a significant agreement between observed and anticipated landslides (Figure 5). The AUC-ROC results indicate that all three models have good prediction capabilities with the maximum for the ADT model (AUC = 0.902), followed by the MLP model (AUC = 0.886) and NB model (AUC = 0.829) (Figure 8). Overall, all three models have demonstrated close agreement between the predicted and actual values. However, the NB and MLP models showed better performance in the validation datasets comparatively.

Table 3

Accuracy analysis of the models for the testing dataset.

Parameters		PPV (%)	NPV (%)	SST (%)	SPF (%)	ACC (%)	Kappa (k)	MAE	RMSE
Testing dataset	NB	80.26	78.95	79.22	80	79.61	0.592	0.269	0.375
	MLP	81.58	85.53	84.93	82.28	83.55	0.671	0.304	0.385
	ADT	81.58	76.32	77.5	80.56	78.95	0.579	0.363	0.407

3.3. Construction of Landslide Susceptibility Maps

Results of the studied landslide models were used to develop landslide susceptibility maps. Initially, landslide susceptibility indices (LSI) were created from the training of the models, and each pixel was assigned a distinctive LSI. In the subsequent step, the geometrical intervals’ (GIs) method was used to reclassify LSI into different ranges [83]. On the basis of these intervals, five different susceptible classes were identified for developing landslide susceptibility maps, namely, very low, low, moderate, high, and very high (Figure 9).

[figures omitted; refer to PDF]

Landslide susceptibility maps developed on the basis of the models were validated by Frequency Ratio (FR) analysis of each susceptible class for each model on the maps (Figure 10). FR for each susceptible class on the map was obtained from equation (9). Each class represents a ratio of the percentage of landslide pixels (P_LS %) to the percentage of all pixels of class (P_P %): $\begin{matrix} (9) & FR = \frac{P_{LS}}{P_{p}} . \end{matrix}$

[figures omitted; refer to PDF]

It can be observed from the Figure 10 that the map produced by MLP and ADT models are the most reliable as they received the highest value of FR on the very high class compared with another model (NB).

4. Discussion

Landslide susceptibility maps are deemed as a final product that can assist authorities in land-use planning. The development of these maps is generally considered as the first important step in landslide hazard and risk evaluation for better landslide management [84, 85]. So, it is desirable to develop and use a number of models which can forecast correctly landslide susceptible areas. Since the precision of a variety of suggested methods for landslide susceptibility modeling is under discussion, investigations of existing and novel techniques for this purpose are very important to reach some satisfactory conclusions [86]. The recent advancement in the field of GIS and ML techniques has been helpful in performance of the models [87]. Some advanced innovative ML methods, for instance, ADT and KLR have been employed in several fields with high accuracy [88–91]. ML models such as NB, MLP, and ADT have also been employed in some landslide susceptibility studies [40, 73]. However, the literature lacks the comparison of these three models in the assessment of landslide susceptibility at the study area. This issue is addressed in the current research by analyzing and comparing the NB, ADT, and MLP techniques for landslide susceptibility modeling in the study area of Pithoragarh District of Uttarakhand State, India.

Overall, all three applied models performed well for landslide susceptibility mapping. However, the performance of the ADT and MLP models is slightly better than the NB model. It is reasonable as both of the ADT and MLP are advanced and popular ML models with many advantages. In the case of the ADT, it has the handling of large input data with the proper assessment of the accuracy of the output [92]. In addition, in the ADT, each decision rule can be analyzed in isolation as it is known explicitly [93]. With the case of MLP, it has an adapting learning, which is able to learn how to do the tasks on the given data. However, in this case, no assumption is made about the pattern classes for the underlying probability density functions [94].

Comparison with the published works shows that the modeling results of this study are comparative with other published works. In the study of Thai Pham et al. [76] for landslide susceptibility of Luc Yen district of northern part of Viet Nam, it obtained the AUC value of 0.827 for the ADT model. Similarly, Hong et al. [41] obtained the AUCs’ equals of 0.957, 0.925, and 0.888 for the ADT, the KLR, and the SVM models. Hong et al. [41] evaluated the values of Kappa indices as 0.741, 0.702, and 0.626 for the ADT, the KLG, and the SVM models, respectively, indicating a substantial agreement between the models and the reality. Pham et al. [95] also compared MLP with NB for landslide susceptibility mapping and stated that the MLP (AUC = 0.850) is slightly better than the NB (AUC = 0.838). In [73], it was also stated that the ADT is slightly better than NBT, which is a hybrid model of NB and DT, for landslide susceptibility modeling. From these comparisons, we can state that although the ML models have good accuracy prediction for landslide susceptibility, however, their predictive capability is different from case to case, which depends on the data used for each model. In addition, we insisted that the performance of the ML models depends on the selection of the hyperparameters used to train the models. In this work, the hyperparameters of the applied models are selected by the trial-error process as presented in Table 4.

Table 4

Hyperparameters used in the applied models of this study.

Hyperparameters	Models
Hyperparameters	NB	MLP	ADT
Batch size	100	100	100
Debug	False	False	False
Display model in old format	False	—	—
Do not check capabilities	False	False	False
Num decimal places	2	2	2
Use kernel estimator	False	—	—
Use supervised discretization	False	—	—
Activation function	—	Approximate sigmoid	—
Loss function	—	Squared error	—
Num function	—	2	—
Num threads	—	1	—
Pool size	—	1	—
Ridge	—	0.01	—
Seed	—	1	1
Tolerance	—	1.0E-6	—
Use CGD	—	False	—
Num of booting iterations	—	—	10
Resume	—	—	False
Save instance data	—	—	False
Search path	—	—	Expand all paths

5. Concluding Remarks

In this study, we compared three popular ML models, namely, NB, MLP, and ADT for landslide susceptibility modeling and mapping at the Pithoragarh District, Uttarakhand State, India. Several validation indices including the AUC-ROC curve were used for validation and comparison of the models. The results infer that all the three models performed well for landslide susceptibility mapping. However, the ADT and MLP are slightly better than NB. Therefore, it can be concluded that the ADT and MLP models are promising techniques that could be used for landslide susceptibility modeling and mapping. The developed maps would be useful in identifying future landslide-prone areas for the better landslide management, prevention of causalities and damages to infrastructure, and proper land-use planning of the area. Model development is a continuous process. Therefore, it is proposed to carryout similar studies by comparing these models with other soft computing models in other areas also for its wider application.

Acknowledgments

The authors thank the University of Transport Technology for supporting this research and also GSI and USGS for their useful website for scientific/academic work.

References

[1] T. Li, Landslide Management in the Mountain Areas of China, 1990.

[2] D. P. Kanungo, M. K. Arora, R. P. Gupta, S. Sarkar, "Landslide risk assessment using concepts of danger pixels and fuzzy set theory in Darjeeling Himalayas," Landslides, vol. 5 no. 4, pp. 407-416, DOI: 10.1007/s10346-008-0134-3, 2008.

[3] P. De Vita, P. Reichenbach, J. C. Bathurst, M. Borga, G. Crosta, M. Crozier, T. Glade, F. Guzzetti, A. Hansen, J. Wasowski, "Rainfall-triggered landslides: a reference list," Environmental Geology, vol. 35 no. 2-3, pp. 219-233, DOI: 10.1007/s002540050308, 1998.

[4] S. L. Gariano, F. Guzzetti, "Landslides in a changing climate," Earth-Science Reviews, vol. 162, pp. 227-252, DOI: 10.1016/j.earscirev.2016.08.011, 2016.

[5] J. Corominas, J. Moya, "A review of assessing landslide frequency for hazard zoning purposes," Engineering Geology, vol. 102 no. 3-4, pp. 193-213, DOI: 10.1016/j.enggeo.2008.03.018, 2008.

[6] Y. Huang, L. Zhao, "Review on landslide susceptibility mapping using support vector machines," Catena, vol. 165, pp. 520-529, DOI: 10.1016/j.catena.2018.03.003, 2018.

[7] J. Roy, S. Saha, A. Arabameri, T. Blaschke, D. T. Bui, "A novel ensemble approach for landslide susceptibility mapping (LSM) in Darjeeling and Kalimpong districts, West Bengal, India," Remote Sensing, vol. 11 no. 23,DOI: 10.3390/rs11232866, 2019.

[8] B. C. Ozer, B. Mutlu, H. A. Nefeslioglu, "On the use of hierarchical fuzzy inference systems (HFIS) in expert-based landslide susceptibility mapping: the central part of the Rif Mountains (Morocco)," Bulletin of Engineering Geology and the Environment, vol. 79 no. 1, pp. 551-568, DOI: 10.1007/s10064-019-01548-5, 2020.

[9] R. Roslee, A. C. Mickey, N. Simon, M. N. Norhisham, "Landslide susceptibility analysis LSA using weighted overlay method wom along the genting sempah to bentong highway pahang," Malaysian Journal Geosciences, vol. 1 no. 2, pp. 13-19, DOI: 10.26480/mjg.02.2017.13.19, 2017.

[10] P. Tsangaratos, I. Ilia, H. Hong, W. Chen, C. Xu, "Applying Information Theory and GIS-based quantitative methods to produce landslide susceptibility maps in Nancheng County, China," Landslides, vol. 14 no. 3, pp. 1091-1111, DOI: 10.1007/s10346-016-0769-4, 2017.

[11] B. Ahmed, "Landslide susceptibility mapping using multi-criteria evaluation techniques in Chittagong Metropolitan Area, Bangladesh," Landslides, vol. 12 no. 6, pp. 1077-1095, DOI: 10.1007/s10346-014-0521-x, 2015.

[12] P. Wongsasuluk, S. Chotpantarat, W. Siriwong, M. Robson, "Using hair and fingernails in binary logistic regression for bio-monitoring of heavy metals/metalloid in groundwater in intensively agricultural areas, Thailand," Environmental Research, vol. 162, pp. 106-118, DOI: 10.1016/j.envres.2017.11.024, 2018.

[13] Y. Achour, A. Boumezbeur, R. Hadji, A. Chouabbi, V. Cavaleiro, E. A. Bendaoud, "Landslide susceptibility mapping using analytic hierarchy process and information value methods along a highway road section in Constantine, Algeria," Arabian Journal of Geosciences, vol. 10 no. 8,DOI: 10.1007/s12517-017-2980-6, 2017.

[14] K. Kavoura, N. Sabatakakis, "Investigating landslide susceptibility procedures in Greece," Landslides, vol. 17 no. 1, pp. 127-145, DOI: 10.1007/s10346-019-01271-y, 2020.

[15] A. Arabameri, B. Pradhan, K. Rezaei, M. Sohrabi, Z. Kalantari, "GIS-based landslide susceptibility mapping using numerical risk factor bivariate model and its ensemble with linear multivariate regression and boosted regression tree algorithms," Journal of Mountain Science, vol. 16 no. 3, pp. 595-618, DOI: 10.1007/s11629-018-5168-y, 2019.

[16] B. T. Pham, I. Prakash, "Evaluation and comparison of logitboost ensemble, fisher’s linear discriminant analysis, logistic regression and support vector machines methods for landslide susceptibility mapping," Geocarto International, vol. 34 no. 3, pp. 316-333, DOI: 10.1080/10106049.2017.1404141, 2019.

[17] L. Dahoua, S. V. Yakovitch, R. Hadji, Z. Farid, "Landslide susceptibility mapping using analytic hierarchy process method in BBA-bouira region, case study of East-West Highway, NE Algeria," Euro-Mediterranean Conference for Environmental Integration, 2017.

[18] P. Kadavi, C.-W. Lee, S. Lee, "Application of ensemble-based machine learning models to landslide susceptibility mapping," Remote Sensing, vol. 10 no. 8,DOI: 10.3390/rs10081252, 2018.

[19] A. M. S. Pradhan, Y.-T. Kim, "Spatial data analysis and application of evidential belief functions to shallow landslide susceptibility mapping at Mt. Umyeon, Seoul, Korea," Bulletin of Engineering Geology and the Environment, vol. 76 no. 4, pp. 1263-1279, DOI: 10.1007/s10064-016-0919-x, 2017.

[20] Z. Fang, Y. Wang, L. Peng, H. Hong, "A comparative study of heterogeneous ensemble-learning techniques for landslide susceptibility mapping," International Journal of Geographical Information Science, vol. 35 no. 2, pp. 321-347, DOI: 10.1080/13658816.2020.1808897, 2021.

[21] Q. Wang, Y. Guo, W. Li, J. He, Z. Wu, "Predictive modeling of landslide hazards in Wen County, northwestern China based on information value, weights-of-evidence, and certainty factor," Geomatics, Natural Hazards and Risk, vol. 10 no. 1, pp. 820-835, DOI: 10.1080/19475705.2018.1549111, 2019.

[22] B. T. Pham, C. Luu, T. V. Phong, H. D. Nguyen, H. Van Le, T. Quoc Tran, H. Thu Ta, I. Prakash, "Flood risk assessment using hybrid artificial intelligence models integrated with multi-criteria decision analysis in Quang nam Province, Vietnam," Journal of Hydrology, vol. 592,DOI: 10.1016/j.jhydrol.2020.125815, 2021.

[23] E. J. M. Carranza, M. Hale, "Geologically constrained probabilistic mapping of gold potential, Baguio district, Philippines," Natural Resources Research, vol. 9 no. 3, pp. 237-253, DOI: 10.1023/a:1010147818806, 2000.

[24] F. Guzzetti, R. Paola, A. Francesca, C. Mauro, G. Mirco, "Estimating the quality of landslide susceptibility models," Geomorphology, vol. 81 no. 1-2, pp. 166-184, DOI: 10.1016/j.geomorph.2006.04.007, 2006.

[25] B. Pradhan, A. Chaudhari, J. Adinarayana, M. F. Buchroithner, "Soil erosion assessment and its correlation with landslide events using remote sensing data and GIS: a case study at Penang Island, Malaysia," Environmental Monitoring and Assessment, vol. 184 no. 2, pp. 715-727, DOI: 10.1007/s10661-011-1996-8, 2012.

[26] S. Wan, T. C. Lei, T. Y. Chou, "Optimized object-based image classification: development of landslide knowledge decision support system," Arabian Journal of Geosciences, vol. 7 no. 5, pp. 2059-2070, DOI: 10.1007/s12517-013-0952-z, 2014.

[27] M. H. Tangestani, "A comparative study of dempster-shafer and fuzzy models for landslide susceptibility mapping using a GIS: an experience from Zagros Mountains, SW Iran," Journal of Asian Earth Sciences, vol. 35 no. 1, pp. 66-73, DOI: 10.1016/j.jseaes.2009.01.002, 2009.

[28] B. Neuhäuser, B. Terhorst, "Landslide susceptibility assessment using “weights-of-evidence” applied to a study area at the Jurassic escarpment (SW-Germany)," Geomorphology, vol. 86 no. 1-2, pp. 12-24, DOI: 10.1016/j.geomorph.2006.08.002, 2007.

[29] L. Chu, L.-J. Wang, J. Jiang, X. Liu, K. Sawada, J. Zhang, "Comparison of landslide susceptibility maps using random forest and multivariate adaptive regression spline models in combination with catchment map units," Geosciences Journal, vol. 23 no. 2, pp. 341-355, DOI: 10.1007/s12303-018-0038-8, 2019.

[30] S. Lee, S.-M. Hong, H.-S. Jung, "A support vector machine for landslide susceptibility mapping in Gangwon Province, Korea," Sustainability, vol. 9 no. 1,DOI: 10.3390/su9010048, 2017.

[31] Á. M. Felicísimo, A. Cuartero, J. Remondo, E. Quirós, "Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study," Landslides, vol. 10 no. 2, pp. 175-189, DOI: 10.1007/s10346-012-0320-1, 2013.

[32] S. Park, J. Kim, "Landslide susceptibility mapping based on random forest and boosted regression tree models, and a comparison of their performance," Applied Sciences, vol. 9 no. 5,DOI: 10.3390/app9050942, 2019.

[33] G. Wang, X. Chen, W. Chen, "Spatial prediction of landslide susceptibility based on gis and discriminant functions," ISPRS International Journal of Geo-Information, vol. 9 no. 3,DOI: 10.3390/ijgi9030144, 2020.

[34] D. Tien Bui, P. Biswajeet, L. Owe, R. Inge, "Landslide susceptibility assessment in vietnam using support vector machines, decision tree, and Naive Bayes Models," Mathematical Problems in Engineering, vol. 2012,DOI: 10.1155/2012/974638, 2012.

[35] A. Kornejady, M. Ownegh, A. Bahremand, "Landslide susceptibility assessment using maximum entropy model with two different data sampling methods," Catena, vol. 152, pp. 144-162, DOI: 10.1016/j.catena.2017.01.010, 2017.

[36] H. Hong, Y. Miao, J. Liu, A.-X. Zhu, "Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping," Catena, vol. 176, pp. 45-64, DOI: 10.1016/j.catena.2018.12.035, 2019.

[37] H. Moayedi, M. Mehrabi, M. Mosallanezhad, A. S. A. Rashid, B. Pradhan, "Modification of landslide susceptibility mapping using optimized PSO-ANN technique," Engineering with Computers, vol. 35 no. 3, pp. 967-984, DOI: 10.1007/s00366-018-0644-0, 2019.

[38] D. C. Camilo, L. Lombardo, P. M. Mai, J. Dou, R. Huser, "Handling high predictor dimensionality in slope-unit-based landslide susceptibility models through LASSO-penalized generalized linear model," Environmental Modelling & Software, vol. 97, pp. 145-156, DOI: 10.1016/j.envsoft.2017.08.003, 2017.

[39] V.-H. Nhu, A. Shirzadi, H. Shahabi, "Shallow landslide susceptibility mapping: a comparison between logistic model tree, logistic regression, naïve bayes tree, artificial neural network, and support vector machine algorithms," International Journal of Environmental Research and Public Health, vol. 17 no. 8,DOI: 10.3390/ijerph17082749, 2020.

[40] M. Zare, H. R. Pourghasemi, M. Vafakhah, B. Pradhan, "Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms," Arabian Journal of Geosciences, vol. 6 no. 8, pp. 2873-2888, DOI: 10.1007/s12517-012-0610-x, 2013.

[41] H. Hong, B. Pradhan, C. Xu, D. Tien Bui, "Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines," Catena, vol. 133, pp. 266-281, DOI: 10.1016/j.catena.2015.05.019, 2015.

[42] R. Senouci, N.-E. Taibi, A. C. Teodoro, L. Duarte, H. Mansour, R. Yahia Meddah, "GIS-based expert knowledge for landslide susceptibility mapping (LSM): case of mostaganem coast district, west of Algeria," Sustainability, vol. 13 no. 2,DOI: 10.3390/su13020630, 2021.

[43] M. Sahana, B. T. Pham, M. Shukla, C. Romulus, D. X. Thu, R. Chakrabortty, N. Satyam, H. D. Nguyen, T. V. Phong, H. V. Le, S. Chandra Pal, G. Areendran, K. Imdad, I. Prakash, "Rainfall induced landslide susceptibility mapping using novel hybrid soft computing methods based on multi-layer perceptron neural network classifier," Geocarto International,DOI: 10.1080/10106049.2020.1837262, 2020.

[44] Z. Chen, J. Wang, "Landslide hazard mapping using logistic regression model in Mackenzie Valley, Canada," Natural Hazards, vol. 42 no. 1, pp. 75-89, DOI: 10.1007/s11069-006-9061-6, 2007.

[45] A. K. Saha, R. P. Gupta, I. Sarkar, M. K. Arora, E. Csaplovics, "An approach for GIS-based statistical landslide susceptibility zonation?with a case study in the Himalayas," Landslides, vol. 2 no. 1, pp. 61-69, DOI: 10.1007/s10346-004-0039-8, 2005.

[46] I. Das, S. Sahoo, C. van Westen, A. Stein, R. Hack, "Landslide susceptibility assessment using logistic regression and its comparison with a rock mass classification system, along a road section in the Northern Himalayas (India)," Geomorphology, vol. 114 no. 4, pp. 627-637, DOI: 10.1016/j.geomorph.2009.09.023, 2010.

[47] S. Sarkar, A. K. Roy, T. R. Martha, "Landslide susceptibility assessment using information value method in parts of the Darjeeling Himalayas," Journal of the Geological Society of India, vol. 82 no. 4, pp. 351-362, DOI: 10.1007/s12594-013-0162-z, 2013.

[48] A. Dikshit, R. Sarkar, B. Pradhan, S. Segoni, A. M. Alamri, "Rainfall induced landslide studies in Indian Himalayan region: a critical review," Applied Sciences, vol. 10 no. 7,DOI: 10.3390/app10072466, 2020.

[49] S. Das, "Uttarakhand tragedy," Journal of the Geological Society of India, vol. 82 no. 2,DOI: 10.1007/s12594-013-0141-4, 2013.

[50] C. Xu, X. Xu, F. Dai, "Application of an incomplete landslide inventory, logistic regression model and its validation for landslide susceptibility mapping related to the May 12, 2008 Wenchuan earthquake of China," Natural Hazards, vol. 68 no. 2, pp. 883-900, DOI: 10.1007/s11069-013-0661-7, 2013.

[51] T. Chen, L. Zhu, R.-Q. Niu, C. J. Trinder, L. Peng, T. Lei, "Mapping landslide susceptibility at the Three Gorges Reservoir, China, using gradient boosting decision tree, random forest and information value models," Journal of Mountain Science, vol. 17 no. 3, pp. 670-685, DOI: 10.1007/s11629-019-5839-3, 2020.

[52] G. Wang, X. Lei, W. Chen, H. Shahabi, A. Shirzadi, "Hybrid computational intelligence methods for landslide susceptibility mapping," Symmetry, vol. 12 no. 3,DOI: 10.3390/sym12030325, 2020.

[53] S. V. Razavi-Termeh, A. Sadeghi-Niaraki, S.-M. Choi, "Ubiquitous GIS-based forest fire susceptibility mapping using artificial intelligence methods," Remote Sensing, vol. 12 no. 10,DOI: 10.3390/rs12101689, 2020.

[54] A. M. Youssef, H. R. Pourghasemi, "Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at abha basin, asir region, Saudi Arabia," Geoscience Frontiers, vol. 12 no. 2, pp. 639-655, DOI: 10.1016/j.gsf.2020.05.010, 2021.

[55] M. Kasai, M. Ikeda, T. Asahina, K. Fujisawa, "LiDAR-derived DEM evaluation of deep-seated landslides in a steep and rocky region of Japan," Geomorphology, vol. 113 no. 1-2, pp. 57-69, DOI: 10.1016/j.geomorph.2009.06.004, 2009.

[56] J.-H. Lee, M. I. Sameen, B. Pradhan, H.-J. Park, "Modeling landslide susceptibility in data-scarce environments using optimized data mining and statistical methods," Geomorphology, vol. 303, pp. 284-298, DOI: 10.1016/j.geomorph.2017.12.007, 2018.

[57] H. Hong, B. Pradhan, M. I. Sameen, W. Chen, C. Xu, "Spatial prediction of rotational landslide using geographically weighted regression, logistic regression, and support vector machine models in Xing Guo area (China)," Geomatics, Natural Hazards and Risk, vol. 8 no. 2, pp. 1997-2022, DOI: 10.1080/19475705.2017.1403974, 2017.

[58] X. Yu, H. Gao, "A landslide susceptibility map based on spatial scale segmentation: a case study at Zigui-Badong in the Three Gorges Reservoir Area, China," PLoS One, vol. 15 no. 3,DOI: 10.1371/journal.pone.0229818, 2020.

[59] B. T. Pham, D. Tien Bui, I. Prakash, M. B. Dholakia, "Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS," Catena, vol. 149, pp. 52-63, DOI: 10.1016/j.catena.2016.09.007, 2017.

[60] K. Shirani, M. Pasandi, A. Arabameri, "Landslide susceptibility assessment by dempster-shafer and index of entropy models, sarkhoun basin, southwestern Iran," Natural Hazards, vol. 93 no. 3, pp. 1379-1418, DOI: 10.1007/s11069-018-3356-2, 2018.

[61] S. Beguería, "Changes in land cover and shallow landslide activity: a case study in the Spanish Pyrenees," Geomorphology, vol. 74 no. 1-4, pp. 196-206, DOI: 10.1016/j.geomorph.2005.07.018, 2006.

[62] Z. Liu, G. Gilbert, J. M. Cepeda, A. O. Kydland Lysdahl, L. Piciullo, H. Hefre, S. Lacasse, "Modelling of shallow landslides with machine learning algorithms," Geoscience Frontiers, vol. 12 no. 1, pp. 385-393, DOI: 10.1016/j.gsf.2020.04.014, 2021.

[63] M. I. Sameen, R. Sarkar, B. Pradhan, D. Drukpa, A. M. Alamri, H.-J. Park, "Landslide spatial modelling using unsupervised factor optimisation and regularised greedy forests," Computers & Geosciences, vol. 134,DOI: 10.1016/j.cageo.2019.104336, 2020.

[64] H. R. Pourghasemi, B. Pradhan, C. Gokceoglu, M. Mohammadi, H. R. Moradi, "Application of weights-of-evidence and certainty factor models and their comparison in landslide susceptibility mapping at Haraz watershed, Iran," Arabian Journal of Geosciences, vol. 6 no. 7, pp. 2351-2365, DOI: 10.1007/s12517-012-0532-7, 2013.

[65] C. Borgelt, M. Steinbrecher, R. R. Kruse, Graphical Models: Representations for Learning, Reasoning and Data Mining, 2009.

[66] C. Catal, U. Sevim, B. Diri, "Practical development of an Eclipse-based software fault prediction tool using Naive Bayes algorithm," Expert Systems with Applications, vol. 38 no. 3, pp. 2347-2353, DOI: 10.1016/j.eswa.2010.08.022, 2011.

[67] T. Shatovskaya, V. Repka, A. Good, "Application of the bayesian networks in the informational modeling," ,DOI: 10.1109/tcset.2006.4404462, .

[68] C. Zhang, X. Pan, H. Li, G. Andy, S. Isabel, H. Jonathon, P. M. Atkinson, "A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 140, pp. 133-144, DOI: 10.1016/j.isprsjprs.2017.07.014, 2018.

[69] M. W. Gardner, S. Dorling, "Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences," Atmospheric Environment, vol. 32 no. 14-15, pp. 2627-2636, DOI: 10.1016/s1352-2310(97)00447-0, 1998.

[70] G. Panchal, A. Ganatra, Y. P. Kosta, D. Panchal, "Behaviour analysis of multilayer perceptronswith multiple hidden neurons and hidden layers," International Journal of Computer Theory and Engineering, vol. 3 no. 2, pp. 332-337, DOI: 10.7763/ijcte.2011.v3.328, 2011.

[71] D. T. Bui, B. Pradhan, I. Revhaug, D. B. Nguyen, H. V. Pham, Q. N. Bui, "A novel hybrid evidential belief function-based fuzzy logic model in spatial prediction of rainfall-induced shallow landslides in the Lang Son city area (Vietnam)," Geomatics, Natural Hazards and Risk, vol. 6 no. 3, pp. 243-271, DOI: 10.1080/19475705.2013.843206, 2015.

[72] Y. Freund, L. Mason, The Alternating Decision Tree Learning Algorithm, 1999.

[73] W. Chen, X. Xie, J. Peng, J. Wang, Z. Duan, H. Hong, "GIS-based landslide susceptibility modelling: a comparative assessment of kernel logistic regression, Naïve-Bayes tree, and alternating decision tree models," Geomatics, Natural Hazards and Risk, vol. 8 no. 2, pp. 950-973, DOI: 10.1080/19475705.2017.1289250, 2017.

[74] T. Van Gestel, B. Baesens, J. Suykens, M. Espinoza, D.-E. Baestaens, J. Vanthienen, B. De Moor, "Bankruptcy prediction with least squares support vector machine classifiers," ,DOI: 10.1109/CIFER.2003.1196234, .

[75] B. T. Pham, D. Tien Bui, M. B. Dholakia, I. Prakash, H. V. Pham, "A comparative study of least square support vector machines and multiclass alternating decision trees for spatial prediction of rainfall-induced landslides in a tropical cyclones area," Geotechnical & Geological Engineering, vol. 34 no. 6, pp. 1807-1824, DOI: 10.1007/s10706-016-9990-0, 2016.

[76] B. Thai Pham, D. Tien Bui, I. Prakash, "Landslide susceptibility modelling using different advanced decision trees methods," Civil Engineering and Environmental Systems, vol. 35 no. 1-4, pp. 139-157, DOI: 10.1080/10286608.2019.1568418, 2018.

[77] S. Beguería, "Validation and evaluation of predictive models in hazard assessment and risk management," Natural Hazards, vol. 37 no. 3, pp. 315-329, DOI: 10.1007/s11069-005-5182-6, 2006.

[78] H. Hong, J. Liu, D. T. Bui, B. Pradhan, T. D. Acharya, B. Thai Pham, A.-X. Zhu, W. Chen, B. Bin Ahmad, "Landslide susceptibility mapping using J48 decision tree with AdaBoost, bagging and rotation forest ensembles in the guangchang area (China)," Catena, vol. 163, pp. 399-413, DOI: 10.1016/j.catena.2018.01.005, 2018.

[79] D. T. Bui, T.-C. Ho, B. Pradhan, B.-T. Pham, V.-H. Nhu, I. Revhaug, "GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with adaboost, bagging, and multiboost ensemble frameworks," Environmental Earth Sciences, vol. 75 no. 14,DOI: 10.1007/s12665-016-5919-4, 2016.

[80] V.-H. Nhu, A. Mohammadi, H. Shahabi, A. Baharin Bin, A.-A. Nadhir, S. Ataollah, G. Marten, R. K. Victoria, K. Sadra, K. Khalil Valizadeh, W. Chen, H. Nguyen, "Landslide detection and susceptibility modeling on cameron highlands (Malaysia): a comparison between random forest, logistic regression and logistic model tree algorithms," Forests, vol. 11 no. 8,DOI: 10.3390/f11080830, 2020.

[81] D.-H. Lee, Y.-T. Kim, S.-R. Lee, "Shallow landslide susceptibility models based on artificial neural networks considering the factor selection method and various non-linear activation functions," Remote Sensing, vol. 12 no. 7,DOI: 10.3390/rs12071194, 2020.

[82] P. Yariyan, S. Janizadeh, T. Van Phong, H. D. Nguyen, R. Costache, V. L. Hiep, T. P. Binh, P. Biswajeet, J. P. Tiefenbacher, "Improvement of best first decision trees using bagging and dagging ensembles for flood probability mapping," Water Resources Management, vol. 34 no. 9, pp. 3037-3053, DOI: 10.1007/s11269-020-02603-7, 2020.

[83] N. Kazakis, K. S. Voudouris, "Groundwater vulnerability and pollution risk assessment of porous aquifers to nitrate: modifying the DRASTIC method using quantitative parameters," Journal of Hydrology, vol. 525, pp. 13-25, DOI: 10.1016/j.jhydrol.2015.03.035, 2015.

[84] R. Fell, C. Jordi, B. Christophe, C. Leonardo, L. Eric, W. Z. Savage, "Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning," Engineering Geology, vol. 102 no. 3-4, pp. 99-111, DOI: 10.1016/j.enggeo.2008.03.014, 2008.

[85] G. Manzo, V. Tofani, S. Segoni, A. Battistini, F. Catani, "GIS techniques for regional-scale landslide susceptibility assessment: the Sicily (Italy) case study," International Journal of Geographical Information Science, vol. 27 no. 7, pp. 1433-1452, DOI: 10.1080/13658816.2012.693614, 2013.

[86] D. Tien Bui, T. A. Tuan, H. Klempe, B. Pradhan, I. Revhaug, "Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree," Landslides, vol. 13 no. 2, pp. 361-378, DOI: 10.1007/s10346-015-0557-6, 2016.

[87] I. H. Witten, E. Frank, "Data mining," Acm Sigmod Record, vol. 31 no. 1, pp. 76-77, DOI: 10.1145/507338.507355, 2002.

[88] C.-J. F. Chung, A. G. Fabbri, C. J. Van Westen, "Multivariate regression analysis for landslide hazard zonation," Geographical Information Systems in Assessing Natural Hazards, pp. 107-133, DOI: 10.1007/978-94-015-8404-3_7, 1995.

[89] R. T. Guy, P. Santago, C. D. Langefeld, "Bootstrap aggregating of alternating decision trees to detect sets of SNPs that associate with disease," Genetic Epidemiology, vol. 36 no. 2, pp. 99-106, DOI: 10.1002/gepi.21608, 2012.

[90] K.-Y. Liu, J. Lin, X. Zhou, S. T. C. Wong, "Boosting alternating decision trees modeling of disease trait information," BMC Genetics, vol. 6 no. Suppl_1,DOI: 10.1186/1471-2156-6-s1-s132, 2005.

[91] M. Maalouf, T. B. Trafalis, "Robust weighted kernel logistic regression in imbalanced and rare events data," Computational Statistics & Data Analysis, vol. 55 no. 1, pp. 168-183, DOI: 10.1016/j.csda.2010.06.014, 2011.

[92] A. Arabameri, E. Karimi-Sangchini, S. C. Pal, Asish Saha, Indrajit Chowdhuri, Saro Lee, Dieu Tien Bui, "Novel credal decision tree-based ensemble approaches for predicting the landslide susceptibility," Remote Sensing, vol. 12 no. 20,DOI: 10.3390/rs12203389, 2020.

[93] M. P.-L. Ooi, H. K. Sok, Y. C. Kuang, S. Demidenko, "Alternating decision trees," Handbook of Neural Computation, pp. 345-371, DOI: 10.1016/b978-0-12-811318-9.00019-3, 2017.

[94] D. S. Jayas, J. Paliwal, N. S. Visen, "Review paper (AE-Automation and emerging technologies)," Journal of Agricultural Engineering Research, vol. 77 no. 2, pp. 119-128, DOI: 10.1006/jaer.2000.0559, 2000.

[95] B. T. Pham, T. B. Dieu, P. Hamid Reza, I. Prakash, M. B. Dholakia, "Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods," Theoretical and Applied Climatology, vol. 128 no. 1-2, pp. 255-273, DOI: 10.1007/s00704-015-1702-9, 2017.

Word count: 8731

Show less

Copyright © 2021 Trung-Hieu Tran et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

Translate

The main objective of the study was to investigate performance of three soft computing models: Naïve Bayes (NB), Multilayer Perceptron (MLP) neural network classifier, and Alternating Decision Tree (ADT) in landslide susceptibility mapping of Pithoragarh District of Uttarakhand State, India. For this purpose, data of 91 past landslide locations and ten landslide influencing factors, namely, slope degree, curvature, aspect, land cover, slope forming materials (SFM), elevation, distance to rivers, geomorphology, overburden depth, and distance to roads were considered in the models study. Thematic maps of the Geological Survey of India (GSI), Google Earth images, and Aster Digital Elevation Model (DEM) were used for the development of landslide susceptibility maps in the Geographic Information System (GIS) environment. Landslide locations data was divided into a 70 : 30 ratio for the training (70%) and testing/validation (30%) of the three models. Standard statistical measures, namely, Positive Predicted Values (PPV), Negative Predicted Values (NPV), Sensitivity, Specificity, Mean Absolute Error (MAE), Root Mean Squire Error (RMSE), and Area under the ROC Curve (AUC) were used for the evaluation of the models. All the three soft computing models used in this study have shown good performance in the accurate development of landslide susceptibility maps, but performance of the ADT and MLP is better than NB. Therefore, these models can be used for the construction of accurate landslide susceptibility maps in other landslide-prone areas also.

Details

Title

GIS-Based Soft Computing Models for Landslide Susceptibility Mapping: A Case Study of Pithoragarh District, Uttarakhand State, India

Author

Tran, Trung-Hieu¹; Nguyen Duc Dam¹

; Jalal, Fazal E²; Al-Ansari, Nadhir³

; Lanh Si Ho⁴; Tran Van Phong⁵

; Iqbal, Mudassir⁶; Hiep Van Le¹; Hanh Bich Thi Nguyen¹; Prakash, Indra⁷

; Binh Thai Pham¹

¹ University of Transport Technology, Ha Noi 100000, Vietnam
² Department of Civil Engineering, State Key Laboratory of Ocean Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
³ Department of Civil, Environmental and Natural Resources Engineering, Lulea University of Technology, Lulea 971 87, Sweden
⁴ University of Transport Technology, Ha Noi 100000, Vietnam; Civil and Environmental Engineering Program, Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima, Hiroshima 739-8527, Japan
⁵ Institute of Geological Sciences, Vietnam Academy of Sciences and Technology, 84 Chua Lang Street, Dong Da, Hanoi 100000, Vietnam
⁶ Department of Civil Engineering, State Key Laboratory of Ocean Engineering, Shanghai Jiao Tong University, Shanghai 200240, China; Department of Civil Engineering, University of Engineering and Technology, Peshawar, Pakistan
⁷ DDG (R) Geological Survey of India, Gandhinagar 382010, India

Editor

Ana C Teodoro

Publication year

2021

Publication date

2021

Publisher

John Wiley & Sons, Inc.

ISSN

1024123X

e-ISSN

15635147

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1155/2021/9914650

ProQuest document ID

2569267459

GIS-Based Soft Computing Models for Landslide Susceptibility Mapping: A Case Study of Pithoragarh District, Uttarakhand State, India

Jump to:

Full text

Abstract

Details

Suggested sources