Next Article in Journal
An Ultra-Wideband Microwave Photonic Channelized Receiver with Zero-IF Architecture
Next Article in Special Issue
Determination of the Stability of High-Steep Slopes by Global Navigation Satellite System (GNSS) Real-Time Monitoring in Long Wall Mining
Previous Article in Journal
Strongly Focused Circularly Polarized Optical Vortices Regulated by a Fractal Conical Lens
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Prediction of Landslides Using Hybrid Integration of Artificial Intelligence Algorithms with Frequency Ratio and Index of Entropy in Nanzheng County, China

1
Key Laboratory of Mine Geological Hazard Mechanism and Control, Shaanxi Institute of Geo-Environment Monitoring, Xi′an 710054, China
2
College of Geology & Environment, Xi′an University of Science and Technology, Xi′an 710054, China
3
Key Laboratory of Coal Resources Exploration and Comprehensive Utilization, Ministry of Natural Resources, Xi’an 710021, China
4
Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
*
Authors to whom correspondence should be addressed.
Submission received: 8 November 2019 / Revised: 29 November 2019 / Accepted: 4 December 2019 / Published: 19 December 2019
(This article belongs to the Special Issue Assessment of Landslide Susceptibility and Hazard in the Big Data Era)

Abstract

:
The main object of this study is to introduce hybrid integration approaches that consist of state-of-the-art artificial intelligence algorithms (SysFor) and two bivariate models, namely the frequency ratio (FR) and index of entropy (IoE), to carry out landslide spatial prediction research. Hybrid integration approaches of these two bivariate models and logistic regression (LR) were used as benchmark models. Nanzheng County was considered as the study area. First, a landslide distribution map was produced using news reports, interpreting satellite images and a regional survey. A total of 202 landslides were identified and marked. According to the previous studies and local geological environment conditions, 16 landslide conditioning factors were chosen for landslide spatial prediction research: elevation, profile curvature, plan curvature, slope angle, slope aspect, stream power index (SPI), topographic wetness index (TWI), sediment transport index (STI), distance to roads, distance to rivers, distance to faults, lithology, rainfall, soil, normalized different vegetation index (NDVI), and land use. Then, the 202 landslides were randomly segmented into two parts with a ratio of 70:30. Seventy percent of the landslides (141) were used as the training dataset and the remaining landslides (61) were used as the validating dataset. Next, the evaluation models were built using the training dataset and compared by the receiver operating characteristics (ROC) curve. The results showed that all models performed well; the FR_SysFor model exhibited the best prediction ability (0.831), followed by the IoE_SysFor model (0.819), IoE_LR model (0.702), FR_LR model (0.696), IoE model (0.691), and FR model (0.681). Overall, these six models are practical tools for landslide spatial prediction research and the results can provide a reference for landslide prevention and control in the study area.

1. Introduction

Landslides, as one of the most frequent geological disasters, have caused casualties, property damage and a series of geological environment problems [1,2]. According to the statistics report of the China Institute of Geo-environmental Monitoring [3], a total of 2966 geological disasters occurred in 2018, including 1631 landslides, resulting in 112 deaths and a direct financial loss of CNY 1.47 billion. Hence, to minimize the damage caused by the landslides, it is necessary to carry out landslide susceptibility research in a region [1,4,5]. Generally, landslide susceptibility can be roughly defined as the probability of landslide occurrence in a certain area under the coupling action of a series of geological environmental factors and human activities [6,7,8,9]. Currently, many approaches have been applied in landslide susceptibility research and their results have been applied to landslide prevention [10,11,12]. According to previous studies [13,14,15,16], the approaches applied in landslide susceptibility assessment can be roughly divided into three types: statistical models, machine learning models and hybrid integration models. The commonly used statistical models are statistical index (SI) [17,18], index of entropy (IoE) [19,20], frequency ratio (FR) [21,22,23], weights of evidence (WoE) [24,25], certainty factor (CF) [26,27], evidential belief function (EBF) [28,29], analytical hierarchy process (AHP) [30,31,32], and logistical regression (LR) [33].
As the prediction ability of the above models has needed to be improved, machine learning methods have been introduced, such as support vector machine (SVM) [34,35,36], random forest (RF) [37,38], adaptive neuro-fuzzy inference systems (ANFIS) [39,40], artificial neural network (ANN) [41,42,43], decision tree (DT) [44,45], and classification and regression tree (CART) [46,47]. Many researchers have gradually found that the prediction ability of a single model is limited [48]. Therefore, it is necessary to develop hybrid integration approaches, such as support vector machine–evidential belief function (SVM_EBF) [49], decision tree–weights of evidence (DT_WoE) [50], adaptive neuro-fuzzy inference systems–statistical index (ANFIS_SI) [51], frequency ratio–logistic regression (FR-LR) [23,52], rotation forest–naive Bayes tree (RF_NBT) [53], and random subspace–classification and regression tree (RS_CART) [54].
In order to obtain an appropriate model that is suitable for the study area, this paper introduces an advanced artificial intelligence algorithm (SysFor) with a frequency ratio and index of entropy to carry out a landslide susceptibility assessment. A logistic regression model is used as the benchmark model to compare the performance of the hybrid models.

2. Methodology

2.1. Frequency Ratio

The frequency ratio (FR) is defined as the ratio between the percentage of landslides and the percentage of pixels within one class [20,55]. The relationship between landslide and the factors is stronger when the frequency ratio is larger than 1 [56,57]. The frequency ratio is calculated by Equation (1).
F R = N L S p i x 1 n N L S p i x N C p i x 1 n N C p i x
where NLSpix is the number of landslides, and NCpix is number of pixels of a class.

2.2. Index of Entropy

The second model is the entropy index (IoE), which is based on the bivariate analysis principle [58,59]. The model can calculate the weight of each input variable, and the weight can show which variable is most relevant to the occurrence of landslides in the natural environment [20]. The weights of each variable obtained are taken as the entropy index [60]. The equations used to calculate the weight of each variable are shown as Equations (2)–(8).
P i j = b a
( P i j ) = P i j j = 1 S j P i j
H j = i = 1 S j ( P i j ) log 2 ( P i j ) j = 1 , 2 , 3 , , n
H j max = log 2 S j
I j = H j max H j H j max , I = ( 0 , 1 ) j = 1 , 2 , 3 , , n
P j = 1 S j i = 1 S j P i j
W j = I j × P j
where a is the percentage of the defined domain; b is the landslide percentage; Pij is the probability density; Hjmax and Hj are both entropy values; Sj is the number of classes; Ij is the information coefficient; and Wj is the weight for the variable as a whole.

2.3. SysFor

The systematically developed forest of multiple trees (SysFor) is a data mining algorithm that is based on the concept of the gain ratio and was proposed by Islam and Giggins in 2011 [61,62]. Compared with the commonly used techniques, the SysFor applies both high-dimensional and low-dimensional data sets. In addition, it also shows a higher prediction accuracy than the common techniques [63]. Generally, the SysFor is built with the following four steps: (1) According to the user-defined gain ratio and separation value, a set of good attributes and their segmentation points are identified. (2) If the number of good attributes is smaller than the number of user-defined trees, each attribute is selected as the root attribute of the tree, and the number of trees built is equal to the number of good attributes. (3) If the number of trees generated in step 2 is less than the number of user-defined trees, more trees are built. (4) All trees built in steps (2) and (3) are returned as the SysFor.

2.4. Logistic Regression

Logistic regression is a multivariable method [64,65,66]. The main goal of a logistic regression model is to obtain the most suitable method to determine whether there are landslides using particular variables [47,67]. The relationship between landslide occurrence and the variables is shown in Equation (9).
P = 1 1 + exp ( Z )
where P is the probability of landslide occurrence, and Z is the linear sum, which is obtained by the product of independent variables and their coefficients. The calculation of Z is shown in Equation (10).
Z = α + β 1 x 1 + β 2 x 2 + β 3 x 3 + + β n x n
where α is a constant; βi (i = 1, 2, 3… n) are the coefficients; and xi (i = 1, 2, 3… n) are the independent variables.

3. Study Area and Data Used

The study area (Nanzheng County) lies in Shaanxi Province, China (Figure 1). It is located between longitude 106°30′ and 107°22′ E and latitude 32°24′ to 33°07′ N. It covers an area of 2823 square kilometers. The altitude of the study area is between 442 and 2410 m. The study area has a subtropical monsoon climate, with an annual average temperature of 14.2 °C. Rainfall is mainly concentrated from June to September and the mean annual rainfall is 909.8 mm.
A landslide inventory can give insight into landslide location, dates, type, and damage caused [68,69,70]. In this study, the landslide inventory map was prepared on the basis of historical landslide records and satellite images (Google Earth and ZY03 images). A total of 202 landslides were identified, including 190 slides and 12 rock falls [71]. The largest landslide was more than 1,000,000 m3, and the smallest landslide was nearly 160 m3 [70]. Finally, 141 landslides were randomly selected as training and validation datasets with a ratio of 70:30 (Figure 1).
There is no clear agreement about the precise cause of landslides due to their complex nature and development. According to the latest relevant research and local geological environment characteristics [72,73,74,75,76], 16 landslide conditioning factors have been compiled: elevation, profile curvature, plan curvature, slope angle, slope aspect, stream power index (SPI), topographic wetness index (TWI), sediment transport index (STI), distance to roads, distance to rivers, distance to faults, lithology, rainfall, soil, normalized different vegetation index (NDVI), and land use. The 16 landslide susceptibility conditioning factors were converted with a resolution of 30 × 30 m.
Elevation influences earth surface and topographic attributes which account for spatial variability of precipitation, soil thickness, erosion, and vegetation [38]. Profile curvature and plan curvature have significant effects on surface runoff and infiltration [77]. Slope angle is an important factor that controls the velocity of the slopes, and the slope aspect has an important effect on rainfall, wind and sunlight exposure [78]. SPI, TWI and STI are related to soil water content status, water accumulation and progress of erosion and sedimentation in a watershed, which influence landslide stability [77,78]. In the present study, the elevation map was acquired from a 30 × 30 m digital elevation model (DEM) and reclassified into 10 classes: 442–600 m, 600–800 m, 800–1000 m, 1000–1200 m, 1200–1400 m, 1400–1600 m, 1600–1800 m, 1800–2000 m, 2000–2200 m, and 2200–2410 m (Figure 2a). Similarly, the profile curvature, plan curvature, slope angle, slope aspect, SPI, TWI, and STI were also extracted from the DEM by ArcGIS. In the study, the profile curvature was grouped into three classes of −14.28 to −0.05, −0.05 to 0.05, and 0.05 to 14.77 (Figure 2b). The plan curvature was divided into three categories of −14.0 to −0.05, −0.05 to 0.05, and 0.05 to 13.07 (Figure 2c). The slope angle was grouped into seven categories: 0–10°, 10–20°, 20–30°, 30–40°, 40–50°, 50–60°, and 60–72.83° (Figure 2d). The slope aspect was reclassified into nine classes (Figure 2e). The SPI values were grouped into five classes: <20, 20–40, 40–60, 60–80, and >80 (Figure 2f). The TWI was reclassified into five categories: <4, 4–5, 5–6, 6–7, and >7 (Figure 2g). The STI values were divided into five classes: <10, 10–20, 20–30, 30–40, and >40 (Figure 2h).
Distance to roads, distance to rivers and distance to faults are three commonly used landslide conditioning factors, these factors are related to the infiltration and strength of slopes [79,80]. Distance to roads, distance to rivers and distance to faults are divided into five categories by the equal distance method, as shown in Figure 2i–k, respectively. Lithology is one of the most important restrictive factors in landslide susceptibility evaluation; areas with highly resistant rocks or highly permeable subsoil material have low drainage density [81], which are natural factors essential to determine landslide occurrence [82,83]. The geological map of this study was extracted from a geological map with 1:1,000,000 scale. The study area has 12 different lithological units, which are shown in Figure 2l. Rainfall is a widely recognized landslide-inducing factor, which not only increases the weight of the slope, but also reduces soil strength [84,85]. From Figure 2m, it can be seen that the rainfall in the middle of the study area is significantly higher than that in the north and south. The rainfall map was reclassified with an interval of 100 mm/yr. Soil is the material composition of the slope, and different soils have different physical and mechanical properties. Soil type was extracted from the geological map with a 1:1,000,000 scale, as shown in Figure 2n, and there are nine kinds of soils in the study area. NDVI is a measure of surface reflectance and gives a quantitative estimate of biomass and the vegetation growth [38,86,87,88]. The NDVI values in this paper were reclassified into five categories: −0.21 to 0.21, 0.21 to 0.36, 0.36 to 0.44, 0.44 to 0.52, and 0.52 to 0.65 (Figure 2o). Land use is considered as the direct manifestation of human activities impacting on landslide probability. Land use types control the amount of infiltration and surface runoff generation [77]. In general, landslides are concentrated in human active areas [89,90]. In this study, a land use map was extracted from regional land use maps with a 1:100,000 scale. Land use was divided into farmland, forestland, grassland, water, residential areas, and bare land (Figure 2p).

4. Results

As the LR method was employed to produce the landslide susceptibility map, it is necessary to analyze the multicollinearity of landslide conditioning factors [91]. Currently, the tolerance (TOL) and variance inflation factor (VIF) are mostly used in multicollinearity analysis [92,93]. Generally, a TOL < 0.1 or a VIF > 10 indicate multicollinearity [72]. In the study, the results indicate that there are no multicollinearities among the 16 landslide conditioning factors (Table 1).

4.1. Application of FR Model

The frequency ratio (FR) is a commonly used method of univariate probability analysis in landslide susceptibility assessment [21,25]. When the frequency ratio is 1, it represents the average value; when the frequency ratio is greater than 1, it indicates that the factor has a strong correlation with the occurrence of the landslide; when the frequency ratio is less than 1, it indicates that the correlation between the factor and the occurrence of the landslide is weak. The landslide susceptibility index (LSI) calculated by the FR model can be expressed as Equation (11), as shown in Table 2.
L S I F R = ( E l e v a t i o n F R ) + ( P r o f i l e   c u r v a t u r e F R ) + ( P l a n   c u r v a t u r e F R ) + ( S l o p e   a n g l e F R ) + ( S l o p e   a s p e c t F R ) + ( S P I F R ) + ( T W I F R ) + ( S T I F R ) + ( D i s t a n c e   t o   r o a d s F R ) + ( D i s t a n c e   t o   r i v e r s F R ) + ( D i s t a n c e   t o   f a u l t s F R ) + ( L i t h o l o g y F R ) + ( R a i n f a l l F R ) + ( S o i l F R ) + ( N D V I F R ) + ( L a n d u s e F R )

4.2. Application of IoE Model

The index of entropy (IoE) is based on the bivariate analysis principle. The weight of each conditioning factor in landslides can be calculated by this method. From Table 2, it can be seen that soil has the highest weight (0.326), followed by land use (0.311), elevation (0.223), slope angle (0.146), NDVI (0.128), lithology (0.117), rainfall (0.111), profile curvature (0.083), slope aspect (0.075), distance to roads (0.065), TWI (0.046), STI (0.024), distance to faults (0.022), plan curvature (0.014), SPI (0.01), and distance to rivers (0.003). The LSI calculated by the IoE model can be represented as Equation (12).
L S I I o E = ( E l e v a t i o n F R 0.223 ) + ( P r o f i l e   c u r v a t u r e F R 0.083 ) + ( P l a n   c u r v a t u r e F R 0.014 ) + ( S l o p e   a n g l e F R 0.146 ) + ( S l o p e   a s p e c t F R 0.075 ) + ( S P I F R 0.010 ) + ( T W I F R 0.046 ) + ( S T I F R 0.024 ) + ( D i s t a n c e   t o   r o a d s F R 0.065 ) + ( D i s t a n c e   t o   r i v e r s F R 0.003 ) + ( D i s t a n c e   t o   f a u l t s F R 0.022 ) + ( L i t h o l o g y F R 0.117 ) + ( R a i n f a l l F R 0.111 ) + ( S o i l F R 0.326 ) + ( N D V I F R 0.128 ) + ( L a n d u s e F R 0.311 )

4.3. Application of Hybrid Models

Compared with a single model, a hybrid model has higher predictive ability when dealing with high-dimensional problems [94,95,96]. In this study, the SysFor algorithm was combined with FR and IoE models. During the modeling process, the following parameters were used for hybrid FR_SysFor and IoE_SysFor models: separation, 0.3; confidence, 0.25; goodness, 0.3; number of trees built in the forest, 500.
The hybrid integration of LR with FR and IoE was also applied as a benchmark model to build landslide susceptibility maps. A forward stepwise LR was adopted, and the analysis results are given in Table 3. The landslide occurrence probability P of FR_LR and IoE_LR models can be expressed using Equations (13)–(16), respectively.
Z F R L R = ( E l e v a t i o n F R 0.596 ) + ( P r o f i l e   c u r v a t u r e F R 0.005 ) + ( P l a n   c u r v a t u r e F R 1.390 ) + ( S l o p e   a n g l e F R 0.182 ) + ( S l o p e   a s p e c t F R 0.633 ) + ( S P I F R 0.370 ) + ( T W I F R 0.729 ) + ( S T I F R 0.381 ) + ( D i s t a n c e   t o   r o a d s F R 0.436 ) + ( D i s t a n c e   t o   r i v e r s F R ( 0.722 ) ) + ( D i s t a n c e   t o   f a u l t s F R ( 0.060 ) ) + ( L i t h o l o g y F R ( 0.138 ) ) + ( R a i n f a l l F R 0.607 ) + ( S o i l F R ( 0.030 ) ) + ( N D V I F R 0.275 ) + ( L a n d u s e F R 0.399 ) 5.702
P F R L R = e Z F R L R 1 + e Z F R L R
Z I o E L R = ( E l e v a t i o n I o E 1.978 ) + ( P r o f i l e   c u r v a t u r e I o E ( 0.626 ) ) + ( P l a n   c u r v a t u r e I o E 72.424 ) + ( S l o p e   a n g l e I o E 1.042 ) + ( S l o p e   a s p e c t I o E 9.670 ) + ( S P I I o E 66.354 ) + ( T W I I o E 1.570 ) + ( S T I I o E 7.543 ) + ( D i s t a n c e   t o   r o a d s I o E 5.933 ) + ( D i s t a n c e   t o   f a u l t s I o E ( 4.887 ) ) + ( L i t h o l o g y I o E ( 1.090 ) ) + ( R a i n f a l l I o E 5.141 ) + ( S o i l I o E 0.008 ) + ( N D V I I o E 5.007 ) + ( L a n d u s e I o E 1.260 ) 6.400
P I o E L R = e Z I o E L R 1 + e Z I o E L R
Finally, all the landslide susceptibility maps were reclassified into five categories using the equal area classification method [92,93]: very high (5%), high (10%), moderate (15%), low (20%), and very low (50%) (Figure 3).

4.4. Validation of Landslide Susceptibility Maps

Validating the landslide susceptibility map (LSM) and determining its accuracy is an important aspect of landslide susceptibility research. The verification of landslide susceptibility maps is very important. Without it, the rigor of the study is greatly weakened, and the results of the study have no scientific significance [97,98,99]. In consequence, three commonly used statistical parameters, the receiver operating characteristics (ROC) curve [100,101,102] and the area under the curve (AUC) [103,104,105], standard error, and 95% confidence interval are introduced to verify the landslide susceptibility maps. It can be seen that the FR_SysFor model acquired the highest AUC value (0.940) of all hybrid models in the training data, followed by the IoE_SysFor model (0.926), IoE_LR model (0.783) and FR_LR model (0.779) (Figure 4). With the exception of the AUC value, the other statistical parameters in Table 4 and Table 5 show similar results; the FR_SysFor model obtained the smallest SE (0.0132) and 95% CI (0.906, 0.965), followed by the IoE_SysFor, IoE_LR and FR_LR models. The validating data showed similar results to the training data, and the FR_SysFor model achieved the highest AUC value (0.831) and the smallest SE (0.0388) and 95% CI (0.753, 0.893).
In addition, the success rate curves and prediction rate curves of the landslide susceptibility maps were also assessed (Figure 5 and Figure 6). It can be seen that the FR_SysFor model recognized a highly susceptible area with a success rate curve that contains more than 80% of landslides. Similarly, the highly susceptible area with the prediction rate curve recognized by the FR_SysFor model also includes more than 70% of landslides, which illustrates that the precision of the FR_SysFor model is the highest and that the FR_SysFor model is the best model in this study.

5. Discussion

Currently, some novel ensemble techniques have been proposed in landslide susceptibility mapping, and the excellent performance of ensemble techniques has been proven [92,106,107,108,109]. Furthermore, the hybrid integration of machine learning algorithms with bivariate statistical models can weaken the hypotheses of the conventional bivariate models and retain the merits of bivariate statistical models and machine learning models [110].
Compared with SysFor, FR, IoE and LR are three common evaluation models which have the merits of model stabilization, higher accuracy and simple calculation. Razavizadeh et al. proposed a GIS-based landslide susceptibility mapping with frequency ratio, statistical index, and weights of evidence models for a part of Mazandaran Province, Iran. Both success rate curve and prediction rate curve demonstrated that the frequency ratio (FR) is a reliable model with the highest accuracy. Moreover, the landslide susceptibility map generated by the FR model is trustworthy for hazard mitigation strategies [25]. Pourghasemi et al. introduced the index of entropy (IoE) and conditional probability models to landslide susceptibility research in Safarood Basin, Iran. The results indicated that both models have good predictive capacity, while the IoE performed slightly better than the conditional probability model in landslide susceptibility mapping [111]. Abedini et al. applied logistic regression (LR) and AHP models to landslide susceptibility assessment. The results indicated that the LR is a suitable model to classify and estimate the probability of landslide occurrence in the process of project research planning and implementation [112].
In the present study, for the class of 600–800 m of elevation (ratio value = 2.650), distance to roads <300 m (ratio value = 2.126) and 1100–1200 mm/yr of rainfall (ratio value = 2.354) facilitated the occurrence of landslides. The results based on the IoE model showed that soil (0.326), land use (0.311), elevation (0.223), and slope angle (0.143) are the most important factors, which are closely related to the occurrence and spatial distribution of landslides compared with other factors. On the contrary, distance to faults, plane curvature, SPI, and distance to rivers achieved the four lowest weight values, which were 0.022, 0.014, 0.003, and 0.014, respectively. However, the classifications of landslide conditioning factors were based on previous studies and might not be suitable for the present study. Therefore, further studies should be conducted to find an objective classification method for landslide conditioning factors [113,114]. In order to obtain more reliable landslide susceptibility maps, the significant differences of six landslide susceptibility methods were calculated and compared. According to the relevant research [58,115], the most common and effective methods include the receiver operating characteristics (ROC) curve, standard error (SE), 95% confidence interval (CI), and Wilcoxon signed-rank tests. Hence, the paper introduced three well known statistical parameters—the ROC curve, SE and 95% CI—to calculate and compare the model performance. For the training data, the results of the ROC curve, SE and 95% CI can be seen in Table 4. The FR_SysFor model performed best and acquired the highest AUC value (0.940), followed by IoE_SysFor, IoE_LR, FR_LR, FR, and IoE models with AUC values of 0.926, 0.783, 0.779, 0.757, and 0.746, respectively. Similarly, the two other statistical parameters showed the same results; the FR_SysFor model obtained the smallest SE and 95% CI. For the validating data, the parameters of the ROC curve, SE and 95% CI are shown in Table 5. The validating data showed that the FR_SysFor model had higher accuracy (AUC = 0.831) than the remaining models and performed best in the research. Meanwhile, the smallest SE and 95% CI also belonged to the FR_SysFor model.
Overall, based on the FR and IoE models, combined with SysFor and LR models, the landslide susceptibility of Nanzheng County was studied. As described above, the FR_SysFor model performed the best in this study as compared to other models. Finally, the ensembles of FR and IoE with the proposed algorithm (SysFor) can provide a reference for landslide susceptibility research in other areas.
In practice, landslide hazard managers can employ the FR_SysFor model to determine regions with high and very high susceptibility in the study area. An early warning system for landslide occurrence can transmit useful awareness and warning information for residents living in these areas [107]. Furthermore, the landslide susceptibility map of the present study can help to construct retaining wall systems and anchor systems to enhance slope stability [107].

6. Conclusions

In this case study, six landslide susceptibility evaluation methods, namely, the FR, IoE, FR_SysFor, IoE_SysFor, FR_LR, and IoE_LR models, were systematically analyzed and compared as part of landslide susceptibility research in Nanzheng Country (China). Based on previous research and the geological environmental characteristics in the study area, 16 conditioning factors were selected for the research: elevation, profile curvature, plan curvature, slope angle, slope aspect, SPI, TWI, SPI, distance to roads, distance to rivers, distance to faults, lithology, rainfall, soil, NDVI, and land use. These models were applied to the calculation of landslide occurrence probability for the first time. Finally, the model performances were compared by the statistical parameters of ROC curves, AUC values, SE and 95% CI. The results show that all the models perform well. Compared with other models, the prediction capability of the FR_SysFor model is the highest, with a success rate of 0.940 and prediction rate of 0.831. Hence, the FR_SysFor model is considered the most promising technique in this study. The results can provide a reference for land-use planning and decision-making in the study area.

Author Contributions

W.C., L.F., C.L., and B.T.P. contributed equally to the work. W.C. collected field data and conducted the landslide susceptibility mapping and analysis. W.C. wrote and revised the manuscript. L.F., C.L., and B.T.P. edited the manuscript. All the authors discussed the results and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Key Laboratory of Mine Geological Hazard Mechanism and Control, Shaanxi Provincial Natural Resources Department (Program No. KF2018-05), the National Natural Science Foundation of China (41807192), Natural Science Basic Research Program of Shaanxi (Program No. 2019JLM-7, Program No. 2019JQ-094), China Postdoctoral Science Foundation (Grant No. 2018T111084, 2017M613168), and Project funded by Shaanxi Province Postdoctoral Science Foundation (Grant No. 2017BSHYDZZ07).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wang, Z.; Hu, Z.; Liu, H.; Gong, H.; Zhao, W.; Yu, M.; Zhang, M. In Application of the relief degree of land surface in landslide disasters susceptibility assessment in china. In Proceedings of the 2010 18th International Conference on Geoinformatics, Beijing, China, 18–20 June 2010; pp. 1–5. [Google Scholar]
  2. Wu, Y.; Liu, X.; Wang, J.A.; Liu, L.; Shi, P. Landslide and debris flow disasters in china. In Natural Disasters in China; Shi, P., Ed.; Springer: Berlin/Heidelberg, Germany, 2016; pp. 73–101. [Google Scholar]
  3. Available online: http://www.cigem.cgs.gov.cn/ (accessed on 17 April 2019).
  4. Adineh, F.; Motamedvaziri, B.; Ahmadi, H.; Moeini, A. Landslide susceptibility mapping using genetic algorithm for the rule set production (garp) model. J. Mt. Sci. 2018, 15, 2013–2026. [Google Scholar] [CrossRef]
  5. Ahmed, B. Landslide susceptibility modelling applying user-defined weighting and data-driven statistical techniques in cox’s bazar municipality, bangladesh. Nat. Hazards 2015, 79, 1707–1737. [Google Scholar] [CrossRef]
  6. Acharya, T.D.; Lee, D.H. Landslide susceptibility mapping using relative frequency and predictor rate along araniko highway. KSCE J. Civ. Eng. 2019, 23, 763–776. [Google Scholar] [CrossRef]
  7. Ilia, I.; Tsangaratos, P. Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map. Landslides 2016, 13, 379–397. [Google Scholar] [CrossRef]
  8. Ko, F.W.Y.; Lo, F.L.C. From landslide susceptibility to landslide frequency: A territory-wide study in Hong Kong. Eng. Geol. 2018, 242, 12–22. [Google Scholar] [CrossRef]
  9. Guo, C.; Qin, Y.; Ma, D.; Xia, Y.; Chen, Y.; Si, Q.; Lu, L. Ionic composition, geological signature and environmental impacts of coalbed methane produced water in China. Energy Sources Part A Recovery Util. Environ. Eff. 2019, 1–15. [Google Scholar] [CrossRef]
  10. Meten, M.; Bhandary, N.P.; Yatabe, R. Gis-based frequency ratio and logistic regression modelling for landslide susceptibility mapping of debre sina area in central ethiopia. J. Mt. Sci. 2015, 12, 1355–1372. [Google Scholar] [CrossRef]
  11. Pham, B.T.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at himalayan area (India) using gis. CATENA 2017, 149, 52–63. [Google Scholar] [CrossRef]
  12. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  13. Hussin, H.Y.; Zumpano, V.; Reichenbach, P.; Sterlacchini, S.; Micu, M.; van Westen, C.; Bălteanu, D. Different landslide sampling strategies in a grid-based bi-variate statistical susceptibility model. Geomorphology 2016, 253, 508–523. [Google Scholar] [CrossRef]
  14. Pham, B.T.; Pradhan, B.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of uttarakhand area (India). Environ. Model. Softw. 2016, 84, 240–250. [Google Scholar] [CrossRef]
  15. Zhu, A.X.; Miao, Y.; Wang, R.; Zhu, T.; Deng, Y.; Liu, J.; Yang, L.; Qin, C.-Z.; Hong, H. A comparative study of an expert knowledge-based model and two data-driven models for landslide susceptibility mapping. CATENA 2018, 166, 317–327. [Google Scholar] [CrossRef]
  16. He, Q.; Shahabi, H.; Shirzadi, A.; Li, S.; Chen, W.; Wang, N.; Chai, H.; Bian, H.; Ma, J.; Chen, Y.; et al. Landslide spatial modelling using novel bivariate statistical based naïve bayes, rbf classifier, and rbf network machine learning algorithms. Sci. Total Environ. 2019, 663, 1–15. [Google Scholar] [CrossRef] [PubMed]
  17. Bui, D.T.; Lofman, O.; Revhaug, I.; Dick, O. Landslide susceptibility analysis in the hoa binh province of vietnam using statistical index and logistic regression. Nat. Hazards 2011, 59, 1413. [Google Scholar] [CrossRef]
  18. Zhang, G.; Cai, Y.; Zheng, Z.; Zhen, J.; Liu, Y.; Huang, K. Integration of the statistical index method and the analytic hierarchy process technique for the assessment of landslide susceptibility in Huizhou, China. CATENA 2016, 142, 233–244. [Google Scholar] [CrossRef]
  19. Constantin, M.; Bednarik, M.; Jurchescu, M.C.; Vlaicu, M. Landslide susceptibility assessment using the bivariate statistical analysis and the index of entropy in the sibiciu basin (romania). Environ. Earth Sci. 2011, 63, 397–406. [Google Scholar] [CrossRef]
  20. Jaafari, A.; Najafi, A.; Pourghasemi, H.R.; Rezaeian, J.; Sattarian, A. Gis-based frequency ratio and index of entropy models for landslide susceptibility assessment in the caspian forest, northern iran. Int. J. Environ. Sci. Technol. 2014, 11, 909–926. [Google Scholar] [CrossRef] [Green Version]
  21. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of gis-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of ambon, Indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  22. Regmi, A.D.; Yoshida, K.; Pourghasemi, H.R.; DhitaL, M.R.; Pradhan, B. Landslide susceptibility mapping along bhalubang—Shiwapur area of mid-western nepal using frequency ratio and conditional probability models. J. Mt. Sci. 2014, 11, 1266–1285. [Google Scholar] [CrossRef]
  23. Umar, Z.; Pradhan, B.; Ahmad, A.; Jebur, M.N.; Tehrany, M.S. Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in west sumatera province, Indonesia. CATENA 2014, 118, 124–135. [Google Scholar] [CrossRef]
  24. Rana, N.; Bisht, P.; Bagri, D.S.; Wasson, R.J.; Sundriyal, Y. Identification of landslide-prone zones in the geomorphically and climatically sensitive mandakini valley, (central Himalaya), for disaster governance using the weights of evidence method. Geomorphology 2017, 284, 41–52. [Google Scholar]
  25. Razavizadeh, S.; Solaimani, K.; Massironi, M.; Kavian, A. Mapping landslide susceptibility with frequency ratio, statistical index, and weights of evidence models: A case study in northern iran. Environ. Earth Sci. 2017, 76, 499. [Google Scholar] [CrossRef]
  26. Fan, W.; Wei, X.-S.; Cao, Y.-B.; Zheng, B. Landslide susceptibility assessment using the certainty factor and analytic hierarchy process. J. Mt. Sci. 2017, 14, 906–925. [Google Scholar] [CrossRef]
  27. Pradhan, B.; Sameen, M.I. Landslide susceptibility modeling: Optimization and factor effect analysis. In Laser Scanning Applications in Landslide Assessment; Pradhan, B., Ed.; Springer International Publishing: Cham, Switzerland, 2017; pp. 115–132. [Google Scholar]
  28. Althuwaynee, O.F.; Pradhan, B.; Lee, S. Application of an evidential belief function model in landslide susceptibility mapping. Comput. Geosci. 2012, 44, 120–135. [Google Scholar] [CrossRef]
  29. Gayen, A.; Saha, S. Application of weights-of-evidence (woe) and evidential belief function (ebf) models for the delineation of soil erosion vulnerable zones: A study on pathro river basin, jharkhand, India. Model. Earth Syst. Environ. 2017, 3, 1123–1139. [Google Scholar] [CrossRef]
  30. Mondal, S.; Maiti, R. Integrating the analytical hierarchy process (ahp) and the frequency ratio (fr) model in landslide susceptibility mapping of shiv-khola watershed, darjeeling himalaya. Int. J. Disaster Risk Sci. 2013, 4, 200–212. [Google Scholar] [CrossRef] [Green Version]
  31. Myronidis, D.; Papageorgiou, C.; Theophanous, S. Landslide susceptibility mapping based on landslide history and analytic hierarchy process (ahp). Nat. Hazards 2016, 81, 245–263. [Google Scholar] [CrossRef]
  32. Yoshimatsu, H.; Abe, S. A review of landslide hazards in japan and assessment of their susceptibility using an analytical hierarchic process (ahp) method. Landslides 2006, 3, 149–158. [Google Scholar] [CrossRef]
  33. Chen, W.; Zhao, X.; Shahabi, H.; Shirzadi, A.; Khosravi, K.; Chai, H.; Zhang, S.; Zhang, L.; Ma, J.; Chen, Y.; et al. Spatial prediction of landslide susceptibility by combining evidential belief function, logistic regression and logistic model tree. Geocarto Int. 2019, 34, 1177–1201. [Google Scholar] [CrossRef]
  34. Feizizadeh, B.; Roodposhti, M.S.; Blaschke, T.; Aryal, J. Comparing gis-based support vector machine kernel functions for landslide susceptibility mapping. Arab. J. Geosci. 2017, 10, 122. [Google Scholar] [CrossRef]
  35. Pourghasemi, H.R.; Jirandeh, A.G.; Pradhan, B.; Xu, C.; Gokceoglu, C. Landslide susceptibility mapping using support vector machine and gis at the golestan province, iran. J. Earth Syst. Sci. 2013, 122, 349–369. [Google Scholar] [CrossRef] [Green Version]
  36. Wu, X.; Ren, F.; Niu, R. Landslide susceptibility assessment using object mapping units, decision tree, and support vector machine models in the three gorges of china. Environ. Earth Sci. 2014, 71, 4725–4738. [Google Scholar] [CrossRef]
  37. Catani, F.; Lagomarsino, D.; Segoni, S.; Tofani, V. Landslide susceptibility estimation by random forests technique: Sensitivity and scaling issues. Nat. Hazards Earth Syst. Sci. 2013, 13, 2815–2831. [Google Scholar] [CrossRef] [Green Version]
  38. Chen, W.; Pourghasemi, H.R.; Naghibi, S.A. Prioritization of landslide conditioning factors and its spatial modeling in shangnan county, china using gis-based data mining algorithms. Bull. Eng. Geol. Environ. 2018, 77, 611–629. [Google Scholar] [CrossRef]
  39. Chen, W.; Hong, H.; Panahi, M.; Shahabi, H.; Wang, Y.; Shirzadi, A.; Pirasteh, S.; Alesheikh, A.A.; Khosravi, K.; Panahi, S.; et al. Spatial prediction of landslide susceptibility using gis-based data mining techniques of anfis with whale optimization algorithm (woa) and grey wolf optimizer (gwo). Appl. Sci. 2019, 9, 3755. [Google Scholar] [CrossRef] [Green Version]
  40. Nasiri Aghdam, I.; Pradhan, B.; Panahi, M. Landslide susceptibility assessment using a novel hybrid model of statistical bivariate methods (fr and woe) and adaptive neuro-fuzzy inference system (anfis) at southern zagros mountains in iran. Environ. Earth Sci. 2017, 76, 237. [Google Scholar] [CrossRef]
  41. Quan, H.-C.; Lee, B.-G. Gis-based landslide susceptibility mapping using analytic hierarchy process and artificial neural network in jeju (korea). KSCE J. Civ. Eng. 2012, 16, 1258–1266. [Google Scholar] [CrossRef]
  42. Wang, Q.; Li, W.; Xing, M.; Wu, Y.; Pei, Y.; Yang, D.; Bai, H. Landslide susceptibility mapping at gongliu county, china using artificial neural network and weight of evidence models. Geosci. J. 2016, 20, 705–718. [Google Scholar] [CrossRef]
  43. Zare, M.; Pourghasemi, H.R.; Vafakhah, M.; Pradhan, B. Landslide susceptibility mapping at vaz watershed (iran) using an artificial neural network model: A comparison between multilayer perceptron (mlp) and radial basic function (rbf) algorithms. Arab. J. Geosci. 2013, 6, 2873–2888. [Google Scholar] [CrossRef]
  44. Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using gis. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef]
  45. Zhang, K.; Wu, X.; Niu, R.; Yang, K.; Zhao, L. The assessment of landslide susceptibility mapping using random forest and decision tree methods in the three gorges reservoir area, China. Environ. Earth Sci. 2017, 76, 405. [Google Scholar] [CrossRef]
  46. Pham, B.T.; Tien Bui, D.; Prakash, I. Application of classification and regression trees for spatial prediction of rainfall-induced shallow landslides in the uttarakhand area (India) using gis. In Climate Change, Extreme Events and Disaster Risk Reduction: Towards Sustainable Development Goals; Mal, S., Singh, R.B., Huggel, C., Eds.; Springer International Publishing: Cham, Switzerland, 2018; pp. 159–170. [Google Scholar]
  47. Sahana, M.; Sajjad, H. Evaluating effectiveness of frequency ratio, fuzzy logic and logistic regression models in assessing landslide susceptibility: A case from rudraprayag district, India. J. Mt. Sci. 2017, 14, 2150–2167. [Google Scholar] [CrossRef]
  48. Chen, W.; Shahabi, H.; Zhang, S.; Khosravi, K.; Shirzadi, A.; Chapi, K.; Pham, B.T.; Zhang, T.; Zhang, L.; Chai, H.; et al. Landslide susceptibility modeling based on gis and novel bagging-based kernel logistic regression. Appl. Sci. 2018, 8, 2540. [Google Scholar] [CrossRef] [Green Version]
  49. Jebur, M.N.; Pradhan, B.; Tehrany, M.S. Manifestation of lidar-derived parameters in the spatial prediction of landslides using novel ensemble evidential belief functions and support vector machine models in gis. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 674–690. [Google Scholar] [CrossRef]
  50. Wang, L.-J.; Guo, M.; Sawada, K.; Lin, J.; Zhang, J. A comparative study of landslide susceptibility maps using logistic regression, frequency ratio, decision tree, weights of evidence and artificial neural network. Geosci. J. 2016, 20, 117–136. [Google Scholar] [CrossRef]
  51. Aghdam, I.N.; Varzandeh, M.H.M.; Pradhan, B. Landslide susceptibility mapping using an ensemble statistical index (wi) and adaptive neuro-fuzzy inference system (anfis) model at Alborz mountains (Iran). Environ. Earth Sci. 2016, 75, 553. [Google Scholar] [CrossRef]
  52. Youssef, A.M.; Pradhan, B.; Jebur, M.N.; El-Harbi, H.M. Landslide susceptibility mapping using ensemble bivariate and multivariate statistical models in Fayfa area, Saudi Arabia. Environ. Earth Sci. 2015, 73, 3745–3761. [Google Scholar] [CrossRef]
  53. Chen, W.; Shirzadi, A.; Shahabi, H.; Ahmad, B.B.; Zhang, S.; Hong, H.; Zhang, N. A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naïve bayes tree classifiers for a landslide susceptibility assessment in Langao County, China. Geomat. Nat. Hazards Risk 2017, 8, 1955–1977. [Google Scholar] [CrossRef] [Green Version]
  54. Pham, B.T.; Prakash, I.; Tien Bui, D. Spatial prediction of landslides using a hybrid machine learning approach based on random subspace and classification and regression trees. Geomorphology 2018, 303, 256–270. [Google Scholar] [CrossRef]
  55. Persichillo, M.G.; Bordoni, M.; Meisina, C. The role of land use changes in the distribution of shallow landslides. Sci. Total Environ. 2017, 574, 924–937. [Google Scholar] [CrossRef]
  56. Nicu, I.C.; Asăndulesei, A. Gis-based evaluation of diagnostic areas in landslide susceptibility analysis of Bahluieț river basin (Moldavian Plateau, Ne Romania). Are neolithic sites in danger? Geomorphology 2018, 314, 27–41. [Google Scholar] [CrossRef]
  57. Sharma, S.; Mahajan, A.K. A comparative assessment of information value, frequency ratio and analytical hierarchy process models for landslide susceptibility mapping of a Himalayan watershed, India. Bull. Eng. Geol. Environ. 2019, 78, 2431–2448. [Google Scholar] [CrossRef]
  58. Shirani, K.; Pasandi, M.; Arabameri, A. Landslide susceptibility assessment by dempster–shafer and index of entropy models, Sarkhoun basin, southwestern Iran. Nat. Hazards 2018, 93, 1379–1418. [Google Scholar] [CrossRef]
  59. Youssef, A.M.; Al-Kathery, M.; Pradhan, B. Landslide susceptibility mapping at Al-Hasher area, Jizan (Saudi Arabia) using gis-based frequency ratio and index of entropy models. Geosci. J. 2015, 19, 113–134. [Google Scholar] [CrossRef]
  60. Wang, Q.; Li, W.; Wu, Y.; Pei, Y.; Xie, P. Application of statistical index and index of entropy methods to landslide susceptibility assessment in gongliu (Xinjiang, China). Environ. Earth Sci. 2016, 75, 599. [Google Scholar] [CrossRef]
  61. Veenadhari, S.; Misra, B.; Singh, C. Machine learning approach for forecasting crop yield based on climatic parameters. In Proceedings of the 2014 International Conference on Computer Communication and Informatics, Coimbatore, India, 3–5 January 2014; pp. 1–5. [Google Scholar]
  62. Islam, Z.; Giggins, H. Knowledge discovery through sysfor: A systematically developed forest of multiple decision trees. In Proceedings of the Ninth Australasian Data Mining Conference; Australian Computer Society, Inc.: Ballarat, Australia, 2011; Volume 121, pp. 195–204. [Google Scholar]
  63. Bibri, S.E. Data science for urban sustainability: Data mining and data-analytic thinking in the next wave of city analytics. In Smart Sustainable Cities of the Future: The Untapped Potential of Big Data Analytics and Context–Aware Computing for Advancing Sustainability; Bibri, S.E., Ed.; Springer International Publishing: Cham, Switherland, 2018; pp. 189–246. [Google Scholar]
  64. Polykretis, C.; Chalkias, C. Comparison and evaluation of landslide susceptibility maps obtained from weight of evidence, logistic regression, and artificial neural network models. Nat. Hazards 2018, 93, 249–274. [Google Scholar] [CrossRef]
  65. Pourghasemi, H.R.; Moradi, H.R.; Fatemi Aghda, S.M. Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances. Nat. Hazards 2013, 69, 749–779. [Google Scholar] [CrossRef]
  66. Shahabi, H.; Hashim, M.; Ahmad, B.B. Remote sensing and gis-based landslide susceptibility mapping using frequency ratio, logistic regression, and fuzzy logic methods at the central zab basin, iran. Environ. Earth Sci. 2015, 73, 8647–8668. [Google Scholar] [CrossRef]
  67. Meng, Q.; Miao, F.; Zhen, J.; Wang, X.; Wang, A.; Peng, Y.; Fan, Q. Gis-based landslide susceptibility mapping with logistic regression, analytical hierarchy process, and combined fuzzy and support vector machine methods: A case study from wolong giant panda natural reserve, china. Bull. Eng. Geol. Environ. 2016, 75, 923–944. [Google Scholar] [CrossRef]
  68. Fell, R.; Glastonbury, J.; Hunter, G. Rapid landslides: The importance of understanding mechanisms and rupture surface mechanics. Q. J. Eng. Geol. Hydrogeol. 2007, 40, 9–27. [Google Scholar] [CrossRef]
  69. Rosi, A.; Tofani, V.; Tanteri, L.; Stefanelli, C.T.; Agostini, A.; Catani, F.; Casagli, N. The new landslide inventory of tuscany (Italy) updated with ps-insar: Geomorphological features and landslide distribution. Landslides 2018, 15, 5–19. [Google Scholar] [CrossRef] [Green Version]
  70. Chen, W.; Sun, Z.; Han, J. Landslide susceptibility modeling using integrated ensemble weights of evidence with logistic regression and random forest models. Appl. Sci. 2019, 9, 171. [Google Scholar] [CrossRef] [Green Version]
  71. Hungr, O.; Leroueil, S.; Picarelli, L. The varnes classification of landslide types, an update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
  72. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  73. Conforti, M.; Pascale, S.; Robustelli, G.; Sdao, F. Evaluation of prediction capability of the artificial neural networks for mapping landslide susceptibility in the Turbolo river catchment (northern Calabria, Italy). CATENA 2014, 113, 236–250. [Google Scholar] [CrossRef]
  74. Hong, H.; Pradhan, B.; Xu, C.; Tien Bui, D. Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. CATENA 2015, 133, 266–281. [Google Scholar] [CrossRef]
  75. Pham, B.T.; Tien Bui, D.; Pourghasemi, H.R.; Indra, P.; Dholakia, M.B. Landslide susceptibility assesssment in the uttarakhand area (India) using gis: A comparison study of prediction capability of naïve bayes, multilayer perceptron neural networks, and functional trees methods. Theor. Appl. Climatol. 2017, 128, 255–273. [Google Scholar] [CrossRef]
  76. Chen, W.; Panahi, M.; Pourghasemi, H.R. Performance evaluation of gis-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (anfis) with genetic algorithm (ga), differential evolution (de), and particle swarm optimization (pso) for landslide spatial modelling. CATENA 2017, 157, 310–324. [Google Scholar] [CrossRef]
  77. Chapi, K.; Singh, V.P.; Shirzadi, A.; Shahabi, H.; Bui, D.T.; Pham, B.T.; Khosravi, K. A novel hybrid artificial intelligence approach for flood susceptibility assessment. Environ. Model. Softw. 2017, 95, 229–245. [Google Scholar] [CrossRef]
  78. Chen, W.; Zhang, S.; Li, R.; Shahabi, H. Performance evaluation of the gis-based data mining techniques of best-first decision tree, random forest, and naïve bayes tree for landslide susceptibility modeling. Sci. Total Environ. 2018, 644, 1006–1018. [Google Scholar] [CrossRef]
  79. Losasso, L.; Rinaldi, C.; Alberico, D.; Sdao, F. In Landslide risk analysis along strategic touristic roads in Basilicata (southern Italy) using the modified rhrs 2.0 method. In Proceedings of the Computational Science and Its Applications—ICCSA 2017; Gervasi, O., Murgante, B., Misra, S., Borruso, G., Torre, C.M., Rocha, A.M.A.C., Taniar, D., Apduhan, B.O., Stankova, E., Cuzzocrea, A., Eds.; Springer International Publishing: Cham, Switherland, 2017; pp. 761–776. [Google Scholar]
  80. Sridhar, B.; Rao, P.J.; Narasimha Rao, G.; Duvvuru, R.; Anusha, C.; Sanyasi Naidu, D.; Srinivas, E.; Sridevi, T.; Madhuri, M.; Padmini, Y. In Identification of landslide hazard zones along the Bheemili Beach road, Visakhapatnam district, A.P. In Proceedings of International Conference on Remote Sensing for Disaster Management; Rao, P.J., Rao, K.N., Kubo, S., Eds.; Springer International Publishing: Cham, Switherland, 2019; pp. 515–522. [Google Scholar]
  81. Srivastava, P.K.; Han, D.; Rico-Ramirez, M.A.; Islam, T.J.H.P. Sensitivity and uncertainty analysis of mesoscale model downscaled hydro-meteorological variables for discharge prediction. Hydrol. Process. 2014, 28, 4419–4432. [Google Scholar] [CrossRef]
  82. Bièvre, G.; Jongmans, D.; Goutaland, D.; Pathier, E.; Zumbo, V. Geophysical characterization of the lithological control on the kinematic pattern in a large clayey landslide (Avignonet, French Alps). Landslides 2016, 13, 423–436. [Google Scholar] [CrossRef]
  83. Watakabe, T.; Matsushi, Y. Lithological controls on hydrological processes that trigger shallow landslides: Observations from granite and hornfels hillslopes in Hiroshima, Japan. CATENA 2019, 180, 55–68. [Google Scholar] [CrossRef]
  84. Bezak, N.; Jemec Auflič, M.; Mikoš, M. Application of hydrological modelling for temporal prediction of rainfall-induced shallow landslides. Landslides 2019, 16, 1273–1283. [Google Scholar] [CrossRef] [Green Version]
  85. Piciullo, L.; Gariano, S.L.; Melillo, M.; Brunetti, M.T.; Peruccacci, S.; Guzzetti, F.; Calvello, M. Definition and performance of a threshold-based regional early warning model for rainfall-induced landslides. Landslides 2017, 14, 995–1008. [Google Scholar] [CrossRef]
  86. Chen, C.-W.; Chen, H.; Oguchi, T. Distributions of landslides, vegetation, and related sediment yields during typhoon events in northwestern Taiwan. Geomorphology 2016, 273, 1–13. [Google Scholar] [CrossRef]
  87. Fiorucci, F.; Ardizzone, F.; Mondini, A.C.; Viero, A.; Guzzetti, F. Visual interpretation of stereoscopic ndvi satellite images to map rainfall-induced landslides. Landslides 2019, 16, 165–174. [Google Scholar] [CrossRef]
  88. Sun, W.; Tian, Y.; Mu, X.; Zhai, J.; Gao, P.; Zhao, G. Loess landslide inventory map based on gf-1 satellite imagery. Remote Sens. 2017, 9, 314. [Google Scholar] [CrossRef] [Green Version]
  89. Bartelletti, C.; Giannecchini, R.; D’Amato Avanzi, G.; Galanti, Y.; Mazzali, A. The influence of geological–morphological and land use settings on shallow landslides in the pogliaschina T. Basin (northern Apennines, Italy). J. Maps 2017, 13, 142–152. [Google Scholar] [CrossRef] [Green Version]
  90. Li, R.; Wang, N. Landslide susceptibility mapping for the Muchuan county (China): A comparison between bivariate statistical models (woe, ebf, and ioe) and their ensembles with logistic regression. Symmetry 2019, 11, 762. [Google Scholar] [CrossRef] [Green Version]
  91. Chen, W.; Xie, X.; Peng, J.; Wang, J.; Duan, Z.; Hong, H. Gis-based landslide susceptibility modelling: A comparative assessment of kernel logistic regression, naïve-bayes tree, and alternating decision tree models. Geomat. Nat. Hazards Risk 2017, 8, 950–973. [Google Scholar] [CrossRef] [Green Version]
  92. Chen, W.; Shahabi, H.; Shirzadi, A.; Hong, H.; Akgun, A.; Tian, Y.; Liu, J.; Zhu, A.X.; Li, S. Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 2019, 78, 4397–4419. [Google Scholar] [CrossRef]
  93. Chen, W.; Shahabi, H.; Shirzadi, A.; Li, T.; Guo, C.; Hong, H.; Li, W.; Pan, D.; Hui, J.; Ma, M.; et al. A novel ensemble approach of bivariate statistical-based logistic model tree classifier for landslide susceptibility assessment. Geocarto Int. 2018, 33, 1398–1420. [Google Scholar] [CrossRef]
  94. Ashournejad, Q.; Hosseini, A.; Pradhan, B.; Hosseini, S.J. Hazard zoning for spatial planning using gis-based landslide susceptibility assessment: A new hybrid integrated data-driven and knowledge-based model. Arab. J. Geosci. 2019, 12, 126. [Google Scholar] [CrossRef]
  95. Moosavi, V.; Niazi, Y. Development of hybrid wavelet packet-statistical models (wp-sm) for landslide susceptibility mapping. Landslides 2016, 13, 97–114. [Google Scholar] [CrossRef]
  96. Shirzadi, A.; Bui, D.T.; Pham, B.T.; Solaimani, K.; Chapi, K.; Kavian, A.; Shahabi, H.; Revhaug, I. Shallow landslide susceptibility assessment using a novel hybrid intelligence approach. Environ. Earth Sci. 2017, 76, 60. [Google Scholar] [CrossRef]
  97. Battistini, A.; Rosi, A.; Segoni, S.; Lagomarsino, D.; Catani, F.; Casagli, N. Validation of landslide hazard models using a semantic engine on online news. Appl. Geogr. 2017, 82, 59–65. [Google Scholar] [CrossRef]
  98. Hong, H.; Shahabi, H.; Shirzadi, A.; Chen, W.; Chapi, K.; Ahmad, B.B.; Roodposhti, M.S.; Yari Hesar, A.; Tian, Y.; Tien Bui, D. Landslide susceptibility assessment at the wuning area, china: A comparison between multi-criteria decision making, bivariate statistical and machine learning methods. Nat. Hazards 2019, 96, 173–212. [Google Scholar] [CrossRef]
  99. Murillo-García, F.G.; Steger, S.; Alcántara-Ayala, I. Landslide susceptibility: A statistically-based assessment on a depositional pyroclastic ramp. J. Mt. Sci. 2019, 16, 561–580. [Google Scholar] [CrossRef]
  100. Chen, W.; Hong, H.; Li, S.; Shahabi, H.; Wang, Y.; Wang, X.; Ahmad, B.B. Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles. J. Hydrol. 2019, 575, 864–873. [Google Scholar] [CrossRef]
  101. Chen, W.; Panahi, M.; Khosravi, K.; Pourghasemi, H.R.; Rezaie, F.; Parvinnezhad, D. Spatial prediction of groundwater potentiality using anfis ensembled with teaching-learning-based and biogeography-based optimization. J. Hydrol. 2019, 572, 435–448. [Google Scholar] [CrossRef]
  102. Chen, W.; Li, Y.; Xue, W.; Shahabi, H.; Li, S.; Hong, H.; Wang, X.; Bian, H.; Zhang, S.; Pradhan, B.; et al. Modeling flood susceptibility using data-driven approaches of naïve bayes tree, alternating decision tree, and random forest methods. Sci. Total Environ. 2020, 701, 134979. [Google Scholar] [CrossRef] [PubMed]
  103. Chen, W.; Pradhan, B.; Li, S.; Shahabi, H.; Rizeei, H.M.; Hou, E.; Wang, S. Novel hybrid integration approach of bagging-based fisher’s linear discriminant function for groundwater potential analysis. Nat. Resour. Res. 2019, 28, 1239–1258. [Google Scholar] [CrossRef] [Green Version]
  104. Chen, W.; Tsangaratos, P.; Ilia, I.; Duan, Z.; Chen, X. Groundwater spring potential mapping using population-based evolutionary algorithms and data mining methods. Sci. Total Environ. 2019, 684, 31–49. [Google Scholar] [CrossRef] [PubMed]
  105. Chen, W.; Li, H.; Hou, E.; Wang, S.; Wang, G.; Panahi, M.; Li, T.; Peng, T.; Guo, C.; Niu, C.; et al. Gis-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. Sci. Total Environ. 2018, 634, 853–867. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  106. Pham, B.T.; Prakash, I.; Singh, S.K.; Shirzadi, A.; Shahabi, H.; Tran, T.-T.-T.; Bui, D.T. Landslide susceptibility modeling using reduced error pruning trees and different ensemble techniques: Hybrid machine learning approaches. CATENA 2019, 175, 203–218. [Google Scholar] [CrossRef]
  107. Tien Bui, D.; Pham, B.T.; Nguyen, Q.P.; Hoang, N.-D. Spatial prediction of rainfall-induced shallow landslides using hybrid integration approach of least-squares support vector machines and differential evolution optimization: A case study in central vietnam. Int. J. Digit. Earth 2016, 9, 1077–1097. [Google Scholar] [CrossRef]
  108. Althuwaynee, O.F.; Pradhan, B.; Park, H.J.; Lee, J.H. A novel ensemble decision tree-based chi-squared automatic interaction detection (chaid) and multivariate logistic regression models in landslide susceptibility mapping. Landslides 2014, 11, 1063–1078. [Google Scholar] [CrossRef]
  109. Arabameri, A.; Pradhan, B.; Rezaei, K. Spatial prediction of gully erosion using alos palsar data and ensemble bivariate and data mining models. Geosci. J. 2019, 23, 669–686. [Google Scholar] [CrossRef]
  110. Chen, W.; Xie, X.; Peng, J.; Shahabi, H.; Hong, H.; Bui, D.T.; Duan, Z.; Li, S.; Zhu, A.X. Gis-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. CATENA 2018, 164, 135–149. [Google Scholar] [CrossRef]
  111. Pourghasemi, H.R.; Mohammady, M.; Pradhan, B. Landslide susceptibility mapping using index of entropy and conditional probability models in gis: Safarood basin, Iran. CATENA 2012, 97, 71–84. [Google Scholar] [CrossRef]
  112. Abedini, M.; Ghasemyan, B.; Rezaei Mogaddam, M.H. Landslide susceptibility mapping in bijar city, kurdistan province, iran: A comparative study by logistic regression and ahp models. Environ. Earth Sci. 2017, 76, 308. [Google Scholar] [CrossRef]
  113. Ba, Q.; Chen, Y.; Deng, S.; Yang, J.; Li, H. A comparison of slope units and grid cells as mapping units for landslide susceptibility assessment. Earth Sci. Inform. 2018, 11, 373–388. [Google Scholar] [CrossRef]
  114. Zhao, X.; Chen, W. Gis-based evaluation of landslide susceptibility models using certainty factors and functional trees-based ensemble techniques. Appl. Sci. 2020, 10, 16. [Google Scholar]
  115. Raja, N.B.; Çiçek, I.; Türkoğlu, N.; Aydin, O.; Kawasaki, A. Correction to: Landslide susceptibility mapping of the sera river basin using logistic regression model. Nat. Hazards 2018, 91, 1423. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Study area and landslide inventory.
Figure 1. Study area and landslide inventory.
Applsci 10 00029 g001
Figure 2. Thematic maps of landslide conditioning factors.
Figure 2. Thematic maps of landslide conditioning factors.
Applsci 10 00029 g002aApplsci 10 00029 g002b
Figure 3. Landslide susceptibility maps: (a) FR model, (b) IoE model, (c) FR_SysFor model, (d) IoE_SysFor model, (e) FR_LR model, (f) IoE_LR model.
Figure 3. Landslide susceptibility maps: (a) FR model, (b) IoE model, (c) FR_SysFor model, (d) IoE_SysFor model, (e) FR_LR model, (f) IoE_LR model.
Applsci 10 00029 g003aApplsci 10 00029 g003b
Figure 4. Receiver operating characteristics (ROC) curves: (a) training data, (b) validating data.
Figure 4. Receiver operating characteristics (ROC) curves: (a) training data, (b) validating data.
Applsci 10 00029 g004
Figure 5. Model validation with the success rate (AUC_T) curve for the landslide susceptibility maps.
Figure 5. Model validation with the success rate (AUC_T) curve for the landslide susceptibility maps.
Applsci 10 00029 g005
Figure 6. Model validation with the prediction rate (AUC_P) curve for the landslide susceptibility maps.
Figure 6. Model validation with the prediction rate (AUC_P) curve for the landslide susceptibility maps.
Applsci 10 00029 g006
Table 1. Multicollinearity analysis.
Table 1. Multicollinearity analysis.
ParametersCollinearity Statistics
ToleranceVIFToleranceVIF
Elevation0.5431.8430.5071.970
Profile curvature0.8971.1150.8971.115
Plan curvature0.8241.2140.8011.248
Slope angle0.7641.3080.7591.317
Slope aspect0.9121.0960.9041.106
SPI0.5241.9070.4642.157
TWI0.8301.2050.8341.200
STI0.4792.0860.4262.346
Distance to roads0.8171.2240.8231.214
Distance to rivers0.9521.0500.9241.083
Distance to faults0.9261.0800.7311.368
Lithology0.7291.3720.7341.362
Rainfall0.7411.3490.8021.247
Soil0.8051.2420.6871.456
NDVI0.8471.1800.6391.566
Land use0.6401.5620.5071.970
Table 2. Relationship between landslides and conditioning factors using FR and IoE models.
Table 2. Relationship between landslides and conditioning factors using FR and IoE models.
FactorsClassPercentage of DomainPercentage of LandslidesFRWj
Elevation (m)442–60013.24219.8581.5000.223
600–80016.39934.0432.076
800–100012.09121.9861.818
1000–120010.45012.0571.154
1200–140012.75610.6380.834
1400–160012.3411.4180.115
1600–180012.0420.0000.000
1800–20007.9200.0000.000
2000–22002.5040.0000.000
2200–24100.2550.0000.000
Profile curvature−14.28 to −0.0545.75346.8091.0230.083
−0.05–0.055.69611.3481.992
0.05–14.7748.55141.8440.862
Plan curvature−14.0 to −0.0546.11041.1350.8920.014
−0.05–0.056.8939.2201.338
0.05–13.0746.99649.6451.056
Slope angle (°)0–1023.64125.5321.0800.146
10–2029.27044.6811.526
20–3026.29220.5670.782
30–4014.9877.8010.521
40–504.9751.4180.285
50–600.7800.0000.000
60–72.830.0550.0000.000
Slope aspectFlat0.0280.0000.0000.075
North14.21211.3480.798
North-east12.97611.3480.875
East12.04612.0571.001
South-east12.50516.3121.304
South11.98222.6951.894
South-west11.0449.2200.835
West11.3556.3830.562
North-west13.85310.6380.768
SPI<2055.73462.4111.1200.010
20–4015.93014.1840.890
40–607.4048.5111.149
60–804.2843.5460.828
>8016.64911.3480.682
TWI<417.8807.8010.4360.046
4–532.04935.4611.106
5–623.90334.0431.424
6–712.59914.1841.126
>713.5698.5110.627
STI<1055.15763.8301.1570.024
10–2022.49122.6951.009
20–309.4474.2550.450
30–404.5243.5460.784
>408.3815.6740.677
Distance to roads (m)<30011.01023.4042.1260.065
300–6008.95111.3481.268
600–9007.8525.6740.723
900–12007.03610.6381.512
>120065.15148.9360.751
Distance to rivers (m)<20016.68619.1491.1480.003
200–40014.83715.6031.052
400–60013.69512.7660.932
600–80012.00213.4751.123
>80042.78139.0070.912
Distance to faults (m)<100021.51022.6951.0550.022
1000–200016.10612.7660.793
2000–300013.19814.8941.129
3000–400011.1685.6740.508
>400038.01943.9721.157
LithologyGroup 111.62719.1491.6470.117
Group 20.0540.0000.000
Group 34.3831.4180.324
Group 412.7564.2550.334
Group 50.2390.0000.000
Group 63.4530.0000.000
Group 77.2313.5460.490
Group 810.2287.0920.693
Group 98.8466.3830.722
Group 101.2540.7090.566
Group 1113.94522.6951.627
Group 1225.98234.7521.338
Rainfall (mm/yr)<9006.0695.6740.9350.111
900–100018.64220.5671.103
1000–11009.02916.3121.807
1100–120010.54424.8232.354
1200–13008.68011.3481.307
1300–140020.15912.7660.633
1400–150011.2474.9650.441
1500–16008.6702.1280.245
1600–17004.3430.7090.163
>17002.6180.7090.271
SoilATc11.53814.1841.2290.326
CMd3.6472.8370.778
CMe7.99221.9862.751
FLc1.1860.0000.000
LVh70.80856.7380.801
LVx0.3220.0000.000
PLe0.4752.1284.479
RGc2.6300.7090.270
RGe1.4031.4181.011
NDVI−0.21 to 0.212.1612.8371.3130.128
0.21–0.366.6607.0921.065
0.36–0.4420.84544.6812.144
0.44–0.5234.88139.7161.139
0.52–0.6535.4545.6740.160
Land useFarmland28.82660.9932.1160.311
Forestland30.9742.8370.092
Forestland38.50036.1700.939
Water0.6030.0000.000
Residential areas1.0750.0000.000
Bare land0.0220.0000.000
Table 3. Coefficients of models.
Table 3. Coefficients of models.
ParametersCoefficient of FR_LR ModelCoefficient of IoE_LR Model
Elevation0.5961.978
Profile curvature0.005−0.626
Plan curvature1.39072.424
Slope angle0.1821.042
Slope aspect0.6339.670
SPI0.37066.354
TWI0.7291.570
STI0.3817.543
Distance to roads0.4365.933
Distance to rivers−0.722——
Distance to faults−0.060−4.887
Lithology−0.138−1.090
Rainfall0.6075.141
Soil−0.0300.008
NDVI0.2755.007
Land use0.3991.260
Constant−5.702−6.400
Table 4. Parameters of ROC curves using training data.
Table 4. Parameters of ROC curves using training data.
VariableAUCSE95% CI
FR_model0.7570.02850.702 to 0.806
IoE_model0.7460.02920.691 to 0.796
FR_SysFor_model0 9400.01320.906 to 0.965
IoE_SysFor_model0.9260.01510.889 to 0.954
FR_LR_model0.7790.02710.726 to 0.826
IoE_LR_model0.7830.02700.730 to 0.829
Table 5. Parameters of ROC curves using validating data.
Table 5. Parameters of ROC curves using validating data.
VariableAUCSE95% CI
FR_model0.6810.04980.590 to 0.762
IoE_model0.6910.04890.601 to 0.771
FR_SysFor_model0.8310.03880.753 to 0.893
IoE_SysFor_model0.8190.03990.739 to 0.883
FR_LR_model0.6960.04850.606 to 0.776
IoE_LR_model0.7020.04830.612 to 0.781

Share and Cite

MDPI and ACS Style

Chen, W.; Fan, L.; Li, C.; Pham, B.T. Spatial Prediction of Landslides Using Hybrid Integration of Artificial Intelligence Algorithms with Frequency Ratio and Index of Entropy in Nanzheng County, China. Appl. Sci. 2020, 10, 29. https://0-doi-org.brum.beds.ac.uk/10.3390/app10010029

AMA Style

Chen W, Fan L, Li C, Pham BT. Spatial Prediction of Landslides Using Hybrid Integration of Artificial Intelligence Algorithms with Frequency Ratio and Index of Entropy in Nanzheng County, China. Applied Sciences. 2020; 10(1):29. https://0-doi-org.brum.beds.ac.uk/10.3390/app10010029

Chicago/Turabian Style

Chen, Wei, Limin Fan, Cheng Li, and Binh Thai Pham. 2020. "Spatial Prediction of Landslides Using Hybrid Integration of Artificial Intelligence Algorithms with Frequency Ratio and Index of Entropy in Nanzheng County, China" Applied Sciences 10, no. 1: 29. https://0-doi-org.brum.beds.ac.uk/10.3390/app10010029

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop