Next Article in Journal
Robust Signaling for Bursty Interference
Next Article in Special Issue
Anomaly Detection in Paleoclimate Records Using Permutation Entropy
Previous Article in Journal
Short-Time Propagators and the Born–Jordan Quantization Rule
Previous Article in Special Issue
Optimization on theBuried Depth of Subsurface Drainage under Greenhouse Condition Based on Entropy Evaluation Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantitative Assessment of Landslide Susceptibility Comparing Statistical Index, Index of Entropy, and Weights of Evidence in the Shangnan Area, China

1
Propaganda Department, Shaanxi Radio and TV University, Xi’an 710119, China
2
College of Geology & Environment, Xi’an University of Science and Technology, Xi’an 710054, China
*
Authors to whom correspondence should be addressed.
Entropy 2018, 20(11), 868; https://doi.org/10.3390/e20110868
Submission received: 12 October 2018 / Revised: 6 November 2018 / Accepted: 8 November 2018 / Published: 10 November 2018
(This article belongs to the Special Issue Applications of Information Theory in the Geosciences II)

Abstract

:
In this study, a comparative analysis of the statistical index (SI), index of entropy (IOE) and weights of evidence (WOE) models was introduced to landslide susceptibility mapping, and the performance of the three models was validated and systematically compared. As one of the most landslide-prone areas in Shaanxi Province, China, Shangnan County was selected as the study area. Firstly, a series of reports, remote sensing images and geological maps were collected, and field surveys were carried out to prepare a landslide inventory map. A total of 348 landslides were identified in study area, and they were reclassified as a training dataset (70% = 244 landslides) and testing dataset (30% = 104 landslides) by random selection. Thirteen conditioning factors were then employed. Corresponding thematic data layers and landslide susceptibility maps were generated based on ArcGIS software. Finally, the area under the curve (AUC) values were calculated for the training dataset and the testing dataset in order to validate and compare the performance of the three models. For the training dataset, the AUC plots showed that the WOE model had the highest accuracy rate of 76.05%, followed by the SI model (74.67%) and the IOE model (71.12%). In the case of the testing dataset, the prediction accuracy rates for the SI, IOE and WOE models were 73.75%, 63.89%, and 75.10%, respectively. It can be concluded that the WOE model had the best prediction capacity for landslide susceptibility mapping in Shangnan County. The landslide susceptibility map produced by the WOE model had a profound geological and engineering significance in terms of landslide hazard prevention and control in the study area and other similar areas.

1. Introduction

Landslides, as one of the most critical geological hazards in the world, seriously threaten lives, property and natural resources [1,2,3,4,5]. According to the latest statistics on geological disasters carried out by the Chinese Geological Environment Information Site, more than 270,000 geological hazards occurred from 2006 to 2016, causing a direct economic loss of $7.7 billion, and the proportion of loss caused by landslides has increased year by year (http://www.cigem.gov.cn). Hence, in order to reduce the damage caused by landslides, investigating landslide susceptibility maps has become an important task that needs to be addressed [3,6,7,8,9]. Previous studies of landslide susceptibility mapping found that the quality of the data, the depth of the research and the methods of analysis were the three most important factors with a primary effect on the accuracy and reliability of the assessment results [6,9,10,11].
Along with the application of global positioning systems (GPS), remote sensing (RS), and geographic information systems (GIS) to landslide susceptibility mapping, more and more researchers have begun to apply relevant theories to landslide susceptibility assessment. These methods can be categorized into heuristic, deterministic, and statistical approaches [12]. Heuristic approaches are completely based on expert opinions or approaches, which are intensively subjective [12,13]. Deterministic approaches need a large number of detailed input factors to build models, which require field-based geotechnical and groundwater data; thus, these approaches are often used to prepare maps of small areas [12,13,14]. Therefore, statistical models are most commonly used in landslide susceptibility mapping.
Statistical models can be further categorized into traditional statistical methods, advanced machine learning technologies, and hybrid integration approaches. Traditional statistical methods are widely used, such as frequency ratio [15,16], evidential belief function [17,18,19], statistical index [13,20], weights of evidence [21,22,23], index of entropy [24,25,26,27,28], and logistic regression [29,30]. In recent decades, machine learning technologies have continuously introduced new and powerful approaches, such as naïve Bayes [31,32], naïve Bayes tree [33,34,35], artificial neural networks [16,36,37], kernel logistic regression [34,38,39], support vector machine [40,41], alternating decision tree [34,39], random forest [12,26,42,43], and multivariate adaptive regression spline [44,45]. In the interest of improving the accuracy of the prediction, some ensemble models have been proposed, such as adaptive, neuro-fuzzy inference, system–genetic algorithms [46]; bagging-based decision tree [47,48,49]; bivariate statistical-based ensembles [19,50]; adaptive neuro-fuzzy inference, system-shuffled, frog-leaping algorithms [51]; and artificial neural network-maximum entropy [52].
Some review articles show that different models have different characteristics, and each of them has strengths and weaknesses [41,53]. In the current research, we address compare three statistical models, applying, analyzing and inspecting the statistical index (SI), index of entropy (IOE), and weights of evidence models (WOE) with regard to landslide susceptibility mapping, using the case study of Shangnan Country, China.

2. Study Area

Shaanxi Province is situated in middle of China. The study area (Shangnan County) is located in the southeastern part of Shaanxi Province, China, between the latitudes of 33°06′ and 33°44′ N, and the longitudes of 110°24′ and 111°01′ E (see Figure 1). It covers an area of about 2307 km2 and its altitude ranges from 189 to 2050 m above sea level. Shangnan County is located in the transitional section from the northern subtropical zone to the warm temperate zone, which is characterized by warm, abundant rainfall and four distinct seasons. A major part of the study area is covered by grassland (44.91%), followed by forestland (32.98%), farmland (21.58%), residential areas (0.41%), water bodies (0.09%), and bare land (0.02%).
Topographically, the slope gradients vary from 0° to 65°. Approximately 16.80% of the study area has a slope gradient less than 10°, whereas areas with a slope gradient larger than 50° account for 0.41% of the total study area. Areas with the slope gradient of 10–20°, 20–30°, 30–40°, and 40–50° account for 29.92%, 32.79%, 15.57%, and 4.51% of the total study area, respectively.
Geologically, the study area is located at the border of the Yangtze and the North China plates. The faults of the study area have mainly a Northwest-Southeast direction [12]. The outcropped strata in the area are mainly from the Archaic to the Ordovician, and the Devonian and Carboniferous are partially outcropped. Since the Quaternary, the Earth’s crust has risen strongly and differential block movement has occurred. Therefore, the river is mainly characterized by down-cutting erosion, forming a deep “V” shaped valley in Danjiang River. Geomorphologically, the northern region of the study area is the middle mountainous area, the southcentral region is the middle and low mountains, while the mid-east region is faulted basin. According to the landforms, the study area could be divided into mid-mountain zones, low-relief zones, and river valley zones, with altitudes larger than 1000 m, 500–1000 m, and less than 500 m, respectively.

3. Data

The amount, distribution and characteristics of existing landslides were the basis of the susceptibility assessment. A landslide inventory map of a study area is effective and is organized to demonstrate the basic information regarding existing landslides [44,54]. In this case, the historical data on landslides and related information—including the topographical, geological and meteorological conditions—were acquired using three approaches, namely the analysis of existing historical records, interpretation of satellite images and field surveys in Shangnan County, respectively. In total, 348 existing landslides were identified, of which most of the landslides in the study area are slides (326), the others include 12 rock falls and 10 debris flow [12,55]. According to an analyse in the GIS environment, the size of the largest landslide is more than 30,000 m2, the smallest landslide is nearly 15 m2, while the average is 9600 m2. In addition, the shape and scale of the landslides in Shangnan County were simplified as a centroid point to establish the susceptibility assessment models. Finally, 348 landslides were randomly divided into training data (70%) and testing data (30%) (Figure 1).
In this paper, a total of thirteen landslide conditioning factors were employed to establish a series of mathematical models; the conditioning factors included slope angle, slope aspect, elevation, plan curvature, profile curvature, stream power index (SPI), sediment transport index (STI), topographic wetness index (TWI), distance to faults, distance to rivers, distance to roads, normalized difference vegetation index (NDVI), and lithology.
Slope angle is related to the failure mode and scale of the landslide, and was used widely and frequently in landslide susceptibility assessment [31,56,57,58]. Thus, the values of the slope angles in Shangnan County were extracted from the digital elevation model (DEM) with a resolution of 30 m and divided into six categories with an interval of 10°, namely, <10°, 10–20°, 20–30°, 30–40°, 40–50°, and >50° (Figure 2a).
Slope aspect is another critical parameter used broadly in landslide susceptibility assessment. This factor can influence the meteorological conditions, such as rainfall, evaporation, temperature, etc. These meteorological conditions are generally connected to the stability of slopes [59,60]. Additionally, based on the DEM, the slope aspects in the study area were grouped into nine categories, as shown in Figure 2b.
The varieties of elevation reflect the changes in landforms between different geomorphic units. Therefore, elevation is also a relevant landslide conditioning factor used frequently in the establishment of landslide susceptibility assessment models [29,45,61]. In this study, the elevation values in Shangnan County were divided into six classes with an interval of 300 m, as follows: <500 m, 500–800 m, 800–1100 m, 1100–1400 m, 1400–1700 m, and >1700 m (Figure 2c).
Curvature, a technical term in topography, is the rate of change of the slope gradient or aspect in a particular direction [62]. Moreover, curvature can be further divided into plan curvature and profile curvature. The former is the curvature of a contour line formed by intersecting a horizontal plane with the surface, while the latter refers to the curvature in the vertical plane parallel to the slope direction [63,64]. For this reason, it was helpful to consider plan curvature and profile curvature in this study. By analyzing the DEM in the ArcGIS software (10.0, Esri, California, MA, USA), the plan curvature and profile curvature values in the study area were obtained and grouped into four classes based on the natural break method [25] (Figure 2d–e).
The stream power index (SPI) is a parameter measuring the stream power and erosion power of flowing water [65]. The scouring and infiltration of flowing water have a strong effect on the strength of the soil and rock that compose a slope. In the present study, the SPI values were arranged in four classes with an interval of 30, namely <30, 30–60, 60–90, and >90 (Figure 2f).
The sediment transport index (STI) is used to measure the erosive and transporting capacity of a stream [14]. In this study area, the STI values were divided into four categories with an interval of 10: <10, 10–20, 20–30, and >30 (Figure 2g).
The topographic wetness index (TWI) reflects the degree of accumulation of water at a site [66]. The TWI values in the study area were calculated and classified into four categories with an interval of 2 as follows: <5, 5–7, 7–9, and >9 (Figure 2h).
Generally speaking, faults can weaken the mechanical characteristics of the rock and soil of adjacent slopes [67]. Based on the ArcGIS software, buffers consisting of the Euclidean distance to faults were generated. Taking an equal interval of 1000 m, the values of the distance to faults are shown in Figure 2i, namely, <1000 m, 1000–2000 m, 2000–3000 m, 3000–4000 m, and >4000 m.
The seepage force generated by the discharge along slopes and rivers and the wetting effects of rivers have an adverse influence on the stability of slopes [68]. In this case, buffers consisting of the Euclidean distance to rivers were formed and are shown in Figure 2j. According to the equal interval classification method, there are five categories, namely <200 m, 200–400 m, 400–600 m, 600–800 m, and >800 m.
In Shangnan County, road building is one of the most major human engineering activities. Road construction frequently leads to the excavation of the toe of slopes, which may contribute to the occurrence of landslides [69]. In this case, the influence of roads was measured by the distance to roads, and the values were classified into five classes with an interval of 500 m: <500 m, 500–1000 m, 1000–1500 m, 1500–2000 m, and >2000 m, respectively (Figure 2k).
The normalized difference vegetation index (NDVI) is also universally applied in the process of landslide susceptibility assessment [25,70]. This parameter indicates the conditions of the vegetation coverage in the study area. By analyzing the near-infrared and the red band of Landsat 8 Operational Land Imager (OLI) images (http://www.gscloud.cn/), the NDVI values were calculated and classified into five classes based on the natural break method [34,71]: −0.25 to 0.17, 0.17–0.33, 0.33–0.42, 0.42–0.51, and 0.51–0.69 (Figure 2l).
Lithology is one of the most fundamental factors that determines the physical and mechanical properties of rock and soil [72,73]. Based on the field surveys and geological mapping, the lithological map of Shangnan County was digitized using the ArcGIS software. As is shown in Figure 2m, the lithological units in study area were grouped into nine categories based on the geological ages and lithofacies.

4. Modeling Approaches

4.1. Statistical Index (SI)

The statistical index model was first proposed by van Westen et al. [74]. In the SI model, a weight value for a parameter class can be defined as the natural logarithm of the landslide density in the class, divided by the landslide density in the whole study area [75,76]:
W i j = ln ( D i j D )  
where W i j is the weight for the class i of factor j , D i j is the landslide density within class i of the factor j , and D is the landslide density in the whole study area.

4.2. Index of Entropy (IOE)

The index of entropy is the second model used in this study. The entropy indicates the extent of the disorder of a system [77]. The equations used to calculate the information coefficient W j are expressed as below:
W j = I j × P j  
I j = H j max H j H j max , I = ( 0 , 1 ) , j = 1 , 2 , , n  
H j = i = 1 S j ( P i j ) log 2 ( P i j ) , j = 1 , 2 , , n  
H j max = log 2 S j  
( P i j ) = P i j j = 1 S j P i j  
P i j = m n  
where W j is the resultant weight value for the factors as a whole, P j is the slope failure probability for j = 1 , 2 , , n , I j is the information coefficient, Hj and Hjmax are the entropy values, Sj is the number of classes, and m and n are the landslide and domain percentages, respectively.

4.3. Weights of Evidence (WOE)

The WOE method is a probabilistic approach based on a log linear form of Bayes’ rule, expressed as:
P ( A | B ) = P ( B | A ) × P ( A ) P ( B )  
where A is the presence or absence of the landslide in the study area, and B is the landslide predictive factor. The approach calculates the weight for each B based on A, as follows [78,79]:
W i + = ln ( p ( B | A ) P ( B | A ¯ ) )  
W i = ln ( p ( B ¯ | A ) P ( B ¯ | A ¯ ) )  
where W i + is an indicator of the positive correlation, W i shows the level of negative correlation, B is the presence of a desired class of landslide conditioning factor, and B ¯ is the absence of desired class of landslide conditioning factor. A is the presence and A ¯ is the absence of the landslide. The difference between the two weights is called the weight contrast: C = W i + W i . The contrast reflects the overall spatial correlation between the desired class of landslide conditioning factor and the landslides.

4.4. Selection of Landslide Conditioning Factors

In landslide susceptibility modelling, landslides usually occur under different conditions, and the contribution of the conditioning factors to landslide occurrence is quite different [48]. Therefore, the removal of unimportant landslide conditioning factors to improve the performance of landslide models is necessary [80,81]. In this study, the SI, IOE, and WOE models were employed to construct the landslide susceptibility maps. Nevertheless, one of the most critical assumed conditions of these models is the independence assumption among the conditioning factors [38]. Therefore, in the present study, the coefficient of variation (CV) attribute evaluation (CVA) method was used to validate all thirteen landslide conditioning factors considered for the development of landslide susceptibility models. This method evaluates the worth of an attribute by computing the value of the coefficient of variation with respect to the class. It first creates a ranking of attributes based on the variation value, then divide this into two groups, using a verification method to select the best group [82].

5. Results and Discussion

5.1. Selection of Landslide Conditioning Factors

In the present study, based on the CVA method (a 10-fold cross-validation method [83,84], seed = 1), the importance of all the conditioning factors was measured according to average merit (AM), and the calculation results are illustrated in Table 1. The results show that all the AM values of the conditioning factors were larger than zero, indicating that the thirteen selected factors have positive influence on landslide occurrence. Of these factors, the highest AM value was for distance to roads (AM = 0.304), followed by elevation (AM = 0.296), distance to rivers (AM = 0.260), lithology (AM = 0.156), distance to faults (AM = 0.155), TWI (AM = 0.083), slope angle (AM = 0.069), NDVI (AM = 0.060), plan curvature (AM = 0.056), STI (AM = 0.035), slope aspect (AM = 0.032), STI (AM = 0.032), and profile curvature (AM = 0.025). Therefore, all thirteen landslide conditioning factors were selected for landslide susceptibility modeling in the present study.

5.2. Application of the SI Model

In this case, the SI model was applied to analyze the relationships between each conditioning factor and landslide occurrence (Table 2). From Table 2, it can be seen that for the slope angle 0–20°, the SI values were positive, which indicates that landslides were more prone to occurring in these areas. This is also in line with some other landslide susceptibility studies [81,85,86,87]. With regard to slope aspect, an eastern aspect had the highest SI value of 0.3024, while the lowest SI value was for southeast (–0.4019). In addition, no landslides occurred in flat areas (SI = 0), which conforms to actual situations and related research results [88,89]. When the altitude was lower than 800 m, there was a larger probability of landslides being triggered; all the landslides were not situated in areas with an altitude greater than 1400 m. In terms of curvature, the classes with a plan curvature of –1.09 to –0.11 (0.0124) and –0.11 to 0.88 (0.0264) had positive SI values, while the SI values were positive for classes with a profile curvature of –0.02 to 1.26 (0.0450) and 1.26 to 11.43 (0.0851). In the case of the SPI, compared with the other classes, the class of 0 to 30 (0.0801) had a more positive effect on landslide occurrence. In the case of STI, the class of >30 had the only negative SI value (–0.2712). In the case of the TWI, the intervals of 5–7 (0.0952) and 7–9 (0.1415) could be interpreted as promoting conditions. With regard to the distance to faults, the probability of landslide occurrence decreased with the increasing distance to faults, and the highest SI value of 0.2902 was for the class of 0–1000 m. For the distance to rivers, the only positive SI value of 0.2964 belonged to the class <200 m. For the distance to roads, landslides mainly spread in areas of where the distance to roads was within 500 m. Both the highest NDVI and the lowest NDVI had a positive impact on landslide occurrence. In the case of lithology, the SI values of the harder metamorphic rocks, softer metamorphic rocks, hard carbonate rocks, hard intrusive rocks and soft gravelly soils were –0.5121, 0.6650, –0.4742, –0.7160, and 0.3584, respectively.
Finally, the landslide susceptibility indexes (LSI) were calculated using the SI values and Equation (11). The corresponding landslide susceptibility map (LSM) (Figure 3) was generated using ArcGIS software. It is clear that the probability of landslide occurrence rises with the enlargement of the LSI. In the present study, the natural break method, which seeks to reduce the variance within classes and maximize the variance between classes [90], was used to the reclassify the LSI values into five categories, namely very low, low, moderate, high and very high.
LSI SI = Slope   angle SI + Slope   aspect SI + Elevation SI + Plan   curvature SI + Profile   curvature SI + SPI SI + STI SI + TWI SI + Distance   to   faults SI + Distance   to   rivers SI + Distance   to   roads SI + NDVI SI + Lithology SI

5.3. Application of the IOE Model

From Table 2, we acquired the Wj values of various conditioning factors. Wj is an index to measure the importance of factors. Thus, it can be seen that the most critical factor was altitude (Wj = 0.1923), followed by distance to faults (Wj = 0.1068), lithology (Wj = 0.0954), slope aspect (Wj = 0.0560), distance to roads (Wj = 0.0546), distance to rivers (Wj = 0.0511), slope angle (Wj = 0.0202), NDVI (Wj = 0.0145), TWI (Wj = 0.0090), STI (Wj = 0.0063), profile curvature (Wj = 0.0053), SPI (Wj = 0.0022), and plan curvature (Wj = 0.0006). It should be explained that the above ranking only applies to Shangnan County. The relative importance of conditioning factors usually varies for different study areas [91]. To produce a landslide susceptibility map using the LSI, the landslide occurrence probability values were calculated using Equation (12). Similarly, the produced landslide susceptibility map was further classified into five classes based on the natural break method, including very low, low, moderate, high, and very high (Figure 4).
LSI IOE = Slope   angle × 0.0202 + Slope   aspect × 0.0560 + Elevation × 0.1923 + Plan   curvature × 0.0006 + Profile   curvature × 0.0053 + SPI × 0.0022 + STI × 0.0063 + TWI × 0.0090   + Distance   to   roads × 0.0546 + NDVI × 0.0145 + Lithology × 0.0954

5.4. Application of the WOE Model

In Table 2, the weight contrast values are noted as C, which indicate the landslide susceptibility of various classes of conditioning factors. In terms of the slope angle, landslides are more likely to occur in areas with a slope angle of 0–10° (0.273) and 20–30° (0.029). For slope aspect, east (0.244) had the highest probability of triggering landslides, which is in line with the conclusion of the SI model. For altitude, most landslides are more prone to occurring at altitudes <800 m. For curvature, the results showed that the highest contrast value (0.130) was for plan curvatures between –0.11 and 0.88, while profile curvatures from –0.02 to 1.26 (0.108) were most prone to landslides. For the SPI, the WOE results were the same as the SI results, and the class 0–30 had the highest contrast value of 0.264. For the STI, class of 10–20 had the only positive value (0.177), which indicates that areas with STI values of 10–20 had a positive effect on landslide occurrence. For TWI, the highest contrast value (0.235) was found for class <5. In the case of distance to faults, distance to rivers and distance to roads, the highest contrast values belonged to the class <1000 m for distance to faults, the class <200 m for distance to rivers, and the class <500 m for distance to roads. For the NDVI, it was found that the range 0.17–0.33 was the only class for which the contrast value was larger than zero. In the case of lithology, hard carbonate rocks and hard intrusive rocks were identified as promoting landslides, this result did not coincide with the results of the SI model.
Finally, based on the results of the WOE model, the LSI values for the study area were calculated using Equation (13). The natural break method was introduced to reclassify landslide susceptibility into five classes: very low, low, moderate, high, and very high (Figure 5):
LSI WOE = Slope   angle C / S ( C ) + Slope   aspec C / S ( C ) + Elevation C / S ( C ) + Plan   curvature C / S ( C ) + Profile   curvature C / S ( C ) + SPI C / S ( C ) + STI C / S ( C ) + TWI C / S ( C ) + Distance   to   faults C / S ( C ) + Distance   to   rivers C / S ( C ) + Distance   to   roads C / S ( C )   + NDVI C / S ( C ) + Lithology C / S ( C )

5.5. Validation and Comparison of the Models

It is absolutely necessary to quantitatively measure the accuracy of the landslide susceptibility maps produced by the various classification models [92]. To assess the performance of the three landslide susceptibility mapping models described above, the corresponding area under the curve (AUC) curves for the training dataset and testing dataset were obtained. The receiver operating characteristics (ROC) curve and the AUC are two common indices used in the validation and comparison of different landslide susceptibility models [33,34,51,80,93]. In the present study, the AUC method, which was plotted using the cumulative area percentages as the horizontal axis and the cumulative percentage of landslides as the longitudinal axis [17,19,94], was used to compare the performance of the three models. Generally, the model with the highest AUC value was considered to show the best landslide susceptibility mapping performance.
In the case of the training dataset, the AUC values for the SI, IOE, and WOE models were 0.7467, 0.7112, and 0.7650, respectively, and the corresponding accuracy rates were 74.67%, 71.12%, and 76.50% (Figure 6a). It was clear that the landslide susceptibility map generated with the WOE model was more in line with actual situations. The performance of the SI model was second only to the WOE model. Compared with the other models, the accuracy of the IoE model was relatively low.
In the case of the testing dataset, the prediction accuracy values for the SI, IOE and WOE models were 73.75%, 63.89% and 75.10%, respectively (Figure 6b). The results showed that WOE had the best prediction capacity, followed by the SI model and the IOE model. In addition, the AUC values of the testing dataset were lower than those of the training dataset. When using the IOE model, the AUC value calculated with the testing dataset decreased by 0.0723 compared to the results found using the training dataset. Therefore, it could be concluded that the landslide susceptibility maps produced by the SI and WOE models both had good spatial effectiveness for the study area, and that the IoE model was not very suitable for landslide susceptibility mapping in Shangnan County.

6. Conclusions

In recent years, understanding of the serious effects of landslides on people’s life and property has increased. Thus, it is necessary to promote landslide susceptibility assessment in landslide hazard zones. Classical probability models and novel machine learning algorithms should be introduced to landslide susceptibility modeling with the aim of acquiring better prediction accuracy.
In this paper, the SI, IOE, and WOE models were employed to assess landslide susceptibility in Shangnan County, and the performance of the three models was compared. According to their relevance and suitability, thirteen conditioning factors were selected for modeling. The landslide data were then classified into two groups, namely a training dataset (70% of the landslides) and a testing dataset (30% of the landslides). The importance of the conditioning factors was also evaluated using the CVA method with AM values. The results showed that all the thirteen conditioning factors had a positive effect on landslide occurrence. The AUC plots generated with training dataset demonstrated that the WOE model (AUC = 0.7605) had the highest accuracy of landslide susceptibility mapping, followed by the SI model (AUC = 0.7467) and IOE model (AUC = 0.7112). Similarly, the prediction capacity of the three models was measured using AUC plots generated from the testing dataset. The results indicated that the WOE model had the best performance in landslide susceptibility prediction.
The landslide susceptibility map produced by the WOE model can be meaningful for landslide hazard prevention and control in Shangnan County and other mountainous areas with similar features. The landslide susceptibility maps can also be used as a basis for future landslide risk assessment studies of the study area and other areas with similar geo-environmental characteristics. The model can also be applied in other areas to expand its use.

Author Contributions

J.L. and Z.D. collected the field data. J.L. and Z.D. wrote the manuscript. All the authors discussed the results and edited the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant No. 41702298); the Natural Science Basic Research Plan in Shaanxi Province of China (program No. 2017JQ4020), and the Scientific Research Program funded by the Shaanxi Provincial Education Department (program No. 17JK0515).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ambrosi, C.; Strozzi, T.; Scapozza, C.; Wegmüller, U. Landslide hazard assessment in the Himalayas (Nepal and Bhutan) based on Earth-Observation data. Eng. Geol. 2018, 237, 217–228. [Google Scholar] [CrossRef]
  2. Palmisano, F.; Vitone, C.; Cotecchia, F. Methodology for landslide damage assessment. Procedia Eng. 2016, 161, 511–515. [Google Scholar] [CrossRef]
  3. Zhou, C.; Yin, K.; Cao, Y.; Ahmed, B.; Li, Y.; Catani, F.; Pourghasemi, H.R. Landslide susceptibility modeling applying machine learning methods: A case study from Longju in the Three Gorges Reservoir area, China. Comput. Geosci. 2018, 112, 23–37. [Google Scholar] [CrossRef]
  4. Samodra, G.; Chen, G.; Sartohadi, J.; Kasama, K. Generating landslide inventory by participatory mapping: An example in Purwosari Area, Yogyakarta, Java. Geomorphology 2018, 306, 306–313. [Google Scholar] [CrossRef]
  5. Zhuang, J.; Peng, J.; Wang, G.; Javed, I.; Wang, Y.; Li, W. Distribution and characteristics of landslide in loess plateau: A case study in Shaanxi Province. Eng. Geol. 2018, 236, 89–96. [Google Scholar] [CrossRef]
  6. Chen, W.; Peng, J.; Hong, H.; Shahabi, H.; Pradhan, B.; Liu, J.; Zhu, A.-X.; Pei, X.; Duan, Z. Landslide susceptibility modelling using GIS-based machine learning techniques for Chongren county, Jiangxi Province, China. Sci. Total Environ. 2018, 626, 1121–1135. [Google Scholar] [CrossRef] [PubMed]
  7. Ciampalini, A.; Raspini, F.; Lagomarsino, D.; Catani, F.; Casagli, N. Landslide susceptibility map refinement using PSInSAR data. Remote Sens. Environ. 2016, 184, 302–315. [Google Scholar] [CrossRef] [Green Version]
  8. Huang, F.; Yin, K.; Huang, J.; Gui, L.; Wang, P. Landslide susceptibility mapping based on self-organizing-map network and extreme learning machine. Eng. Geol. 2017, 223, 11–22. [Google Scholar] [CrossRef]
  9. Lin, G.F.; Chang, M.J.; Huang, Y.C.; Ho, J.Y. Assessment of susceptibility to rainfall-induced landslides using improved self-organizing linear output map, support vector machine, and logistic regression. Eng. Geol. 2017, 224, 62–74. [Google Scholar] [CrossRef]
  10. Zêzere, J.L.; Pereira, S.; Melo, R.; Oliveira, S.C.; Garcia, R.A. Mapping landslide susceptibility using data-driven methods. Sci. Total Environ. 2017, 589, 250–267. [Google Scholar] [CrossRef] [PubMed]
  11. Goetz, J.N.; Brenning, A.; Petschko, H.; Leopold, P. Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput. Geosci. 2015, 81, 1–11. [Google Scholar] [CrossRef]
  12. Chen, W.; Pourghasemi, H.R.; Naghibi, S.A. Prioritization of landslide conditioning factors and its spatial modeling in Shangnan County, China using GIS-based data mining algorithms. Bull. Eng. Geol. Environ. 2018, 77, 611–629. [Google Scholar] [CrossRef]
  13. Mandal, S.; Mandal, K. Bivariate statistical index for landslide susceptibility mapping in the Rorachu river basin of eastern Sikkim Himalaya, India. Spat. Inf. Res. 2018, 26, 59–75. [Google Scholar] [CrossRef]
  14. Regmi, A.D.; Yoshida, K.; Pourghasemi, H.R.; DhitaL, M.R.; Pradhan, B. Landslide susceptibility mapping along Bhalubang–Shiwapur area of mid-Western Nepal using frequency ratio and conditional probability models. J. Mt. Sci. 2014, 11, 1266–1285. [Google Scholar] [CrossRef]
  15. Nicu, I.C. Application of analytic hierarchy process, frequency ratio, and statistical index to landslide susceptibility: An approach to endangered cultural heritage. Environ. Earth Sci. 2018, 77, 79. [Google Scholar] [CrossRef]
  16. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  17. Althuwaynee, O.F.; Pradhan, B.; Park, H.-J.; Lee, J.H. A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. Catena 2014, 114, 21–36. [Google Scholar] [CrossRef]
  18. Hong, H.; Kornejady, A.; Soltani, A.; Termeh, S.V.R.; Liu, J.; Zhu, A.X.; Hesar, A.Y.; Ahmad, B.B.; Wang, Y.C. Landslide susceptibility assessment in the Anfu County, China: comparing different statistical and probabilistic models considering the new topo-hydrological factor (HAND). Earth Sci. Inf. 2018, 1–18. [Google Scholar] [CrossRef]
  19. Chen, W.; Shahabi, H.; Shirzadi, A.; Li, T.; Guo, C.; Hong, H.; Li, W.; Pan, D.; Hui, J.; Ma, M.; et al. A novel ensemble approach of bivariate statistical based logistic model tree classifier for landslide susceptibility assessment. Geocarto Int. 2018, 1–32. [Google Scholar] [CrossRef]
  20. Razavizadeh, S.; Solaimani, K.; Massironi, M.; Kavian, A. Mapping landslide susceptibility with frequency ratio, statistical index, and weights of evidence models: A case study in Northern Iran. Environ. Earth Sci. 2017, 76, 499. [Google Scholar] [CrossRef]
  21. Ding, Q.; Chen, W.; Hong, H. Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int. 2017, 32, 619–639. [Google Scholar] [CrossRef]
  22. Wang, L.-J.; Guo, M.; Sawada, K.; Lin, J.; Zhang, J. A comparative study of landslide susceptibility maps using logistic regression, frequency ratio, decision tree, weights of evidence and artificial neural network. Geosci. J. 2016, 20, 117–136. [Google Scholar] [CrossRef]
  23. Tahmassebipoor, N.; Rahmati, O.; Noormohamadi, F.; Lee, S. Spatial analysis of groundwater potential using weights-of-evidence and evidential belief function models and remote sensing. Arab. J. Geosci. 2016, 9, 1–18. [Google Scholar] [CrossRef]
  24. Bui, T.D.; Shahabi, H.; Shirzadi, A.; Chapi, K.; Alizadeh, M.; Chen, W.; Mohammadi, A.; Ahmad, B.; Panahi, M.; Hong, H.; et al. Landslide detection and susceptibility mapping by AIRSAR data using support vector machine and index of entropy models in Cameron Highlands, Malaysia. Remote Sens. 2018, 10, 1527. [Google Scholar]
  25. Chen, W.; Pourghasemi, H.R.; Naghibi, S.A. A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China. Bull. Eng. Geol. Environ. 2018, 77, 647–664. [Google Scholar] [CrossRef]
  26. Tsangaratos, P.; Ilia, I.; Hong, H.; Chen, W.; Xu, C. Applying information theory and GIS-based quantitative methods to produce landslide susceptibility maps in Nancheng County, China. Landslides 2017, 14, 1091–1111. [Google Scholar] [CrossRef]
  27. Youssef, A.M.; Al-Kathery, M.; Pradhan, B. Landslide susceptibility mapping at Al-Hasher area, Jizan (Saudi Arabia) using GIS-based frequency ratio and index of entropy models. Geosci. J. 2015, 19, 113–134. [Google Scholar] [CrossRef]
  28. Devkota, K.C.; Regmi, A.D.; Pourghasemi, H.R.; Yoshida, K.; Pradhan, B.; Ryu, I.C.; Dhital, M.R.; Althuwaynee, O.F. Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling–Narayanghat road section in Nepal Himalaya. Nat. Hazards 2013, 65, 135–165. [Google Scholar] [CrossRef] [Green Version]
  29. Trigila, A.; Iadanza, C.; Esposito, C.; Scarascia-Mugnozza, G. Comparison of logistic regression and random forests techniques for shallow landslide susceptibility assessment in Giampilieri (Ne Sicily, Italy). Geomorphology 2015, 249, 119–136. [Google Scholar] [CrossRef]
  30. Tsangaratos, P.; Ilia, I. Comparison of a logistic regression and naïve bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena 2016, 145, 164–179. [Google Scholar] [CrossRef]
  31. Chen, W.; Yan, X.; Zhao, Z.; Hong, H.; Bui, D.T.; Pradhan, B. Spatial prediction of landslide susceptibility using data mining-based kernel logistic regression, naive Bayes and RBFNetwork models for the Long County area (China). Bull. Eng. Geol. Environ. 2018, 1–20. [Google Scholar] [CrossRef]
  32. Pham, B.T.; Bui, T.D.; Pourghasemi, H.R.; Indra, P.; Dholakia, M. Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: A comparison study of prediction capability of naïve Bayes, multilayer perceptron neural networks, and functional trees methods. Theor. Appl. Climatol. 2017, 128, 255–273. [Google Scholar] [CrossRef]
  33. Chen, W.; Zhang, S.; Li, R.; Shahabi, H. Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling. Sci. Total Environ. 2018, 644, 1006–1018. [Google Scholar] [CrossRef]
  34. Chen, W.; Xie, X.; Peng, J.; Wang, J.; Duan, Z.; Hong, H. GIS-based landslide susceptibility modelling: A comparative assessment of kernel logistic regression, naïve-Bayes tree, and alternating decision tree models. Geomat. Nat. Hazards Risk 2017, 8, 950–973. [Google Scholar] [CrossRef]
  35. Pham, B.T.; Bui, T.D.; Prakash, I.; Dholakia, M.B. Evaluation of predictive ability of support vector machines and naive Bayes trees methods for spatial prediction of landslides in Uttarakhand state (India) using GIS. J. Geomat. 2016, 10, 71–79. [Google Scholar]
  36. Polykretis, C.; Chalkias, C. Comparison and evaluation of landslide susceptibility maps obtained from weight of evidence, logistic regression, and artificial neural network models. Nat. Hazards 2018, 36, 1–26. [Google Scholar] [CrossRef]
  37. Zhu, A.X.; Miao, Y.; Wang, R.; Zhu, T.; Deng, Y.; Liu, J.; Yang, L.; Qin, C.Z.; Hong, H. A comparative study of an expert knowledge-based model and two data-driven models for landslide susceptibility mapping. Catena 2018, 166, 317–327. [Google Scholar] [CrossRef]
  38. Bui, T.D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar]
  39. Hong, H.; Pradhan, B.; Xu, C.; Bui, T.D. Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena 2015, 133, 266–281. [Google Scholar] [CrossRef]
  40. Yao, X.; Tham, L.G.; Dai, F.C. Landslide susceptibility mapping based on support vector machine: A case study on natural slopes of Hong Kong, China. Geomorphology 2008, 101, 572–582. [Google Scholar] [CrossRef]
  41. Huang, Y.; Zhao, L. Review on landslide susceptibility mapping using support vector machines. Catena 2018, 165, 520–529. [Google Scholar] [CrossRef]
  42. Zhang, K.; Wu, X.; Niu, R.; Yang, K.; Zhao, L. The assessment of landslide susceptibility mapping using random forest and decision tree methods in the Three Gorges Reservoir area, China. Environ. Earth Sci. 2017, 76, 405. [Google Scholar] [CrossRef]
  43. Catani, F.; Lagomarsino, D.; Segoni, S.; Tofani, V. Landslide susceptibility estimation by random forests technique: Sensitivity and scaling issues. Nat. Hazards Earth Syst. Sci. 2013, 13, 2815–2831. [Google Scholar] [CrossRef]
  44. Pourghasemi, H.R.; Rossi, M. Landslide susceptibility modeling in a landslide prone area in Mazandarn Province, north of Iran: A comparison between GLM, GAM, MARS, and M-AHP methods. Theor. Appl. Climatol. 2017, 130, 609–633. [Google Scholar] [CrossRef]
  45. Conoscenti, C.; Ciaccio, M.; Caraballo-Arias, N.A.; Gómez-Gutiérrez, Á.; Rotigliano, E.; Agnesi, V. Assessment of susceptibility to earth-flow landslide using logistic regression and multivariate adaptive regression splines: A case of the Belice River basin (western Sicily, Italy). Geomorphology 2015, 242, 49–64. [Google Scholar] [CrossRef]
  46. Chen, W.; Panahi, M.; Pourghasemi, H.R. Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution (DE), and particle swarm optimization (PSO) for landslide spatial modelling. Catena 2017, 157, 310–324. [Google Scholar] [CrossRef]
  47. Pham, B.T.; Bui, T.D.; Prakash, I. Landslide susceptibility assessment using bagging ensemble based alternating decision trees, logistic regression and j48 decision trees methods: A comparative study. Geotech. Geol. Eng. 2017, 35, 2597–2611. [Google Scholar] [CrossRef]
  48. Hong, H.; Liu, J.; Bui, D.T.; Pradhan, B.; Acharya, T.D.; Pham, B.T.; Zhu, A.X.; Chen, W.; Ahmad, B.B. Landslide susceptibility mapping using j48 decision tree with adaboost, bagging and rotation forest ensembles in the Guangchang area (China). Catena 2018, 163, 399–413. [Google Scholar] [CrossRef]
  49. Bui, T.D.; Ho, T.-C.; Pradhan, B.; Pham, B.-T.; Nhu, V.-H.; Revhaug, I. GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with adaboost, bagging, and multiboost ensemble frameworks. Environ. Earth Sci. 2016, 75, 1–22. [Google Scholar]
  50. Aghdam, N.I.; Varzandeh, M.H.M.; Pradhan, B. Landslide susceptibility mapping using an ensemble statistical index (Wi) and adaptive neuro-fuzzy inference system (ANFIS) model at Alborz Mountains (Iran). Environ. Earth Sci. 2016, 75, 1–20. [Google Scholar] [CrossRef]
  51. Chen, W.; Panahi, M.; Tsangaratos, P.; Shahabi, H.; Ilia, I.; Panahi, S.; Li, S.; Jaafari, A.; Ahmad, B.B. Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility. Catena 2019, 172, 212–231. [Google Scholar] [CrossRef]
  52. Kornejady, A.; Ownegh, M.; Bahremand, A. Landslide susceptibility assessment using maximum entropy model with two different data sampling methods. Catena 2017, 152, 144–162. [Google Scholar] [CrossRef]
  53. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  54. Guzzetti, F.; Mondini, A.C.; Cardinali, M.; Fiorucci, F.; Santangelo, M.; Chang, K.-T. Landslide inventory maps: New tools for an old problem. Earth Sci. Rev. 2012, 112, 42–66. [Google Scholar] [CrossRef]
  55. Hungr, O.; Leroueil, S.; Picarelli, L. The varnes classification of landslide types, an update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
  56. Hong, H.; Ilia, I.; Tsangaratos, P.; Chen, W.; Xu, C. A hybrid fuzzy weight of evidence method in landslide susceptibility analysis on the Wuyuan area, China. Geomorphology 2017, 290, 1–16. [Google Scholar] [CrossRef]
  57. Pham, B.T.; Pradhan, B.; Bui, T.D.; Prakash, I.; Dholakia, M.B. A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India). Environ. Model. Softw. 2016, 84, 240–250. [Google Scholar] [CrossRef]
  58. Ramakrishnan, D.; Singh, T.N.; Verma, A.K.; Gulati, A.; Tiwari, K.C. Soft computing and GIS for landslide susceptibility assessment in Tawaghat area, Kumaon Himalaya, India. Nat. Hazards 2013, 65, 315–330. [Google Scholar] [CrossRef]
  59. Caiyan, W.; Jianping, Q.; Meng, W. Landslides and slope aspect in the Three Gorges Reservoir area based on GIS and information value model. Wuhan Univ. J. Nat. Sci. 2006, 11, 773–779. [Google Scholar] [CrossRef]
  60. Paranunzio, R.; Laio, F.; Nigrelli, G.; Chiarle, M. A method to reveal climatic variables triggering slope failures at high elevation. Nat. Hazards 2015, 76, 1039–1061. [Google Scholar] [CrossRef]
  61. Chen, W.; Shahabi, H.; Shirzadi, A.; Hong, H.Y.; Akgun, A.; Tian, Y.Y.; Liu, J.Z.; Zhu, A.X.; Li, S.J. Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 2018, 1–23. [Google Scholar] [CrossRef]
  62. Pourghasemi, H.R.; Pradhan, B.; Gokceoglu, C. Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz Watershed, Iran. Nat. Hazards 2012, 63, 965–996. [Google Scholar] [CrossRef]
  63. Meten, M.; Prakashbhandary, N.; Yatabe, R. Effect of landslide factor combinations on the prediction accuracy of landslide susceptibility maps in the Blue Nile Gorge of Central Ethiopia. Geoenviron. Disasters 2015, 2, 9. [Google Scholar] [CrossRef]
  64. Ohlmacher, G.C. Plan curvature and landslide probability in regions dominated by earth flows and earth slides. Eng. Geol. 2007, 91, 117–134. [Google Scholar] [CrossRef]
  65. Chen, C.-Y.; Chang, J.-M. Landslide dam formation susceptibility analysis based on geomorphic features. Landslides 2016, 13, 1019–1033. [Google Scholar] [CrossRef]
  66. Kumar, R.; Anbalagan, R. Landslide susceptibility mapping using analytical hierarchy process (AHP) in Tehri reservoir rim region, Uttarakhand. J. Geol. Soc. India 2016, 87, 271–286. [Google Scholar] [CrossRef]
  67. Khorsandi, A.; Ghoreishi, S.H. Studying the interaction between active faults and landslide phenomenon: Case study of landslide in Latian, northeast Tehran, Iran. Geotech. Geol. Eng. 2013, 31, 617–625. [Google Scholar] [CrossRef]
  68. Li, X.-G.; Wang, A.-M.; Wang, Z.-M. Stability analysis and monitoring study of Jijia river landslide based on WeBGIS. J. Coal Sci. Eng. China 2010, 16, 41–46. [Google Scholar] [CrossRef]
  69. Ayalew, L.; Yamagishi, H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology 2005, 65, 15–31. [Google Scholar] [CrossRef]
  70. Gheshlaghi, H.A.; Feizizadeh, B. An integrated approach of analytical network process and fuzzy based spatial decision making systems applied to landslide risk mapping. J. Afr. Earth Sci. 2017, 133, 15–24. [Google Scholar] [CrossRef]
  71. Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Bui, T.D.; Duan, Z.; Ma, J. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. Catena 2017, 151, 147–160. [Google Scholar] [CrossRef]
  72. Dai, F.C.; Lee, C.F.; Li, J.; Xu, Z.W. Assessment of landslide susceptibility on the natural terrain of Lantau Island, Hong Kong. Environ. Geol. 2001, 40, 381–391. [Google Scholar]
  73. Koukis, G.; Pyrgiotis, L.; Kouki, A. Landslide phenomena in greece: Types of movement related to the lithology and structure of the geological formations. In Engineering Geology for Society and Territory: Landslide Processes; Lollino, G., Giordan, D., Crosta, G.B., Corominas, J., Azzam, R., Wasowski, J., Sciarra, N., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 2, pp. 1023–1027. [Google Scholar]
  74. Westen, C.J.V.; Rengers, N.; Terlien, M.T.J.; Soeters, R. Prediction of the occurrence of slope instability phenomenal through GIS-based hazard zonation. Geol. Rundsch. 1997, 86, 404–414. [Google Scholar] [CrossRef]
  75. Kayastha, P.; Dhital, M.R.; Smedt, F.D. Evaluation and comparison of GIS based landslide susceptibility mapping procedures in Kulekhani Watershed, Nepal. J. Geol. Soc. India 2013, 81, 219–231. [Google Scholar] [CrossRef]
  76. Regmi, A.D.; Devkota, K.C.; Yoshida, K.; Pradhan, B.; Pourghasemi, H.R.; Kumamoto, T.; Akgun, A. Application of frequency ratio, statistical index, and weights-of-evidence models and their comparison in landslide susceptibility mapping in Central Nepal Himalaya. Arab. J. Geosci. 2014, 7, 725–742. [Google Scholar] [CrossRef]
  77. Shi, Y.; Jin, F. Landslide stability analysis based on generalized information entropy. In Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, Wuhan, China, 4–5 July 2009; Volume 3, pp. 83–85. [Google Scholar]
  78. Bonham-Carter, G.E.; Cox, S. Geographic Information Systems for Geoscientists: Modelling with GIS; Elsevier: Amsterdam, The Netherlands, 2010; pp. 1–2. [Google Scholar]
  79. Chen, W.; Li, H.; Hou, E.; Wang, S.; Wang, G.; Panahi, M.; Li, T.; Peng, T.; Guo, C.; Niu, C.; et al. GIS-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. Sci. Total Environ. 2018, 634, 853–867. [Google Scholar] [CrossRef] [PubMed]
  80. Pham, B.T.; Prakash, I.; Bui, D.T. Spatial prediction of landslides using a hybrid machine learning approach based on random subspace and classification and regression trees. Geomorphology 2018, 303, 256–270. [Google Scholar] [CrossRef]
  81. Chen, W.; Xie, X.; Peng, J.; Shahabi, H.; Hong, H.; Bui, T.D.; Duan, Z.; Li, S.; Zhu, A.-X. GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. Catena 2018, 164, 135–149. [Google Scholar] [CrossRef]
  82. Witten, I.H.; Frank, E.; Mark, A.H. Data Mining: Practical Machine Learning Tools and Techniques, 3rd ed.; Morgan Kaufmann: Burlington, NJ, USA, 2011. [Google Scholar]
  83. Song, Y.; Gong, J.; Gao, S.; Wang, D.; Cui, T.; Li, Y.; Wei, B. Susceptibility assessment of earthquake-induced landslides using Bayesian network: A case study in Beichuan, China. Comput. Geosci. 2012, 42, 189–199. [Google Scholar] [CrossRef]
  84. Kohavi, R. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QC, Canada, 20–25 August 1995; pp. 1137–1143. [Google Scholar]
  85. Demir, G. Landslide susceptibility mapping by using statistical analysis in the North Anatolian Fault Zone (NAFZ) on the northern part of Suşehri Town, Turkey. Nat. Hazards 2018, 92, 133–154. [Google Scholar] [CrossRef]
  86. Lee, S.; Talib, J.A. Probabilistic landslide susceptibility and factor effect analysis. Environ. Geol. 2005, 47, 982–990. [Google Scholar] [CrossRef]
  87. Xie, Z.; Chen, G.; Meng, X.; Zhang, Y.; Qiao, L.; Tan, L. A comparative study of landslide susceptibility mapping using weight of evidence, logistic regression and support vector machine and evaluated by SBAS-InSAR monitoring: Zhouqu to Wudu segment in Bailong River Basin, China. Environ. Earth Sci. 2017, 76, 313. [Google Scholar] [CrossRef]
  88. Chen, W.; Pourghasemi, H.R.; Panahi, M.; Kornejady, A.; Wang, J.; Xie, X.; Cao, S. Spatial prediction of landslide susceptibility using an adaptive neuro-fuzzy inference system combined with frequency ratio, generalized additive model, and support vector machine techniques. Geomorphology 2017, 297, 69–85. [Google Scholar] [CrossRef]
  89. Pourghasemi, H.R.; Moradi, H.R.; Aghda, S.M.F. Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances. Nat. Hazards 2013, 69, 749–779. [Google Scholar] [CrossRef]
  90. Jenks, G.F. The data model concept in statistical mapping. Int. Yearb. Cartogr. 1967, 7, 186–190. [Google Scholar]
  91. Bijukchhen, S.M.; Kayastha, P.; Dhital, M.R. A comparative evaluation of heuristic and bivariate statistical modelling for landslide susceptibility mappings in Ghurmi–Dhad Khola, East Nepal. Arab. J. Geosci. 2013, 6, 2727–2743. [Google Scholar] [CrossRef]
  92. Chung, C.; Fabbri, A. Three Bayesian prediction models for landslide hazard. In Proceedings of the International Association for Mathematical Geology 1998 Annual Meeting (IAMG’98), Ischia, Italy, 4–9 October 1998; pp. 204–211. [Google Scholar]
  93. Chen, W.; Shirzadi, A.; Shahabi, H.; Ahmad, B.B.; Zhang, S.; Hong, H.; Zhang, N. A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naïve Bayes tree classifiers for a landslide susceptibility assessment in Langao County, China. Geomat. Nat. Hazards Risk 2017, 8, 1955–1977. [Google Scholar] [CrossRef] [Green Version]
  94. Solaimani, K.; Mousavi, S.Z.; Kavian, A. Landslide susceptibility mapping based on frequency ratio and logistic regression models. Arab. J. Geosci. 2013, 6, 2557–2569. [Google Scholar] [CrossRef]
Figure 1. Location map of the study area.
Figure 1. Location map of the study area.
Entropy 20 00868 g001
Figure 2. Landslide conditioning factors: (a) slope angle; (b) slope aspect; (c) elevation; (d) plan curvature; (e) profile curvature; (f) stream power index (SPI); (g) sediment transport index (STI); (h) topographic wetness index (TWI); (i) distance to faults; (j) distance to rivers; (k) distance to roads; (l) normalized difference vegetation index (NDVI); (m) Lithology.
Figure 2. Landslide conditioning factors: (a) slope angle; (b) slope aspect; (c) elevation; (d) plan curvature; (e) profile curvature; (f) stream power index (SPI); (g) sediment transport index (STI); (h) topographic wetness index (TWI); (i) distance to faults; (j) distance to rivers; (k) distance to roads; (l) normalized difference vegetation index (NDVI); (m) Lithology.
Entropy 20 00868 g002aEntropy 20 00868 g002bEntropy 20 00868 g002cEntropy 20 00868 g002d
Figure 3. Landslide susceptibility map produced using the statistical index (SI) model.
Figure 3. Landslide susceptibility map produced using the statistical index (SI) model.
Entropy 20 00868 g003
Figure 4. Landslide susceptibility map produced using the index of entropy (IOE) model.
Figure 4. Landslide susceptibility map produced using the index of entropy (IOE) model.
Entropy 20 00868 g004
Figure 5. Landslide susceptibility map produced using the weights of evidence (WOE) model.
Figure 5. Landslide susceptibility map produced using the weights of evidence (WOE) model.
Entropy 20 00868 g005
Figure 6. Area under the curve (AUC) curves of the models using (a) training data and (b) testing data.
Figure 6. Area under the curve (AUC) curves of the models using (a) training data and (b) testing data.
Entropy 20 00868 g006
Table 1. Importance of conditioning factors based on the coefficient of variation attribute (CVA) method.
Table 1. Importance of conditioning factors based on the coefficient of variation attribute (CVA) method.
Landslide Conditioning FactorsAverage Merit (AM)Standard Deviation (SD)
Distance to roads0.304±0.021
Elevation0.296±0.027
Distance to rivers0.260±0.016
Lithology0.156±0.021
Distance to faults0.155±0.021
TWI 10.083±0.029
Slope angle0.069±0.027
NDVI 20.060±0.026
Plan curvature0.056±0.026
SPI 30.035±0.015
Slope aspect0.032±0.009
STI 40.032±0.022
Profile curvature0.025±0.027
1 Topographic wetness index (TWI); 2 Normalized difference vegetation index (NDVI); 3 Stream power index (SPI); 4 Sediment transport index (STI).
Table 2. Correlation between landslides and conditioning factors using the statistical index (SI), index of entropy (IOE), and weights of evidence (WOE) models.
Table 2. Correlation between landslides and conditioning factors using the statistical index (SI), index of entropy (IOE), and weights of evidence (WOE) models.
Conditioning FactorsClassesNo. of PixelsNo. of LandslideSIWjC
Slope angle (°)0–10347,597400.20230.02020.273
10–20778,722870.1728-–0.001
20–30833,30767–0.1562-0.029
30–40489,01943–0.0667-–0.234
40–50135,0216–0.7492-–0.155
50–6512,2011–0.1370-–0.143
Slope aspectFlat16700.0000.05600.000
North300,99427–0.0467-0.028
Northeast318,751320.0658-0.213
East345,947440.3024-0.244
Southeast349,81622–0.4019-–0.309
South312,150340.1474-0.170
Southwest333,79924–0.2680-–0.503
West320,387310.0290-0.104
Northwest313,856300.0168-–0.101
Altitude (m)<500345,079420.25840.19230.331
500–8001,167,2401610.3835-0.842
800–1100688,63332–0.7045-–0.837
1100–1400350,3089–1.2971-–1.656
1400–170038,95500.000-–1.287
>2050565200.000-0.000
Plan curvature–9.57 to –1.09265,05423–0.07990.0006–0.043
–1.09 to –0.11872,136830.0124-–0.036
–0.11 to 0.881,046,5451010.0264-0.130
0.88–11.42412,13237-0.0459-–0.155
Profile curvature–11.93 to –1.30285,20721–0.24420.0053–0.216
–1.30 to –0.02950,38188–0.0150-–0.060
–0.02 to –1.261,047,6121030.0450-0.108
1.26–11.43312,667320.0851-0.064
SPI0–301,256,9991280.08010.00220.264
30–60499,68244–0.0653-–0.355
60–90243,080230.0066-0.053
>90596,10649–0.1341-–0.123
STI0–10963,339990.08920.0063–0.137
10–20723,470700.0289-0.177
20–30392,792380.0288-–0.059
>30516,26637–0.2712-–0.149
TWI<51,095,48391–0.12360.00900.235
5–71,131,6521170.0952-–0.171
7–9258,581280.1415-0.031
>9110,1518–0.2579-–0.180
Distance to faults (m)0–10001,353,0651700.29020.10680.739
1000–2000596,47355–0.0192-–0.025
2000–3000219,38913–0.4614-–0.486
3000–400099,4972–1.5425-–1.564
>4000327,4204–2.0405-–2.151
Distance to rivers (m)<2001,099,4251390.29640.05110.530
200–400770,61372–0.0060-0.065
400–600405,04819–0.6951-–0.781
600–800163,8146–0.9425-–1.160
>800156,9448–0.6119--–0.606
Distance to roads (m)<500696,1091130.54640.05460.850
500–1000547,84942–0.2038-–0.228
1000–1500439,07239–0.0566-–0.102
1500–2000334,49517–0.6149-–0.681
>2000578,31933–0.4991-–0.590
NDVI–0.23 to 0.1764,49680.27730.0145–0.439
0.17–0.33232,43217–0.2509-0.107
0.33–0.42713,85566–0.0165-–0.081
0.42–0.51965,45078–0.1514-–0.028
0.51–0.71619,630750.2529-0.106
LithologyHarder metamorphic rocks514,86029–0.51210.0954–0.824
Softer metamorphic rocks771,4471410.6650-–0.606
Hard carbonate rocks734,97543–0.4742-0.361
Hard intrusive rocks522,46424–0.7160-1.159
Soft gravelly soils52,03870.3584-–0.607

Share and Cite

MDPI and ACS Style

Liu, J.; Duan, Z. Quantitative Assessment of Landslide Susceptibility Comparing Statistical Index, Index of Entropy, and Weights of Evidence in the Shangnan Area, China. Entropy 2018, 20, 868. https://doi.org/10.3390/e20110868

AMA Style

Liu J, Duan Z. Quantitative Assessment of Landslide Susceptibility Comparing Statistical Index, Index of Entropy, and Weights of Evidence in the Shangnan Area, China. Entropy. 2018; 20(11):868. https://doi.org/10.3390/e20110868

Chicago/Turabian Style

Liu, Jie, and Zhao Duan. 2018. "Quantitative Assessment of Landslide Susceptibility Comparing Statistical Index, Index of Entropy, and Weights of Evidence in the Shangnan Area, China" Entropy 20, no. 11: 868. https://doi.org/10.3390/e20110868

APA Style

Liu, J., & Duan, Z. (2018). Quantitative Assessment of Landslide Susceptibility Comparing Statistical Index, Index of Entropy, and Weights of Evidence in the Shangnan Area, China. Entropy, 20(11), 868. https://doi.org/10.3390/e20110868

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop