Landslide susceptibility assessment using machine learning with a novel SHAP-based sampling strategy
Lei-Lei Liu , Can Duan , Jun-Hua Gao , Hao Xiao , Wen-Qing Zhu , Can Yang
Geoscience Frontiers ›› 2026, Vol. 17 ›› Issue (2) : 102188
The landslide and non-landslide samples are important inputs for machine learning-based landslide susceptibility assessment. Compared to landslide samples, non-landslide samples generally present higher uncertainty due to random sampling. However, most sampling strategies (e.g., the feature space-based) for non-landslides only consider the characteristics of a single factor or the overall characteristics of all factors, which subsequently leads to either excessive artificial concentration of non-landslide samples or sampling information redundancy. To address these issues, a SHapley Additive exPlanations (SHAP) based sampling strategy considering combined characteristics of landslide conditioning factors (LCFs) is proposed. This strategy sorts the importance of LCFs based on SHAP algorithm and generates multiple sampling spaces using different numbers of LCFs in the sense of importance order. The optimal sampling space is selected according to the Bayesian optimization algorithm. Then, random forest (RF) and extreme gradient boosting (XGBoost) models are utilized to assess the susceptibility of Chaling County, Yanling County, and Guidong County, China, based on the proposed strategy and traditional random sampling. The results indicate that, compared with the traditional RF and XGBoost models, the improved models show better performance with an 8.2% and 9.0% increase in the AUC, respectively. Furthermore, the SHAP-based sampling framework demonstrates good adaptability across the study areas with different geological and geomorphic conditions, suggesting its potential transferability to other regions, although local optimization of parameter settings may still be required.
Landslide susceptibility / SHapley Additive exPlanations (SHAP) / Interpretable machine learning / Sampling strategy / Landslide conditioning factors (LCFs)
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
|
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
|
| [66] |
|
| [67] |
|
| [68] |
|
| [69] |
|
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
|
| [74] |
|
| [75] |
|
| [76] |
|
/
| 〈 |
|
〉 |