基于IV-RF模型地质灾害易发性及可解释性分析
Susceptibility and Interpretability Analysis of Geological Hazards Based on the IV-RF Model
摘要: 云南省地质构造复杂,地质灾害多发易发,开展地质灾害易发性预测对防灾减灾意义重大。本研究以镇雄县为研究区域,采用信息量模型(Information Value, IV)量化了各因子属性值对灾害的影响,并耦合随机森林模型(Random Forest, RF)建立评价模型同时,采用沙普利可加和解释方法(Shapley Additive Explanations, SHAP)对模型进行可解释性分析,以明确各因子对地质灾害的影响。研究收集了镇雄县地质灾害点及16个影响因子数据,经处理后用于模型构建与验证。结果表明,IV-RF复合模型ROC曲线下面积(AUC)达到0.93,具有较高的预测精度和可靠性,且地质灾害易发性呈现显著空间差异,极高和高易发区集中在县域中部、南部及东部部分区域。SHAP分析显示,高程(DEM)、距道路距离(Distance to Roads, DistRd)、归一化植被指数(NDVI)等因子对地质灾害影响显著,各因子信息量值增大通常与地质灾害易发性增加相关。本研究为区域地质灾害风险管理提供了科学有效的工具和重要参考。
Abstract: Yunnan Province is characterized by complex geological structures and a high frequency of geological hazards. Accurate prediction of geological hazard susceptibility is of great significance for disaster prevention and mitigation. Taking Zhenxiong County as the study area, this study uses the Information Value (IV) model to quantify the influence of attribute values of each factor on hazards, and couples it with the Random Forest (RF) model to construct an evaluation model. Meanwhile, the Shapley Additive Explanations (SHAP) method is employed to conduct interpretability analysis of the model, so as to clarify the impact of each factor on geological hazards. Data on geological hazard locations and 16 influencing factors in Zhenxiong County were collected, processed, and then used for model construction and validation. The results show that the area under the ROC curve (AUC) of the IV-RF hybrid model is 0.93, indicating high prediction accuracy and reliability. In addition, geological hazard susceptibility shows significant spatial heterogeneity, with very high and high susceptibility zones concentrated in the central, southern and parts of eastern regions of the county. SHAP analysis reveals that factors such as elevation (DEM), distance to roads (DistRd), and Normalized Difference Vegetation Index (NDVI) impose significant impacts on geological hazards, and the increase in information values of each factor is generally positively correlated with the increase in geological hazard susceptibility. This study provides a scientific and effective tool and an important reference for regional geological hazard risk management.
文章引用:陈菊芳. 基于IV-RF模型地质灾害易发性及可解释性分析[J]. 地球科学前沿, 2026, 16(5): 823-835. https://doi.org/10.12677/ag.2026.165075

参考文献

[1] 马玲玲, 高海东, 于国强, 等. 基于耦合信息量与机器学习的陕西省地质灾害易发性评价[J]. 自然灾害学报, 2026, 35(1): 92-103.
[2] 王宇. 云南省地质灾害防治与研究历史评述[J]. 灾害学, 2019, 34(3): 134-139.
[3] 谭玉敏, 郭栋, 白冰心, 等. 基于信息量模型的涪陵区地质灾害易发性评价[J]. 地球信息科学学报, 2015, 17(12): 1554-1562.
[4] 赵艳南, 牛瑞卿. 基于证据权法的滑坡危险性区划探索[J]. 地理与地理信息科学, 2010, 26(6): 19-23.
[5] 毕结昂, 徐佩华, 宋盛渊, 等. 基于信息量-逻辑回归耦合模型的玛纳斯河流域地质灾害易发性评价[J]. 工程地质学报, 2022, 30(5): 1549-1560.
[6] 武雪玲, 任福, 牛瑞卿, 等. 斜坡单元支持下的滑坡易发性评价支持向量机模型[J]. 武汉大学学报(信息科学版), 2013, 38(12): 1499-1503.
[7] 米国良, 朱兴华, 陶雪文, 等. 基于随机森林模型的城市路面塌陷易发性评价——以杭州市平原沟谷区为例[J/OL]. 中国地质灾害与防治学报, 1-15.
https://link.cnki.net/urlid/11.2852.p.20250825.1534.002, 2026-03-30.
[8] 余飞燕, 吕蒙, 杨军. 云南省镇雄县滑坡地质灾害主要致灾因子筛选及危险性分区[J]. 地质与资源, 2015, 24(2): 152-156.
[9] 胡波, 侯岳林, 李文, 等. 融合InSAR与改进CNN–BiLSTM的镇雄县滑坡形变分析与预测方法研究[J/OL]. 安全与环境学报, 1-12. https://link.cnki.net/doi/10.13637/j.issn.1009-6094.2025.1290, 2026-03-30.[CrossRef
[10] 薛智文, 许冲, 付登文, 等. 2024年1月22日云南镇雄滑坡灾害成因分析[J]. 地震研究, 2025, 48(1): 80-88.
[11] 张雷雨, 杨毅, 梁霄. 地理加权回归模型的多重共线性诊断方法[J]. 测绘与空间地理信息, 2017, 40(10): 28-31.
[12] Ma, J., Wang, X. and Yuan, G. (2023) Evaluation of Geological Hazard Susceptibility Based on the Regional Division Information Value Method. ISPRS International Journal of Geo-Information, 12, Article 17. [Google Scholar] [CrossRef
[13] Zhang, J., Ma, X., Zhang, J., Sun, D., Zhou, X., Mi, C., et al. (2023) Insights into Geospatial Heterogeneity of Landslide Susceptibility Based on the SHAP-XGBoost Model. Journal of Environmental Management, 332, Article 117357. [Google Scholar] [CrossRef] [PubMed]
[14] Lundberg, S.M., Erion, G.G. and Lee, S.I. (2018) Consistent Individualized Feature Attribution for Tree Ensembles. arXiv:1802.03888.