基于随机森林模型的湫水河流域洪水过程模拟
Random Forests Model Based Flood Process Simulation in the Qiushui River Basin
DOI: 10.12677/JWRR.2018.75051, PDF,    国家自然科学基金支持
作者: 唐甜甜, 王 颖, 肖章玲, 李彬权:河海大学水文水资源学院,江苏 南京;樊 静:江苏省水文水资源勘测局泰州分局,江苏 泰州
关键词: 随机森林模型洪水预报湫水河流域黄土高原Random Forests Model Flood Forecasting Qiushui River Basin Loess Plateau
摘要: 黄河中游干旱半干旱地区洪水预报精度水平普遍不高,其主要原因在于降雨时空高度变异性以及大范围水土保持措施对产汇流的强烈干扰。随着现代统计理论发展,智能机器学习算法为该地区洪水预报提供了新的途径。以黄河中游左岸湫水河流域为例,采用随机森林算法建立暴雨洪水预报模型,对汛期场次洪水过程进行模拟,结果表明:当计算时间步长为1小时,随机森林模型的确定性系数(NSE)平均值为0.47,以NSE ≥ 0.60衡量,合格率为42%;当计算时间步长为0.5小时,NSE平均值为0.76,场次洪水预报合格率为88%;由此可知,输入资料精度是决定模型精度的主要因素。此外,不同时间步长条件下,随机森林模型的应用效果均明显优于传统的多元回归统计模型,表明随机森林模型适用于湫水河流域的洪水过程预报,对黄河中游黄土高原地区防洪预警具有一定参考价值。
Abstract: The accuracy level of flood forecasting in arid and semi-arid areas of the middle Yellow River region is generally not high, which is mainly due to the spatial and temporal variability of rainfall and the inten-sive disturbances of large-scale soil and water conservation measures on the runoff production and routing processes. With the development of modern statistical theory, intelligent machine learning algorithms provide a new way for flood forecasting in this region. Taking the Qiushui River Basin on the left bank of the middle reaches of the Yellow River as an example, the random forest algorithm was used to establish the storm-flood forecasting model and simulate the rainfall-runoff during the flood season. The results showed that when the calculation time step was 1 hour, the average value of the Nash-Sutcliffe efficiency (NSE) of the Random Forest model was 0.47, and the qualified rate was 42% when NSE ≥ 0.60 was measured. When the calculation time step was 0.5 hours, the average NSE value was 0.76, and the corresponding qualified rate increased to 88%. Therefore, the accuracy of the input data was a main factor affecting the model accuracy in this region. In addition, under different time steps conditions, the performance of the Random Forest model is obviously better than that of the traditional multivariate regression statistical model, indicating that the random forest model is suitable for flood process prediction in the Qiushui River basin, and has a certain reference value for the flood warning in the Loess Plateau in the middle reaches of the Yellow River.
文章引用:唐甜甜, 王颖, 肖章玲, 樊静, 李彬权. 基于随机森林模型的湫水河流域洪水过程模拟[J]. 水资源研究, 2018, 7(5): 456-463. https://doi.org/10.12677/JWRR.2018.75051

参考文献

[1] 施勇, 栾震宇, 陈炼钢, 等. 长江中下游实时洪水预报模拟[J]. 水科学进展, 2010, 21(6): 847-852. SHI Yong, LUAN Zhenyu, CHEN Liangang, et al. Real-time flood forecasting in the middle and lower reaches of the Yangtze River. Advances in Water Science, 2010, 21(6): 847-852. (in Chinese)
[2] 陈玉林, 韩家田. 半干旱地区洪水预报的若干问题[J]. 水科学进展, 2003, 14(5): 612-616. CHEN Yulin, HAN Jiatian. Problems on flood forecasting in the semi-arid region. Advances in Water Science, 2003, 14(5): 612-616. (in Chinese)
[3] AL-QURASHI, A., MCINTYRE, N., and WHEATER, R. H. Application of the Kineros2 rain-fall-runoff model to an arid catchment in Oman. Journal of Hydrology, 2008, 355(1-4): 91-105. [Google Scholar] [CrossRef
[4] 李彬权, 牛小茹, 梁忠民, 等. 黄河中游干旱半干旱区水文模型研究进展[J]. 人民黄河, 2017, 39(3): 1-4. LI Binquan, NIU Xiaoru, LIANG Zhongmin, et al. Progress of research on hydrological models for arid and semi-arid areas of the middle Yellow River. Yellow River, 2017, 39(3): 1-4. (in Chinese)
[5] 梁忠民, 李彬权, 邱淑会, 等. 大数据分析技术在水文预报中的应用研究——以黄河河龙区间为例[M]. 南京: 河海大学出版社, 2018. LIANG Zhongmin, LI Binquan, QIU Shuhui, et al. Research on the application of big data analysis techniques in hydrological forecasting: Case studies on the Helong region of the Yellow River. Nanjing: Hohai University Press, 2018. (in Chinese)
[6] LI, B., LIANG, Z., ZHANG, J., et al. Attribution analysis of runoff decline in a semiarid region of the Loess Plateau, China. Theoretical and Applied Climatology, 2018, 131(1-2): 845-855. [Google Scholar] [CrossRef
[7] GRAHAM-ROWE, D., BUXTON, B., HAYWARD, V., et al. Big data: Data wrangling. Nature, 2008, 455(7209): 15. [Google Scholar] [CrossRef] [PubMed]
[8] 李国杰, 程学旗. 大数据研究: 未来科技及经济社会发展的重大战略领域——大数据的研究现状与科学思考[J]. 中国科学院院刊, 2012, 27(6): 647-657. LI Guojie, CHENG Xueqi. Research status and scientific thinking of big data. Bulletin of Chinese Academy of Sciences, 2012, 27(6): 647-657. (in Chinese)
[9] 芮孝芳. 水文学与“大数据” [J]. 水利水电科技进展, 2016, 36(3): 1-4. RUI Xiaofang. Hydrology and big data. Advances in Science and Technology of Water Resources, 2016, 36(3): 1-4. (in Chinese)
[10] MOUNT, N. J., MAIER, H. R., TOTH, E., et al. Data-driven modelling approaches for socio-hydrology: Opportunities and challenges within the Panta Rhei Science Plan. Hydrological Sciences Journal, 2016, 61(7): 1192-1208. [Google Scholar] [CrossRef
[11] MA, J., SUN, W., YANG, G., et al. Hydrological analysis using satellite remote sensing big data and CREST model. IEEE Access, 2018, 6: 9006-9016.[CrossRef
[12] BREIMAN, L. Random forests. Machine Learning, 2001, 45(1): 5-32. [Google Scholar] [CrossRef
[13] LI, B., YANG, G., WAN, R., et al. Comparison of random forests and other statistical methods for the prediction of lake water level: A case study of the Poyang Lake in China. Hydrology Research, 2016, 47(S1): 69-83. [Google Scholar] [CrossRef
[14] LIANG, Z., TANG, T., LI, B., et al. Long-term streamflow forecasting using SWAT through the integration of the random forests precipitation generator: Case study of Danjiangkou Reservoir. Hydrology Research, 2017, 49(5): Article ID: nh2017085.
[15] 赵文秀, 张晓丽, 李国会. 基于随机森林和RBF神经网络的长期径流预报[J]. 人民黄河, 2015, 37(2): 10-12. ZHAO Wenxiu, ZHANG Xiaoli, and LI Guohui. Research on the long-term runoff forecast based on random forest model and RBF network. Yellow River, 2015, 37(2): 10-12. (in Chinese)