基于特征工程的强降水物理要素提取及分析
Using Feature Engineering to Extract Important Physical Parameters for Heavy Rain
DOI: 10.12677/CSA.2022.121016, PDF,    国家自然科学基金支持
作者: 钟 琦*, 方祖亮, 王秀明:中国气象局气象干部培训学院,北京;孙 卓:河北省气象台,河北 石家庄;张智察:浙江省气象台,浙江 杭州
关键词: 机器学习特征提取强降水物理要素华北地区Machine Learning Feature Extraction Heavy Rain Physical Parameters North China
摘要: 灾害性强降水事件对人民生命和财产安全具有重大影响,其精准预报是防灾减灾的重要内容,但因其突发性强、变化快和强度大的特点,常用数值模式的预报准确率很低。近年来随着人工智能技术快速发展,融合物理特征和机器学习的强降水预报成为热点研究。本文利用特征工程提取强降水预报关键物理因子,发现随着预报精细度提升,模式直接预报降水的贡献越来越小,而相关物理特征更具参考价值。对6小时以上累计降水,中高层的动力维持作用、整层抬升和整层含水量具有较大贡献;而对于3小时以内降水预报,近地面层的动力辐合、不稳定能量特征、及影响降水效率的云微物理因子重要性更加凸显,这与降水系统的多尺度特征和模式可预报性相对应。通过特征工程精简后的物理特征可有效去除冗余提升机器学习模型效率,为提升强降水的预报准确率提供支撑。
Abstract: Hazardous heavy rain events have significant impacts on people’s lives and property safety. It’s important to make accurate prediction to reduce the loss. However, due to its high intensity and rapid change, the accuracy of operational numerical models is very low. In recent years, with the rapid development of artificial intelligence technology, more and more studies devote on integrating physical understanding and machine learning to improve heavy rain forecast. In this study, the key physical factors of heavy precipitation prediction are extracted by feature engineering methods. It is found that the finer the forecast is, the contribution of forecasted rain is smaller, but the physical features are more important. For the above 6 h rainfall, the dynamics of the middle and high levels, the uplift of the whole layer and the water content of the whole layer contribute greatly; for forecast less than 3 hours, the importance of dynamic uplift at low level, the unstable energy and some microphysical factors are prominent, which is in according to the multi-scale characteristics of the atmosphere. The physical features simplified by feature engineering can effectively remove redundancy and improve the efficiency of machine learning model. The results would help to improve the prediction of heavy rain events.
文章引用:钟琦, 方祖亮, 孙卓, 王秀明, 张智察. 基于特征工程的强降水物理要素提取及分析[J]. 计算机科学与应用, 2022, 12(1): 147-157. https://doi.org/10.12677/CSA.2022.121016

参考文献

[1] Hernández, E., Sanchez-Anguix, V., Julian, V., Palanca, J. and Duque, N. (2016) Rainfall Prediction: A Deep Learning Approach. In: Martínez-Álvarez, F., Troncoso, A., Quintián, H. and Corchado, E., Eds., Hybrid Artificial Intelligent Systems. HAIS 2016. Lecture Notes in Computer Science, Vol. 9648. Springer, Cham. [Google Scholar] [CrossRef
[2] 邢彩盈, 张京红, 黄海静. 基于BP神经网络的海口住宅室内气温预测[J]. 贵州气象, 2016, 40(5): 38-42.
[3] 门晓磊, 焦瑞莉, 王鼎, 等. 基于机器学习的华北气温多模式集合预报的订正方法[J]. 气候与环境研究, 2019, 24(1): 116-124.
[4] Czernecki, B., Taszarek, M., Marosz, M., et al. (2019) Application of Machine Learning to Large Hail Prediction—The Importance of Radar Reflectivity, Lightning Occurrence and Convective Parameters Derived from ERA5. Atmospheric Research, 227, 249-262. [Google Scholar] [CrossRef
[5] Han, H., Lee, S., Im, J., et al. (2015) Detection of Convective Initiation Using Meteorological Imager Onboard Communication, Ocean, and Meteorological Satellite Based on Machine Learning Approaches. Remote Sensing, 7, 9184-9204. [Google Scholar] [CrossRef
[6] 陈浩, 宁忱, 南卓铜, 等. 基于机器学习模型的青藏高原日降水数据的订正研究[J]. 冰川冻土, 2017, 39(3): 583-592.
[7] 张晨阳, 杨雪冰, 张文生. 气象大数据超短临精准降水机器学习与典型应用[J]. 农业大数据学报, 2019(1): 78-87.
[8] 黄威, 牛若芸. 基于集合预报和支持向量机的中期强降雨集成预报试验[J]. 气象, 2017, 43(9): 1110-1116.
[9] 冀慧杰, 倪枫, 刘姜, 等. 基于XGB-BFS特征选择算法的电信客户流失预测[J]. 计算机技术与发展, 2021, 31(5): 21-25.
[10] 李占山, 刘兆赓. 基于XGBoost的特征选择算法[J]. 通信学报, 2019, 40(10): 101-108.
[11] 杨长春, 徐筱, 宦娟, 等. 基于随机森林的学生画像特征选择方法[J]. 计算机工程与设计, 2020, 40(10): 2827-2834.
[12] 王智, 张志强, 谢晓芹, 等. 基于提升树的PM2.5浓度预测模型[J]. 软件, 2018, 39(10): 156-163.