基于深度学习和综合评价算法的城市极端降水灾害预警预测
Urban Extreme Precipitation Disaster Early Warning and Prediction Based on Deep Learning and Comprehensive Evaluation Algorithms
摘要: 针对数值模式在短临尺度局地极端降水预报中的漏报风险,以及固定阈值方法难以刻画多气象要素非线性耦合的问题,本文提出一种融合多尺度特征与特征词元化Transformer (FT-Transformer)的极端降水事件风险评估模型。该模型通过特征词元化与自注意力机制自动捕捉降水强度、累积量与温湿压等背景场的高阶交互,构建分类与危害指数回归相结合的多任务学习框架,利用回归任务正则化缓解极端样本稀缺带来的过拟合问题,并采用保序回归(Isotonic Regression)校准输出概率。基于郑州区域2015~2021年共25,465个降水事件的实验表明:模型在2021年测试集(3808个事件,含郑州“7∙20”特大暴雨)上取得F1 = 0.9573、AUC = 0.9998、高危事件召回率98.25%,显著优于随机森林(F1 = 0.9148)和逻辑回归(F1 = 0.9422)等基线方法;概率校准使平均绝对误差下降约35%。对“7∙20”特大暴雨的回顾性分析显示,模型能准确识别该极端事件并给出合理的风险刻度,验证了方法的有效性。
Abstract: Aiming at the missed-alarm risk of numerical weather prediction (NWP) models in short-term local extreme precipitation forecasting and the difficulty of fixed-threshold methods in characterizing nonlinear coupling among multiple meteorological factors, this paper proposes an extreme precipitation event risk assessment model integrating multi-scale features with Feature-Tokenizer Transformer (FT-Transformer). The model automatically captures high-order interactions among precipitation intensity, accumulation, and background fields (temperature, humidity, pressure) through feature tokenization and self-attention mechanism. A multi-task learning framework combining classification and hazard index regression is constructed, where the regression task serves as regularization to alleviate overfitting caused by extreme sample scarcity, and Isotonic Regression is employed to calibrate output probabilities. Experiments on 25,465 precipitation events from Zhengzhou area during 2015~2021 demonstrate that the model achieves F1 = 0.9573, AUC = 0.9998, and high-risk recall of 98.25% on the 2021 test set (3,808 events including the “July 20th” extreme rainstorm), significantly outperforming Random Forest (F1 = 0.9148) and Logistic Regression (F1 = 0.9422). Probability calibration reduces the mean absolute error by approximately 35%. Retrospective analysis of the “July 20th” extreme rainstorm shows that the model accurately identifies this extreme event and provides reasonable risk scores, validating the effectiveness of the proposed method.
文章引用:徐泉猷, 许武磊, 李永达. 基于深度学习和综合评价算法的城市极端降水灾害预警预测[J]. 应用数学进展, 2026, 15(4): 567-575. https://doi.org/10.12677/aam.2026.154183

参考文献

[1] IPCC (2021) Climate Change 2021: The Physical Science Basis. Cambridge University Press.
[2] 卢珊, 胡泽勇, 王百朋, 等. 近56年中国极端降水事件的时空变化格局[J]. 高原气象, 2020, 39(4): 683-693.
[3] 苏爱芳, 吕晓娜, 崔丽曼, 等. 郑州“7∙20”极端暴雨天气的基本观测分析[J]. 暴雨灾害, 2021, 40(5): 445-454.
[4] Bauer, P., Thorpe, A. and Brunet, G. (2015) The Quiet Revolution of Numerical Weather Prediction. Nature, 525, 47-55. [Google Scholar] [CrossRef] [PubMed]
[5] Sun, J., Xue, M., Wilson, J.W., Zawadzki, I., Ballard, S.P., Onvlee-Hooimeyer, J., et al. (2014) Use of NWP for Nowcasting Convective Precipitation: Recent Progress and Challenges. Bulletin of the American Meteorological Society, 95, 409-426. [Google Scholar] [CrossRef
[6] 王奇. 城市暴雨内涝灾害风险评估及防御措施研究[J]. 水利科学与寒区工程, 2024, 7(11): 110-113.
[7] 孙玉龙, 王丽荣, 李婷, 等. 基于随机森林算法的河北省暴雨道路灾害风险评估模型研究[J]. 中国防汛抗旱, 2026, 36(3): 30-36.
[8] 杨璐, 南刚强, 陈明轩, 等. 基于三种机器学习方法的降水相态高分辨率格点预报模型的构建及对比分析[J]. 气象学报, 2021, 79(6): 1022-1034.
[9] 李鸿霄, 张怀念, 杨洁, 等. 基于XGBoost对大数据驱动地形气候交互模型的构建与暴雨灾害预测[J]. 数学的实践与认识, 2025, 55(10): 98-108.
[10] 戈苗苗, 陆振宇, 梁邵阳, 等. 基于改进双阶段注意力机制的降水智能预报[J]. 南京信息工程大学学报(自然科学版), 2021, 13(6): 744-752.
[11] Grinsztajn, L., Oyallon, E. and Varoquaux, G. (2022) Why Do Tree-Based Models Still Outperform Deep Learning on Typical Tabular Data? Advances in Neural Information Processing Systems, 35, 507-520. [Google Scholar] [CrossRef
[12] He, J., Yang, K., Tang, W., Lu, H., Qin, J., Chen, Y., et al. (2020) The First High-Resolution Meteorological Forcing Dataset for Land Process Studies over China. Scientific Data, 7, 1-11. [Google Scholar] [CrossRef] [PubMed]
[13] Gorishniy, Y., Rubachev, I. and Khrulkov, V. (2021) Revisiting Deep Learning Models for Tabular Data. Advances in Neural Information Processing Systems, 34, 18932-18943.