融合机理启发式时间特征工程的城市内涝水位预测研究
Research on Urban Flooding Water Level Prediction Integrating Mechanism-Heuristic Temporal Feature Engineering
DOI: 10.12677/aep.2026.165089, PDF,   
作者: 金家明:浙江省建设厅建筑工程科创中心,城市地下生命线安全工程技术科创中心,浙江 杭州;浙江建设职业技术学院建筑设备学院,浙江 杭州;蔡海捷, 张 典*:预泓科技(上海)有限公司,上海
关键词: 城市内涝水位预测深度学习机理启发式特征工程智慧排水系统Urban Flooding Water Level Forecasting Deep Learning Mechanism-Heuristic Feature Engineering Smart Drainage System
摘要: 随着全球气候变化的加剧及城市化进程的迅猛推进,城市内涝已成为制约我国中心城区可持续发展的核心瓶颈之一。传统基于物理机制的水文学模型(如SWMM)虽然在机理描述上具有显著优势,但在实时预警场景中往往面临计算效率低、参数获取难及模型维护成本高等现实挑战。针对这一痛点,研究提出了一种融合机理启发式时间特征工程的异构数理模型预测框架。该框架的核心创新在于将水文物理循环机制转化为可量化的时间特征空间,包括:利用二十四节气标识捕捉地球公转驱动的季节性降雨背景、采用三角函数变换映射日内排水周期,以及通过防泄露滚动统计(Shift-Rolling)和多步滞后构造长短期历史记忆。以某典型山前滨河城市为研究对象,在中心城区关键水位站进行了多模型验证。实验对比了包括集成学习(LightGBM、随机森林)、支持向量机(SVM)及深度时序网络(LSTM、CNNLSTM、Transformer)在内的异构模型库。结果表明,基于机理启发式特征工程的LightGBM模型在测试集上取得了0.9887的纳什效率系数(NSE),其预测精度和计算实时性显著优于传统物理模型。研究证实,引入蕴含物理规律的时间特征能显著增强数理模型的泛化能力与稳健性,为智慧排水系统的动态风险评估与科学防涝决策提供了高效的技术支撑。
Abstract: With the exacerbation of global climate change and the rapid advancement of urbanization, urban waterlogging has become a major bottleneck restricting the sustainable development of central urban areas in China. Although traditional physics-based hydrological models (e.g., SWMM) possess significant advantages in mechanistic representation, they frequently face practical challenges in real-time early warning scenarios, such as low computational efficiency, difficulties in parameter acquisition, and high model maintenance costs. To address these limitations, this study proposes a heterogeneous predictive modeling framework integrating mechanism-heuristic temporal feature engineering. The core innovation of this framework lies in translating physical hydrological cycle mechanisms into a quantifiable temporal feature space. Specifically, this includes: utilizing the Twenty-Four Solar Terms to capture the seasonal rainfall background driven by the Earth’s revolution; employing trigonometric transformations to map intra-day drainage cycles; and constructing long- and short-term historical memory via data leakage-free shift-rolling statistics and multi-step lags. Taking a typical piedmont riverine city as the case study, multi-model validation was conducted at key water level stations within the central urban area. The experiments compared a diverse library of heterogeneous models, including ensemble learning algorithms (LightGBM, Random Forest), Support Vector Machines (SVM), and deep time-series networks (LSTM, CNN-LSTM, Transformer). The results indicate that the LightGBM model, enhanced by mechanism-heuristic feature engineering, achieved a Nash-Sutcliffe Efficiency (NSE) coefficient of 0.9887 on the test set. Its prediction accuracy and real-time computational efficiency significantly outperformed traditional physics-based models. This study confirms that incorporating temporal features embedded with physical principles can significantly enhance the generalization capability and robustness of data-driven models, providing highly efficient technical support for dynamic risk assessment and scientific waterlogging mitigation decisions in smart drainage systems.
文章引用:金家明, 蔡海捷, 张典. 融合机理启发式时间特征工程的城市内涝水位预测研究[J]. 环境保护前沿, 2026, 16(5): 900-910. https://doi.org/10.12677/aep.2026.165089

参考文献

[1] Wang, Q., Tang, M. and Shi, P. (2025) Depth Prediction of Urban Waterlogging Based on BiTCN-GRU Modeling. PLOS One, 20, e0321637. [Google Scholar] [CrossRef] [PubMed]
[2] Fu, S., Schultz, D.M., Lyu, H., Zheng, Z. and Zhang, C. (2025) Creating a National Urban Flood Dataset for China from News Texts (2000-2022) at the County Level. Hydrology and Earth System Sciences, 29, 767-783. [Google Scholar] [CrossRef
[3] Deng, Y. and Lu, X. (2026) Explainable Machine Learning-Based Urban Waterlogging Prediction Framework. Urban Science, 10, Article 156. [Google Scholar] [CrossRef
[4] Fan, Z., Zhang, J., Chen, Y. and Xu, H. (2025) Urban Flood Prediction Model Based on Transformer-LSTM-Sparrow Search Algorithm. Water, 17, Article 1404. [Google Scholar] [CrossRef
[5] Hu, Y., Tang, M., Ma, S., Zhu, Z., Zhou, Q., Xie, Q., et al. (2025) A Rapid Prediction Method for Key Information of the Urban Flood Control Engineering System Based on Machine Learning: An Empirical Study of the Wusha River Basin. Water, 17, Article 784. [Google Scholar] [CrossRef
[6] Santiago-Collazo, F., Bayer, L., Viloria, O., Cardenas, L. and Bilskie, M. (2023) SWMM Model Limitations Report. University of Georgia’s College of Engineering.
https://www.thempc.org/docs/lit/CoreMpo/Studies/Ufm/2023/FMT-2023-08-31-SWMM-LimitationsReport.pdf
[7] Zhang, T., Zhu, T., Gao, K., Zhou, W. and Yu, P.S. (2023) Balancing Learning Model Privacy, Fairness, and Accuracy with Early Stopping Criteria. IEEE Transactions on Neural Networks and Learning Systems, 34, 5557-5569. [Google Scholar] [CrossRef] [PubMed]
[8] Zhu, Z., Wu, G., Gu, Y., Huang, J., Chen, Z. and Lu, H. (2024) Time-Varying Discrete Cosine Transform Based on Shaping Regularization and Its Application in Seismic Data Analysis. Journal of Geophysics and Engineering, 21, 496-506. [Google Scholar] [CrossRef
[9] Li, J.F., Lu, Z.Y., Xu, C., and Zhang, W.X. (2024) Analysis of Geographical Origin of Solar Terms Based on the STTMD Method. Scientific Reports, 14, Article No. 22708. [Google Scholar] [CrossRef] [PubMed]
[10] Álvarez-Estrella, J., Muñoz, P., Bendix, J., Contreras, P. and Célleri, R. (2024) Enhancing Peak Runoff Forecasting through Feature Engineering Applied to X-Band Radar Data. Water, 16, Article 968. [Google Scholar] [CrossRef
[11] Hodson, T.O. (2022) Root-Mean-Square Error (RMSE) or Mean Absolute Error (MAE): When to Use Them or Not. Geoscientific Model Development, 15, 5481-5487. [Google Scholar] [CrossRef