基于深度可分离组卷积的临近预报
Precipitation Nowcasting Based on Depthwise Separable Group Convolution
摘要: 强对流天气具有突发性、局地性等特征,它同时又可能造成各种重大损失。精准、实时的强对流天气预报对风暴预警、航空运行及大型集会组织等活动的规划均具有重要价值。临近降水预报作为强对流预报的一个重要工具,因此越来越受到气象预报专家与研究人员的高度关注。过去数十年间,科研人员主要通过求解大气运动方程组,借助数值天气预报开展临近降水预报工作。该方法虽取得一定成效,但庞大的计算资源需求与复杂的方程求解过程,导致其在临近预报中的应用面临较大困难。近年来,研究人员采用自注意力机制、扩散模型等深度学习方法,有效提升了预报性能,却普遍受困于模型尺寸过大、轻量化程度不足等问题,使得模型难以在预报性能与参数规模之间实现平衡。为在模型参数与预报性能之间实现一定平衡,本文提出一种基于深度可分离组卷积的编码器–转换器–解码器模型。该模型聚焦于强对流天气中强对流降水的及时预报,其内部通过多层深度可分离组卷积实现降水信息的多尺度特征提取,同时通过时空特征融合模块实现多层级特征的融合。SEVIR数据集上的实验结果表明我们提出的模型预报效果优于其他对比方法,量化实验与可视化实验结果进一步验证了该模型的有效性与实用性。
Abstract: Severe convective weather, characterized by its abrupt onset and high localization often leads to significant losses. Accurate and real-time forecasting of such events is therefore great importance for storm warnings, aviation operations, and the planning of large-scale gatherings. As a key tool for predicting severe convective events, nowcasting of precipitation has attracted increasing attention from meteorologists and researchers. Over the past several decades, most studies have relied on solving atmospheric dynamic equations through numerical weather prediction (NWP) methods. Although this approach has achieved notable progress, its application in short-term forecasting remains challenging due to the intensive computational demands and the complexity of solving these equations. In recent years, deep learning techniques, such as self-attention mechanisms and diffusion models, have been introduced to enhance forecasting performance. However, these models often suffer from excessive parameter sizes and insufficient lightweight design, making it difficult to balance predictive accuracy with computational efficiency. To address this issue, this study proposes an encoder-transformer-decoder architecture based on depthwise separable group convolution. The model focuses on timely forecasting of convective precipitation associated with severe convective weather. Within the architecture, multi-scale precipitation features are extracted through multiple layers of depthwise separable group convolutions, while a spatiotemporal fusion module integrates multi-level features. Experiments conducted on the SEVIR dataset demonstrate that the proposed model outperforms other baseline methods. Both quantitative and visualization results further confirm the model’s effectiveness and practical value.
文章引用:张海波, 张福贵, 熊太松. 基于深度可分离组卷积的临近预报[J]. 计算机科学与应用, 2025, 15(11): 185-195. https://doi.org/10.12677/csa.2025.1511296

参考文献

[1] Jing, J., Li, Q., Peng, X., Ma, Q. and Tang, S. (2020) HPRNN: A Hierarchical Sequence Prediction Model for Long-Term Weather Radar Echo Extrapolation. ICASSP 2020—2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, 4-8 May 2020, 4142-4146. [Google Scholar] [CrossRef
[2] Ma, Z., Zhang, H. and Liu, J. (2022) Focal Frame Loss: A Simple but Effective Loss for Precipitation Nowcasting. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15, 6781-6788. [Google Scholar] [CrossRef
[3] Ehsani, M.R., Zarei, A., Gupta, H.V., Barnard, K., Lyons, E. and Behrangi, A. (2022) Nowcasting-Nets: Representation Learning to Mitigate Latency Gap of Satellite Precipitation Products Using Convolutional and Recurrent Neural Networks. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-21. [Google Scholar] [CrossRef
[4] 董润婷, 吴利, 王晓英, 等. 深度学习在天气预报领域的应用分析及研究进展综述[J]. 计算机应用, 2023, 43(6): 1958-1968.
[5] Ayzel, G., Heistermann, M. and Winterrath, T. (2019) Optical Flow Models as an Open Benchmark for Radar-Based Precipitation Nowcasting (Rainymotion V0.1). Geoscientific Model Development, 12, 1387-1402. [Google Scholar] [CrossRef
[6] Bowler, N.E., Pierce, C.E. and Seed, A.W. (2006) STEPS: A Probabilistic Precipitation Forecasting Scheme Which Merges an Extrapolation Nowcast with Downscaled NWP. Quarterly Journal of the Royal Meteorological Society, 132, 2127-2155. [Google Scholar] [CrossRef
[7] 方巍, 齐媚涵. 基于深度学习的高时空分辨率降水临近预报方法[J]. 地球科学与环境学报, 2023, 45(3): 706-718.
[8] 方巍, 沈亮, 邹立尧, 等. 基于GCA-ConvLSTM预测网格的短临降水雷达回波外推方法[J]. 暴雨灾害, 2023, 42(4): 427-436.
[9] Shi, X., Chen, Z., Wang, H., Yeung, D.Y., et al. (2015) Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. Annual Conference on Neural Information Processing Systems (NIPS 2015), Montreal, 7-12 December 2015, 802-810.
https://proceedings.neurips.cc/paper/2015/hash/07563a3fe3bbe7e3ba84431ad9d055af-Abstract.html
[10] Graves, A. (2012) Long Short-Term Memory. In: Graves, A., Ed., Supervised Sequence Labelling with Recurrent Neural Networks, Springer, 37-45. [Google Scholar] [CrossRef
[11] Shi, X., Gao, Z., Lausen, L., Wang, H., et al. (2017) Deep Learning for Precipitation Nowcasting: A Benchmark and a New Model. Annual Conference on Neural Information Processing Systems (NIPS 2017), California, 4-9 December 2017, 5618-5628.
https://proceedings.neurips.cc/paper/2017/hash/a6db4ed04f1621a119799fd3d7545d3d-Abstract.html
[12] Wang, Y., Long, M., Wang, J., Gao, Z., et al. (2017) PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs. Annual Conference on Neural Information Processing Systems (NIPS 2017), California, 4-9 December 2017, 880-889.
https://proceedings.neurips.cc/paper/2017/hash/e5f6ad6ce374177eef023bf5d0c018b6-Abstract.html
[13] Peng, D., Chen, M., Zhang, Y. and Tian, Z. (2025) Enhanced Optic-Flow Extrapolation for Doppler Radar Nowcasting with Dynamic Weight Attention. Expert Systems with Applications, 267, Article ID: 126168. [Google Scholar] [CrossRef
[14] Gao, Z., Tan, C., Wu, L. and Li, S.Z. (2022) SimVP: Simpler Yet Better Video Prediction. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 3160-3170. [Google Scholar] [CrossRef
[15] Castro, R., Souto, Y.M., Ogasawara, E., Porto, F. and Bezerra, E. (2021) STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for Weather Forecasting. Neurocomputing, 426, 285-298. [Google Scholar] [CrossRef
[16] Xu, L., Lu, W., Yu, H., Yao, F., Sun, X. and Fu, K. (2024) SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation. IEEE Transactions on Geoscience and Remote Sensing, 62, 1-15. [Google Scholar] [CrossRef
[17] Bai, C., Sun, F., Zhang, J., Song, Y. and Chen, S. (2022) Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting. IEEE Geoscience and Remote Sensing Letters, 19, 1-5. [Google Scholar] [CrossRef
[18] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 3-19. [Google Scholar] [CrossRef
[19] Li, D., Deng, K., Zhang, D., Liu, Y., Leng, H., Yin, F., et al. (2023) LPT-QPN: A Lightweight Physics-Informed Transformer for Quantitative Precipitation Nowcasting. IEEE Transactions on Geoscience and Remote Sensing, 61, 1-19. [Google Scholar] [CrossRef
[20] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N. and Polosukhin, I. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems, 30, 6000-6010.
[21] Guibas, J., Mardani, M., Li, Z., Tao, A., Anandkumar, A. and Catanzaro, B. (2021) Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers. arXiv: 2111.13587.
[22] Buchholz, T. and Jug, F. (2022) Fourier Image Transformer. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, 19-20 June 2022, 1845-1853. [Google Scholar] [CrossRef
[23] Gao, Z., Shi, X., Han, B., Wang, H., Jin, X., Maddix, D., Wang, Y.B., et al. (2023) Prediff: Precipitation Nowcasting with Latent Diffusion Models. Advances in Neural Information Processing Systems, 36, 78621-78656
[24] Yu, D., Li, X., Ye, Y., Zhang, B., Luo, C., Dai, K., et al. (2024) DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 27758-27767. [Google Scholar] [CrossRef
[25] Younesi, A., Ansari, M., Fazli, M., Ejlali, A., Shafique, M. and Henkel, J. (2024) A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends. IEEE Access, 12, 41180-41218. [Google Scholar] [CrossRef
[26] Dumoulin, V. and Visin, F. (2016) A Guide to Convolution Arithmetic for Deep Learning. arXiv: 1603.07285.
[27] Veillette, M., Samsi, S. and Mattioli, C. (2020) Sevir: A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology. Advances in Neural Information Processing Systems, 33, 22009-22019.
[28] Loshchilov, I. and Hutter, F. (2017) Decoupled Weight Decay Regularization. arXiv: 1711.05101 [Google Scholar] [CrossRef
[29] Gao, Z., Shi, X., Wang, H., Zhu, Y., Wang, Y.B., Li, M. and Yeung, D.Y. (2022) EarthFormer: Exploring Space-Time Transformers for Earth System Forecasting. Advances in Neural Information Processing Systems, 35, 25390-25403.