基于时空注意力机制的小麦气象产量预测LSTM模型改进
An Improved LSTM Model for Wheat Meteorological Yield Prediction Based on Spatial-Temporal Attention Mechanism
摘要: 针对市域小麦气象产量预测中数据稀疏导致的过拟合及生育期先验知识利用不足的问题,本文提出一种基于时空双层次注意力机制的LSTM改进模型(ST-Attention LSTM)。该模型在标准LSTM主干网络中嵌入生育期级与特征级双层注意力模块,通过分生育期独立学习的权重矩阵显式编码物候学先验,实现“先选生育期、再选气象指标”的递进式特征提纯,以可控参数增量换取显著性能提升。基于豫北地区6个地市2000~2019年共108组样本开展留一市交叉验证,实验结果表明:所提模型在预测精度上优于XGBoost、Transformer等基准方法;消融实验证实移除注意力机制将导致显著性能退化;权重可视化揭示的生育期梯度分布与气象因子主导性客观反映小麦生理规律。该模型兼具高精度、可解释性与训练稳定性,为市域尺度气候风险评估提供了轻量化、可落地的智能解决方案。
Abstract: To address the overfitting caused by data sparsity and insufficient utilization of phenological prior knowledge in urban-scale wheat meteorological yield prediction, this paper proposes an improved LSTM model based on spatial-temporal dual-level attention mechanism (ST-Attention LSTM). The model embeds growth-stage-level and intra-feature attention modules into the standard LSTM backbone, explicitly encoding phenological priors through independently learned weight matrices for each growth stage, achieving progressive feature refinement via a “select growth stage first, then meteorological factors” hierarchy with controllable parameter increment. Leave-one-city-out cross-validation was conducted on 108 samples from 6 cities in northern Henan Province during 2000~2019. Experimental results show that the proposed model outperforms benchmark methods such as XGBoost and Transformer in prediction accuracy; ablation studies confirm that removing the attention mechanism leads to significant performance degradation; weight visualizations reveal gradient distributions across growth stages and dominant meteorological factors that objectively reflect wheat physiological patterns. The model achieves high accuracy, interpretability, and training stability, providing a lightweight and practical intelligent solution for urban-scale climate risk assessment.
文章引用:刘传龙. 基于时空注意力机制的小麦气象产量预测LSTM模型改进[J]. 计算机科学与应用, 2026, 16(2): 439-447. https://doi.org/10.12677/csa.2026.162072

参考文献

[1] 王旭, 刘波, 陈正超, 等. 基于多源数据和LSTM模型的县域冬小麦估产[J]. 农业机械学报, 2020, 51(5): 265-272.
[2] 赫晓慧, 许明晨, 杨永辉, 等. 基于遥感数据时空特征提取的冬小麦估产方法[J]. 农业工程学报, 2023, 39(15): 165-173.
[3] 金夏, 何彦锋, 陈强, 等. 基于LSTM-Transformer模型的生物滞留设施土壤湿度预测[J]. 水利学报, 2023, 54(8): 1-10.
[4] Smajlhodžić-Deljo, M., Hundur Hiyari, M., Gurbeta Pokvić, L., Merdović, N., Bećirović, F., Spahić, L., et al. (2024) Using Data-Driven Computer Vision Techniques to Improve Wheat Yield Prediction. AgriEngineering, 6, 4704-4719. [Google Scholar] [CrossRef
[5] Feng, P., Wang, B., Liu, D.L., Waters, C. and Yu, Q. (2019) Incorporating Machine Learning with Biophysical Model Can Improve the Evaluation of Climate Extremes Impacts on Wheat Yield in South-Eastern Australia. Agricultural and Forest Meteorology, 275, 100-113. [Google Scholar] [CrossRef
[6] Ahmed, M.U. and Hussain, I. (2022) Prediction of Wheat Production Using Machine Learning Algorithms in Northern Areas of Pakistan. Telecommunications Policy, 46, Article 102370. [Google Scholar] [CrossRef
[7] Alkhudaydi, T. and De La Lglesia, B. (2022) Counting Spikelets from Infield Wheat Crop Images Using Fully Convolutional Networks. Neural Computing and Applications, 34, 17539-17560. [Google Scholar] [CrossRef
[8] Zhou, H., Qin, B., Li, Q., Su, W., Liang, S., Min, H., et al. (2026) Integrating 3D Detection Networks and Dynamic Temporal Phenotyping for Wheat Yield Classification and Prediction. Artificial Intelligence in Agriculture, 16, 603-618. [Google Scholar] [CrossRef