兰州市空气质量指数预测——基于LSTM的混合模型研究
Lanzhou Air Quality Index Prediction—A Mixed Model Study Based on LSTM
摘要: 本文旨在通过构建EEMD-GWO-LSTM混合模型,对兰州市空气质量指数(AQI)进行准确预测。兰州市作为中国西北地区重要的工业基地和交通枢纽,其空气质量受工业排放、交通污染及地理环境等因素影响,常年处于高污染等级。针对兰州市AQI监测数据突变性强的特点,文章首先对数据进行预处理,包括填补缺失值、归一化处理等,以提高数据质量。随后,采用集合经验模态分解(EEMD)对数据进行分解,提取出本征模态函数(IMF),并利用灰狼优化算法(GWO)对长短期记忆网络(LSTM)模型的超参数进行优化,以提高预测精度。实验结果表明,EEMD-GWO-LSTM混合模型在预测兰州市AQI时,相较于单一模型和其他混合模型,具有更低的均方根误差(RMSE)和更高的决定系数(R2),显示出更好的预测性能。最后,文章提出了增加监测站点、采用先进技术提高监测频率、跨区域合作及数据公开共享等建议,以促进兰州市空气质量的持续改善和预测模型的进一步优化。
Abstract: This paper aims to accurately predict the Air Quality Index (AQI) of Lanzhou City by constructing a mixed model of EEMD-GGO-LSTM. Lanzhou City is an important industrial base and transportation hub in northwest China. Its air quality is affected by industrial emissions, traffic pollution, and the geographical environment, and it always has a high pollution level. In view of the strong mutability of AQI monitoring data in Lanzhou City, the paper first preprocessed the data, including filling in missing values and normalization processing, so as to improve the data quality. Then, the data is decomposed by ensemble empirical Mode decomposition (EEMD), extracting the intrinsic mode function (IMF), and the hyperparameters of the long short-term memory network (LSTM) model are optimized by Grey Wolf optimization algorithm (GWO) to improve the prediction accuracy. The experimental results show that the EEMD-GGO-LSTM mixed model has a lower root-mean-square error (RMSE) and higher determination coefficient (R2) when predicting AQI in Lanzhou compared with the single model and other mixed models, showing better prediction performance. Finally, the paper puts forward some suggestions, such as increasing monitoring stations, using advanced technology to improve monitoring frequency, cross-regional cooperation and open data sharing, so as to promote the continuous improvement of Lanzhou air quality and further optimization of a prediction model.
文章引用:崔萌, 王仲平, 何厚桦. 兰州市空气质量指数预测——基于LSTM的混合模型研究[J]. 应用数学进展, 2024, 13(10): 4683-4694. https://doi.org/10.12677/aam.2024.1310449

参考文献

[1] 周骥, 付世华, 彭丽, 等. 臭氧和PM2.5对慢阻肺死亡影响及气温修饰效应[J]. 中国环境科学, 2021, 41(12): 5904-5911.
[2] Lu, Q., Zheng, J., Ye, S., Shen, X., Yuan, Z. and Yin, S. (2013) Emission Trends and Source Characteristics of SO2, NOx, PM10 and Vocs in the Pearl River Delta Region from 2000 to 2009. Atmospheric Environment, 76, 11-20. [Google Scholar] [CrossRef
[3] 廖若雯. 基于多视角数据融合的PM2.5浓度预测研究——以兰州市为例[D]: [硕士学武术论文]. 兰州: 兰州财经大学, 2024.
[4] Bai, L., Wang, J., Ma, X. and Lu, H. (2018) Air Pollution Forecasts: An Overview. International Journal of Environmental Research and Public Health, 15, Article No. 780. [Google Scholar] [CrossRef] [PubMed]
[5] 杜续. 基于随机森林的PM2.5浓度预测模型研究[D]: [硕士学武术论文]. 西安: 西安邮电大学, 2018.
[6] Feng, X., Li, Q., Zhu, Y., Hou, J., Jin, L. and Wang, J. (2015) Artificial Neural Networks Forecasting of PM2.5 Pollution Using Air Mass Trajectory Based Geographic Model and Wavelet Transformation. Atmospheric Environment, 107, 118-128. [Google Scholar] [CrossRef
[7] Sun, W., Zhang, H., Palazoglu, A., Singh, A., Zhang, W. and Liu, S. (2013) Prediction of 24-Hour-Average PM2.5 Concentrations Using a Hidden Markov Model with Different Emission Distributions in Northern California. Science of the Total Environment, 443, 93-103. [Google Scholar] [CrossRef] [PubMed]
[8] Bianchini, M., Di Iorio, E., Maggini, M. and Pucci, A. (2008) Cyclostationary Neural Networks for Air Pollutant Concentration Prediction. 3rd IAPR TC3 Workshop, ANNPR 2008, Paris, 2-4 July 2008, 101-112. [Google Scholar] [CrossRef
[9] Dong, M., Yang, D., Kuang, Y., He, D., Erdal, S. and Kenski, D. (2009) PM2.5 Concentration Prediction Using Hidden Semi-Markov Model-Based Times Series Data Mining. Expert Systems with Applications, 36, 9046-9055. [Google Scholar] [CrossRef
[10] Samad, A., Garuda, S., Vogt, U. and Yang, B. (2023) Air Pollution Prediction Using Machine Learning Techniques—An Approach to Replace Existing Monitoring Stations with Virtual Monitoring Stations. Atmospheric Environment, 310, Article ID: 119987. [Google Scholar] [CrossRef
[11] Das, B., Dursun, Ö.O. and Toraman, S. (2022) Prediction of Air Pollutants for Air Quality Using Deep Learning Methods in a Metropolitan City. Urban Climate, 46, Article ID: 101291. [Google Scholar] [CrossRef
[12] Liu, X. and Guo, H. (2022) Air Quality Indicators and AQI Prediction Coupling Long-Short Term Memory (LSTM) and Sparrow Search Algorithm (SSA): A Case Study of Shanghai. Atmospheric Pollution Research, 13, Article ID: 101551. [Google Scholar] [CrossRef
[13] Hu, K., Guo, X., Gong, X., Wang, X., Liang, J. and Li, D. (2022) Air Quality Prediction Using Spatio-Temporal Deep Learning. Atmospheric Pollution Research, 13, Article ID: 101543. [Google Scholar] [CrossRef
[14] Dai, H., Huang, G., Zeng, H. and Zhou, F. (2022) PM2.5 Volatility Prediction by XGBoost-MLP Based on GARCH Models. Journal of Cleaner Production, 356, Article ID: 131898. [Google Scholar] [CrossRef
[15] Dai, H., Huang, G., Zeng, H. and Yu, R. (2022) Haze Risk Assessment Based on Improved PCA-MEE and ISPO-LightGBM Model. Systems, 10, Article No. 263. [Google Scholar] [CrossRef
[16] Huang, Y., Yu, J., Dai, X., Huang, Z. and Li, Y. (2022) Air-Quality Prediction Based on the EMD-IPSO-LSTM Combination Model. Sustainability, 14, Article No. 4889. [Google Scholar] [CrossRef
[17] Wu, Z. and Huang, N.E. (2009) Ensemble Empirical Mode Decomposition: A Noise-Assisted Data Analysis Method. Advances in Adaptive Data Analysis, 1, 1-41. [Google Scholar] [CrossRef
[18] Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., et al. (1998) The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 454, 903-995. [Google Scholar] [CrossRef
[19] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[20] Şenol, H., Çakır, İ.T., Bianco, F. and Görgün, E. (2024) Improved Methane Production from Ultrasonically-Pretreated Secondary Sedimentation Tank Sludge and New Model Proposal: Time Series (Arima). Bioresource Technology, 391, Article ID: 129866. [Google Scholar] [CrossRef] [PubMed]
[21] Luo, J. and Gong, Y. (2023) Air Pollutant Prediction Based on ARIMA-WOA-LSTM Model. Atmospheric Pollution Research, 14, Article ID: 101761. [Google Scholar] [CrossRef
[22] Guo, S., Dai, R., Sun, H. and Nian, Q. (2023) PTS-LSTM: Temperature Prediction for Fused Filament Fabrication Using Thermal Image Time Series. Journal of Manufacturing Processes, 106, 316-327. [Google Scholar] [CrossRef
[23] Mirjalili, S., Mirjalili, S.M. and Lewis, A. (2014) Grey Wolf Optimizer. Advances in Engineering Software, 69, 46-61. [Google Scholar] [CrossRef
[24] Lelieveld, J., Prokop, A. and Heusinkveld, H.G. (2015) What Climate Change Means for Air Quality and Human Health. Nature Climate Change, 5, 887-891.
[25] Zhang, Y., Song, G., Cai, J., Liu, W. and Chen, R. (2018) A Review of Relationships between Air Quality and Human Health: Recent Progress in China. Environment International, 121, 666-677.
[26] Krause, P., Boyle, D.P. and Bäse, F. (2005) Comparison of Different Efficiency Criteria for Hydrological Model Assessment. Advances in Geosciences, 5, 89-97. [Google Scholar] [CrossRef
[27] Makridakis, S., Spiliotis, E. and Assimakopoulos, V. (2020) Predicting/Hypothesizing the Findings of the M4 Competition. International Journal of Forecasting, 36, 29-36. [Google Scholar] [CrossRef
[28] Zhang, Q., Streets, D.G., Carmichael, G.R., He, K.B., Huo, H., Kannari, A., et al. (2009) Asian Emissions in 2006 for the NASA INTEX-B Mission. Atmospheric Chemistry and Physics, 9, 5131-5153. [Google Scholar] [CrossRef
[29] Liu, Z., Zhang, Q. and Streets, D.G. (2016) Anthropogenic Emissions of Non-Methane Volatile Organic Compounds in China. Atmospheric Chemistry and Physics, 16, 3545-3568.
[30] Zhang, Y., Shao, M., Zhang, Y. and Zhu, Y. (2018) Emission Factors of Particulate Matter and Trace Gases from Real-World Chinese Vehicles. Atmospheric Environment, 185, 94-102.
[31] Wang, S., Gao, J., Zhang, Q., Lei, Y. and He, K. (2017) Emission Inventories and Trends of Anthropogenic Air Pollutants in China (1990-2010). Atmospheric Chemistry and Physics, 17, 2839-2866.
[32] Liu, P., Zhang, R. and Zhao, X. (2015) A Review of Real-Time Monitoring of Ambient Volatile Organic Compounds (VOCs) and Its Application in China. Science of the Total Environment, 527-528, 251-260.
[33] Varis, O. (2014) Resources: Curb Vast Water Use in Central Asia. Nature, 514, 27-29. [Google Scholar] [CrossRef] [PubMed]