基于随机森林算法的美国芝加哥空气质量预测研究
A Study on Air Quality Prediction in Chicago, USA Based on Random Forest Algorithm
DOI: 10.12677/mos.2025.1410609, PDF,   
作者: 山 昊, 范晓沁, 杨玉兰:上海理工大学管理学院,上海;房志明, 黄中意*:上海理工大学管理学院,上海;上海理工大学智慧应急管理学院,上海
关键词: 随机森林AQI相关性空气质量预测Random Forest AQI Correlation Air Quality Prediction
摘要: 随着城市化的加速,空气污染问题日益严重,因此准确预测影响空气质量的污染物含量对于改善城市空气质量具有重要意义。本文基于随机森林算法建立了空气质量预测模型。该模型仅利用与空气质量指数(AQI)相关性较强的污染物历史数据,即可对未来的空气质量进行短期预测,模型操作简单,预测结果精确度也较高。首先,选取了美国芝加哥市2019年1月至2021年1月的空气质量综合指数,对该城市的AQI数据进行相关性分析,识别出与AQI相关的主要污染物指标。然后,采用随机森林算法建立空气质量预测模型。实验结果表明,本文建立的空气质量预测模型对于芝加哥城市O3浓度的训练集上的R2达到了0.92,验证集上的拟合优度为0.87,对于PM2.5、NO2、SO2等其他主要污染物也有较好的拟合效果。本研究基于AQI提取的相关性污染物指标的随机森林预测模型能显著提升空气质量预测的准确性,能够为空气质量管理提供有效的决策支持。
Abstract: With the acceleration of urbanization, air pollution has become increasingly severe. Therefore, accurately predicting pollutant concentrations that affect air quality is crucial for improving urban air quality. This paper establishes an air quality prediction model based on the Random Forest algorithm. This model can make short-term predictions of future air quality using only historical data of pollutants strongly correlated with the Air Quality Index (AQI). The model is simple to operate and delivers highly accurate prediction results. First, the comprehensive air quality index data for Chicago, USA, from January 2019 to January 2021 were selected. Correlation analysis of the city’s AQI data identified key pollutant indicators correlated with AQI. The Random Forest algorithm was then employed to build the air quality prediction model. Experimental results demonstrate that the proposed air quality prediction model achieves a coefficient of determination of 0.87 for O₃ concentrations in Chicago, with comparable fitting performance for other primary pollutants, including PM2.5, NO₂, and SO₂. This random forest model, trained on correlation-based pollutant indicators extracted from AQI data, significantly enhances the accuracy of air quality forecasting and provides effective decision support for air quality management.
文章引用:山昊, 范晓沁, 杨玉兰, 房志明, 黄中意. 基于随机森林算法的美国芝加哥空气质量预测研究[J]. 建模与仿真, 2025, 14(10): 97-107. https://doi.org/10.12677/mos.2025.1410609

参考文献

[1] 陈雪芬. 新形势下环境工程中大气污染的危害和治理[J]. 石化技术, 2025, 32(6): 391-393.
[2] 王璐. 兰州市AQI指数与气象要素相关性研究[J]. 化学工程与装备, 2023(2): 286-287, 301.
[3] 郑舒天, 朱黎阳, 李自然, 等. 基于AQI的文山市空气质量统计预报研究[J]. 中国资源综合利用, 2022, 40(1): 77-80.
[4] 刘诗韵, 云静波, 姜雨萌, 等. 乌兰察布市空气质量状况及AQI预报方法研究[J]. 内蒙古气象, 2023(2): 32-38.
[5] 牛艳飞. 基于时间序列分析的鹤壁市AQI预测模型[J]. 中阿科技论坛(中英文), 2022(4): 84-88.
[6] Li, X., Peng, L., Yao, X., Cui, S., Hu, Y., You, C., et al. (2017) Long Short-Term Memory Neural Network for Air Pollutant Concentration Predictions: Method Development and Evaluation. Environmental Pollution, 231, 997-1004. [Google Scholar] [CrossRef] [PubMed]
[7] 史雨梅, 余晓美. 图卷积神经网络在空气质量指数预测中的应用研究[J]. 三明学院学报, 2024, 41(3): 58-66.
[8] 文昌俊, 陈洋洋, 何永豪, 等. 多策略SMA-BP神经网络的空气质量指数预测[J]. 电子测量技术, 2023, 46(22): 78-86.
[9] 路凯丽, 杨露, 李涛. 基于集成深度学习模型的空气质量指数预测[J]. 南京信息工程大学学报, 2024, 16(1): 56-65.