基于特征选择和SSA-LSSVM的短期PM2.5浓度预测
Short-Term PM2.5 Concentration Prediction Based on Feature Selection and SSA-LSSVM
摘要: 为了提升PM2.5浓度的预测精度,考虑到PM2.5浓度受时间序列特征和非线性特征等原因的影响,导致了时间序列分析模型在预测PM2.5浓度时会存在较大偏差。为此,提出一种基于特征选择和麻雀搜索算法(Sparrow Search Algorithm, SSA)优化最小二乘支持向量机(Least Squares Support Vector Machine, LSSVM)参数的定量预测模型。首先,将14个特征变量进行二进制编码,利用遗传算法结合最小二乘支持向量机对特征变量进行优选,获取最优特征子集;利用SSA算法对LSSVM的参数进行优化,构建SSA-LSSVM的PM2.5浓度预测模型。实验结果表明,基于遗传算法进行特征选择和麻雀算法优化最小二乘支持向量机参数的模型,具有明显的预测效果。其中,该模型的RMSE和MAE分别为10.53和8.01,预测精度均高于其它模型。
Abstract: In order to improve the prediction accuracy of PM2.5 concentration, considering the time series characteristics of the PM2.5 concentration influencing factor dataset and the nonlinear characteristics of the data, the time series analysis model still has a large error in predicting PM2.5 concentration. To this end, a quantitative prediction model based on feature selection and Sparrow Search Algorithm (SSA) to optimize the parameters of Least Squares Support Vector Machine (LSSVM) is proposed. First, the 14 feature variables are binary coded, and the feature variables are optimized by using the genetic algorithm combined with the Least Squares Support Vector Machine to obtain the optimal feature subset; the SSA algorithm was used to optimize the parameters of the LSSVM, and the PM2.5 concentration prediction model of the SSA-LSSVM was constructed. The experimental results show that the model based on the genetic algorithm for feature selection and the sparrow algorithm to optimize the parameters of the Least Squares Support Vector Machine has obvious prediction effect. Among them, the RMSE and MAE of this model are 10.53 and 8.01, respectively, and the prediction accuracy is higher than other models.
文章引用:金春梅. 基于特征选择和SSA-LSSVM的短期PM2.5浓度预测[J]. 应用数学进展, 2022, 11(3): 947-955. https://doi.org/10.12677/AAM.2022.113101

参考文献

[1] 康俊锋, 谭建林, 方雷, 等. XGBoost-LSTM变权组合模型支持下的短期PM_(2.5)浓度预测——以上海为例[J/OL]. 中国环境科学: 2021: 1-16.
[2] 谢永华, 张鸣敏, 杨乐, 等. 基于支持向量机回归的城市PM2.5浓度预测[J]. 计算机工程与设计, 2015, 36(11): 3106-3111.
[3] 任才溶, 谢刚. 基于随机森林和气象参数的PM2.5浓度等级预测[J]. 计算机工程与应用, 2019, 55(2): 213-220.
[4] 夏晓圣, 陈菁菁, 王佳佳, 等. 基于随机森林模型的中国PM2.5浓度影响因素分析[J]. 环境科学, 2020, 41(5): 2057-2065.
[5] 白盛楠, 申晓留. 基于LSTM循环神经网络的PM2.5预测[J]. 计算机应用与软件, 2019, 36(1): 67-70.
[6] 赵文芳, 林润生, 唐伟, 等. 基于深度学习的PM2.5短期预测模型[J]. 南京师大学报(自然科学版), 2019, 42(3): 32-41.
[7] 王勇, 王泓易, 刘严萍, 等. 融合GNSS水汽、风速与大气污染物的河北省冬季PM_(2.5)浓度预测研究[J]. 大地测量与地球动力学, 2020, 40(11): 1145-1152. [Google Scholar] [CrossRef
[8] 张旭, 杜景林. 改进PSO-GA-BP的PM2.5浓度预测[J]. 计算机工程与设计, 2019, 40(6): 1718-1723. [Google Scholar] [CrossRef
[9] 李建更, 吴水生. 基于PLS-M5P模型的PM_(2.5)浓度预测[J]. 计算机与应用化学, 2018, 35(12): 959-970. [Google Scholar] [CrossRef
[10] 马天成, 刘大铭, 李雪洁, 等. 基于改进型PSO的模糊神经网络PM_(2.5)浓度预测[J]. 计算机工程与设计, 2014, 35(9): 3258-3262. [Google Scholar] [CrossRef
[11] 王文涛. 深度学习结合支持向量机在人脸表情识别中的应用研究[D]: [硕士学位论文]. 西安: 长安大学, 2016.
[12] 孟凡念, 杜文辽, 巩晓赟, 等. 基于粒子群优化最小二乘支持向量机的滚动轴承故障识别[J]. 轴承, 2020(12): 43-50.
[13] Yang, L., Yang, S., Li, S., et al. (2015) Coupled Compressed Sensing Inspired Sparse Spatial-Spectral LSSVM for Hyperspectral Image Classification. Knowledge-Based Systems, 79, 80-89. [Google Scholar] [CrossRef
[14] Zenendehboudi, A. (2016) Implementation of GA-LSSVM Modelling Approach for Estimating the Performance of Solid Desiccant Wheels. Energy Conversion and Management, 127, 245-255. [Google Scholar] [CrossRef