基于特征选择的SSA-XGBoost水质量预测研究
Research on SSA-XGBoost Water Quality Prediction Based on Feature Selection
摘要: 为了能够更好的实现水资源的利用,针对目前对水质预测研究中存在的特征参数复杂、单一模型预测模型精度和适应度欠佳等问题,提出了一种基于XGBoost的水质预测模型。首先利用主成分分析方法对特征进行选择,降低问题复杂度和计算成本,并对数据中的缺失值进行填充,其次采用麻雀搜索算法(SSA)对XGBoost模型中的参数进行优化,采用优化后的参数对水质进行预测。最后在不同实验条件下对水质进行预测,实验结果证明,本文提出的SSA-XGBoost方法与现有方法相比,具有更优秀的性能。
Abstract: In order to better realize the utilization of water resources, a water quality prediction model based on XGBoost is proposed in view of the problems existing in the current research on water quality prediction, such as complex characteristic parameters, poor precision and fitness of a single model prediction model, etc. Firstly, the principal component analysis method is used to select features, reduce problem complexity and computational costs, and fill in missing values in the data. Secondly, the sparrow search algorithm (SSA) is used to optimize the parameters in the XGBoost model, and the optimized parameters are used to predict water quality. Finally, water quality was predicted under different experimental conditions, and the experimental results showed that the SSA-XGBoost method proposed in this paper has better performance compared to existing methods.
文章引用:赵桐, 刘媛华. 基于特征选择的SSA-XGBoost水质量预测研究[J]. 建模与仿真, 2023, 12(4): 4183-4194. https://doi.org/10.12677/MOS.2023.124381

参考文献

[1] Heddam, S. and Kisi, O. (2017) Extreme Learning Machines: A New Approach for Modeling Dissolved Oxygen (DO) Con-centration with and without Water Quality Variables as Predictors. Environmental Science and Pollution Research, 24, 16702-16724. [Google Scholar] [CrossRef] [PubMed]
[2] Mitrović, T., Antanasijević, D., Lazović, S., Perić-Grujić, A. and Ristić, M. (2019) Virtual Water Quality Monitoring at Inactive Monitoring Sites Using Monte Carlo Optimized Artificial Neural Networks: A Case Study of Danube River (Serbia). Science of the Total Environment, 654, 1000-1009. [Google Scholar] [CrossRef] [PubMed]
[3] Tiwari, S., Babbar, R. and Kaur, G. (2018) Performance Evaluation of Two Anfis Models for Predicting Water Quality Index of River Satluj (India). Advances in Civil Engineering, 2018, 1-10. [Google Scholar] [CrossRef
[4] Rankinen, K., Cano Bernal, J.E., Holmberg, M., Vuorio, K. and Granlund, K. (2019) Identifying Multiple Stressors That Influence Eutrophication in a Finnish Agricultural River. Science of the Total Envi-ronment, 658, 1278-1292. [Google Scholar] [CrossRef] [PubMed]
[5] Ahmed, A.A.M. (2017) Prediction of Dissolved Oxygen in Surma River by Biochemical Oxygen Demand and Chemical Oxygen Demand Using the Artificial Neural Networks (ANNs). Journal of King Saud University—Engineering Sciences, 29, 151-158. [Google Scholar] [CrossRef
[6] 查文舒, 李道伦, 沈路航, 张雯, 刘旭亮. 基于神经网络的偏微分方程求解方法研究综述[J]. 力学学报, 2022, 54(3): 543-556.
[7] 张皓, 涂雅培, 高瑜翔, 唐军, 黄天赐. 基于多重模糊神经网络的PID温度控制算法[J]. 西华大学学报(自然科学版), 2023, 42(4): 58-65+81.
[8] 李晶晶, 张永敏, 田桂林, 崔胜胜, 严洁. 基于LSTM神经网络的数据驱动空间负荷预测方法[J]. 电子设计工程, 2022, 30(22): 154-157.
[9] 陆继翔, 张琪培, 杨志宏, 涂孟夫, 陆进军, 彭晖. 基于CNN-LSTM混合神经网络模型的短期负荷预测方法[J]. 电力系统自动化, 2019, 43(8): 131-137.
[10] 韩伟, 李钢. 主成分分析在地区科技竞争力评测中的应用[J]. 数理统计与管理, 2006(5): 512-517.
[11] 方红卫, 孙世群, 朱雨龙, 等. 主成分分析法在水质评价中的应用及分析[J]. 环境科学与管理, 2009, 34(12): 152-154.
[12] 刘臣辉, 吕信红, 范海燕. 主成分分析法用于环境质量评价的探讨[J]. 环境科学与管理, 2011, 36(3): 183-186.
[13] Xue, J. and Shen, B. (2020) A Novel Swarm Intelligence Optimization Approach: Sparrow Search Algorithm. Systems Science & Control Engineering, 8, 22-34. [Google Scholar] [CrossRef