基于随机森林算法的鳄梨价格预测
Avocado Price Prediction Based on Random Forest Algorithm
DOI: 10.12677/MOS.2021.104102, PDF,   
作者: 陈梦凡, 张 涛*:广西科技大学,广西 柳州
关键词: 鳄梨价格决策树随机森林预测Avocado Prices Decision Tree Random Forest Prediction
摘要: 为了能够更好地预测鳄梨的价格走向趋势,解决在大量特征和大数据下价格预测精度低的问题。本研究在随机的基础上提出了一种基于Pearson系数的随机森林新的组合模型方法。首先,利用Pearson系数进行相关性检验,来进行特征筛选;对随机森林参数调优;最后利用剩余特征进行建模回归预测,并得出最终结论。实验结果表明:改进后的随机森林预测值的平均绝对误差(MAE)和均方误差(MSE)都得到了较大的提高。经研究发现,本文建立的新的组合模型,可以实现对鳄梨价格的短期预测,并且可以达到不错的预测效果。
Abstract: In order to be able to better predict prices to the trend of avocado, and solve the problem of low price prediction accuracy under a large number of features and big data, this study on the basis of random puts forward a random forest new combination model based on coefficient of Pearson method. Firstly, Pearson coefficient was used for correlation test to carry out feature screening; tuning random forest parameters; finally, residual features were used for modeling regression prediction, and the final conclusion was drawn. The experimental results show that the improved random forest predicted mean absolute error (MAE) and mean square error (MSE) got improved. The study found that through a new portfolio model, this paper can realize the avocado price short-term prediction, and can achieve good prediction effect.
文章引用:陈梦凡, 张涛. 基于随机森林算法的鳄梨价格预测[J]. 建模与仿真, 2021, 10(4): 1023-1031. https://doi.org/10.12677/MOS.2021.104102

参考文献

[1] 王鹏. 苹果品质评估与价格预测模型研究[D]: [硕士学位论文]. 泰安: 山东农业大学, 2019.
[2] Decker, B.K., Kelly, M.B., Mikolic, J., Walker, J.D. and Clancy, C.J. (2018) 1726 A Random Forest Prediction Model Accurately Identifies Periods at Increased Risk for Positive Legionella Cultures in a Hospital Water Distribution System. Open Fo-rum Infectious Diseases, 5, S54-S55. [Google Scholar] [CrossRef
[3] Ließ, M., Hitziger, M., Huwe, B. and Bradley, R.L. (2014) The Sloping Mire Soil-Landscape of Southern Ecuador: Influence of Predictor Resolution and Model Tuning on Random Forest Predictions. Applied and Environmental Soil Science, 2014, Article ID 603132. [Google Scholar] [CrossRef
[4] 王章章. 基于机器学习的价格预测模型研究与实现[D]: [硕士学位论文]. 西安: 长安大学, 2018.
[5] 乔麟婷. 决策树算法研究[J]. 课程教育研究, 2018(48): 224-225.
[6] 安葳鹏, 尚家泽. 决策树C4.5算法的改进与分析[J]. 计算机工程与应用, 2019, 55(12): 169-173.
[7] Zhang, H.z., Joshua, Z., Dan, N. and Nordman, D.J. (2020) Random Forest Prediction Intervals. The American Statistician, 74, 392-406. [Google Scholar] [CrossRef
[8] Zhang, Q.-Y., Aires-de-Sousa, J. (2007) Random Forest Prediction of Mutagenicity from Empirical Physicochemical Descriptors. Journal of Chemical Information and Modeling, 47, 1-8. [Google Scholar] [CrossRef] [PubMed]
[9] 李超. 机器学习模型在股票价格时间序列分析中的应用与比较[J]. 电子世界, 2021(9): 66-70.
[10] 张家通. 基于数据挖掘的笔记本电脑价格影响因素研究——以京东商城为例[J]. 信息技术与信息化, 2021(1): 58-62.
[11] 单文煜, 吴垠, 陈鹏. 基于机器学习的机票价格预测研究[J]. 现代计算机, 2020(22): 35-38.
[12] 焦岑. 基于随机森林与神经网络的汽车价格影响因素的研究[D]: [硕士学位论文]. 苏州: 苏州大学, 2020.
[13] 曹洁. 基于随机森林模型的二手车价值评估研究[D]: [硕士学位论文]. 石家庄: 河北经贸大学, 2020.
[14] 万威. 灰色预测模型在材料价格预测中的应用[J]. 江苏科技信息, 2021, 38(15): 44-46.