基于机器学习和百度搜索指数的客流量预测
Passenger Traffic Forecast Based on Machine Learning and Baidu Search Index
摘要: 随着互联网的迅速发展和旅游业的兴起,以百度搜索指数为基础的客流量预测可以帮助景区通过某些关键词的搜索量来判断游客的需求与关注度,及时做出改进与对策,从而高效而准确地服务游客。文章以预测九寨沟的日游客量为目标,通过初选、拓展,最后利用皮尔逊相关系数及时差相关法筛选出与九寨沟客流量紧密相关的关键词。接着将这些数据进行对数变换,利用随机森林、支持向量机回归、BP神经网络三种机器学习方法分别来预测九寨沟客流量。以决定系数R2和均方误差ME为评价指标并对比三个模型的结果值,择出最优预测模型。结果表明:BP神经网络的ME为0.019986,R2为0.83669,相较于另外两种预测模型而言,对九寨沟客流量的预测效果最好,可以帮助景区做好资源规划和应急准备。
Abstract: With the rapid development of the Internet and the rise of tourism, the passenger flow prediction based on Baidu search index index can help scenic spots to judge the needs and attention of tourists through the search volume of some keywords, and make timely improvements and countermeasures, so as to serve tourists efficiently and accurately. The article aims to predict the daily number of tourists in Jiuzhaigou, and through the primary selection and expansion, it finally selects the key words closely related to the passenger flow of Jiuzhaigou by means of Pearson’s correlation coefficient. Then the data were log-transformed, and three machine learning methods of random forest, support vector machine regression and BP neural network were used to predict the passenger flow in Jiuzhaigou respectively. Using the determination coefficient R2 and the mean squared error ME as the evaluation indexes and comparing the result values of the three models to select the optimal prediction models. The results show that the ME of BP neural network is 0.019986 and R2 is 0.83669. Compared with the other two prediction models, the prediction effect of Jiuzhaigou passenger flow is the best, which can help the scenic spot to make resource planning and emergency preparedness.
文章引用:李晓楠, 史尚涵, 施雪姣. 基于机器学习和百度搜索指数的客流量预测[J]. 运筹与模糊学, 2024, 14(6): 963-973. https://doi.org/10.12677/orf.2024.146593

参考文献

[1] 殷杰, 郑向敏, 董斌彬. 基于VECM模型的景区网络关注度与旅游人数的关系研究——以鼓浪屿为例[J]. 福建农林大学学报(哲学社会科学版), 2015, 18(5): 68-75.
[2] 刘悦, 吕兴洋, 周晓丽. 冰雪旅游景区网络关注度和实际客流量之间的关系研究——以成都西岭雪山景区为例[J]. 山西师范大学学报(自然科学版), 2021, 35(4): 96-102.
[3] Tihomir, S. (2002) A Comparison of Two Econometric Models (OLS and SUR) for Forecasting Croatian Tourism Arrivals. Working Papers 8, Croatian National Bank, Zagreb.
[4] 胡勇, 黄磊. 中国大陆赴香港旅游预测模型[J]. 国际经贸探索, 2008(5): 41-44.
[5] Li, X., Pan, B., Huang, X.K. (2017) Forecasting Tourism Demand with Composite Search Index. Tourism Management, 59, 57-66. [Google Scholar] [CrossRef
[6] Yang, X., Pan, B., Lv, B.F. (2015) Forecasting Chinese Tourist Volume with Search Engine Data. Tourism Management, 46, 386-397. [Google Scholar] [CrossRef