基于LDA主题模型的重庆市5A级景区旅游评价研究
Research on Tourism Evaluation of 5A Scenic Spots in Chongqing Based on LDA Theme Model
摘要: 随着“互联网 + 旅游”模式的普及,网络中旅游信息呈爆炸式增长,其中用户评论由于数据的非结构特点及多样性,难以直接使用。本文通过挖掘用户评论对景区进行旅游评价,有利于游客和景区负责人多方面了解景点形象。本文以重庆市5家著名“AAAAA”级景区为研究对象,爬取12,571条评论数据。首先,借助LDA主题模型挖掘评论文本中的主题信息,并构建综合评价体系。然后,以游客对主题的关注程度为依据,对评价指标进行相应的赋权处理,得到评价得分。最后,对5家景区进行得分排序,并根据结果进行分析。从提取的主题信息来看,游客关注的五个主题为:平台服务、景区管理、自然景观、夏冬体验、性价比;从高到低,综合得分排序结果为:金佛山、武隆喀斯特、巫山小三峡、云阳龙岗、四面山。
Abstract: With the popularization of the “Internet + Tourism” model, the tourism information in the network has exploded. Among them, user reviews are difficult to be directly used due to the unstructured characteristics and diversity of data. This paper evaluates the tourism of scenic spots by mining user comments, which is conducive to tourists and scenic spot leaders to understand the image of scenic spots in many aspects. This paper takes five famous “AAAAA” scenic spots in Chongqing as the research object, and crawls 12,571 comment data. Firstly, LDA topic model is used to mine the topic information in the comment text and construct a comprehensive evaluation system. Then, based on the tourists’ attention to the theme, the evaluation index is weighted accordingly, and the evaluation score is obtained. Finally, the five scenic spots are ranked and analyzed according to the results. From the extracted theme information, the five themes that tourists pay attention to are: platform service, scenic spot management, natural landscape, summer and winter experience, and cost performance. From high to low, the composite score ranking results are: Jinfo Mountain, Wulongkast, Wushan Little Three Gorges, Yunyang Longgang and Simian Mountain.
文章引用:龚乃林, 赵胜利. 基于LDA主题模型的重庆市5A级景区旅游评价研究[J]. 统计学与应用, 2021, 10(6): 1053-1059. https://doi.org/10.12677/SA.2021.106112

参考文献

[1] 百度百科. 重庆景点[EB/OL]. https://baike.baidu.com/item/重庆景点, 2021-10-25.
[2] 新浪财经. 较2020年增长31.2%! 2021年国庆假期重庆A级旅游景区累计接待游客1273万人次[EB/OL]. https://finance.sina.com.cn/jjxw/2021-10-07/doc-iktzqtyu0133479.shtml, 2021-10-25.
[3] 周文亮. 基于LDA改进的AHP旅游目的地评价研究——以江西省5A级景区为例[D]: [硕士学位论文]. 南昌:江西财经大学, 2021.
[4] Blei, D.M., Ng, A.Y. and Jordan, M.I. (2012) Latent Dirichlet Allocation. Journal of Machine Learning Research, 3, 993-1022.
[5] 赵蓉英, 戴祎璠, 王旭. 基于LDA模型与ATM模型的学者影响力评价研究——以我国核物理学科为例[J]. 情报科学, 2019, 37(6): 3-9.
[6] Azzopardi, L., Girolami, M. and Risjbergen, K.V. (2003) Investigating the Relationship between Language Model Perplexity and IR Precision- Recall Measures. International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, 28 July-1 August 2003, 369-370.
[Google Scholar] [CrossRef