基于游记文本内容的旅游场景知识图谱的构建
Construction of Tourism Scene Knowledge Graph Based on the Content of Travelogs
摘要: [目的/意义]:传统以旅游本体概念为基础的知识表示形式,侧重对静态特征的描述,缺乏对动态特征的捕捉,如若考虑融入在线UGC中的一些动态信息,如:旅游活动事件发生的顺承关系,群体旅游情绪等特征,可进一步增强旅游知识图谱的可解释性,从而为当今体验式旅游的智能应用提供更加具有可推理性的知识库。[方法/过程]:因此,本文提出一种基于贝叶斯网络的旅游场景知识图谱表示模型,该模型以旅游活动事件作为单位实体节点,从游记文本中获取人们的旅游情感信息扩充实体的动态属性,并以人们在目的地之间的转移概率和情感差值作为边的动态属性,丰富图谱中边的信息,从而提升知识图谱的可解释性。[结论]:最终实验表明,基于本文所构建的旅游场景知识图谱设计的LTTE推荐模型,在旅游目的地推荐实验中表现的效果明显优于基准算法TF-IDF。
Abstract: The traditional knowledge representation based on the concept of tourism ontology, which is focus-ing on the description of static features and lacks dynamic features. If some dynamic information hidden in the online UGC is fully utilized, such as the inheritance relationship of tourism events, group tourism Features such as emotions, can further enhance the interpretability of the tourism knowledge graph, thereby providing a more inferred knowledge graph for intelligent applications of experiential tourism. Therefore, this paper proposes a tourism scene knowledge graph (TSKG) representation model based on Bayesian network. The model uses tourism events as unit entity to obtain people’s travel emotional information from the travelogs. The sentiment difference and transition probability between destinations is used as the dynamic attribute of edges to enrich the information of edges in the graph. The experimental result shows that the LTTE recommendation model based on the TSKG performs significantly better than the benchmark algorithm TF-IDF in the tourism destination recommendation experiment.
文章引用:张小红, 马彪. 基于游记文本内容的旅游场景知识图谱的构建[J]. 数据挖掘, 2020, 10(1): 56-67. https://doi.org/10.12677/HJDM.2020.101006

参考文献

[1] http://www.world-tourism.org
[2] DERI, on Tour Ontology.
[3] Mili, H., Valtchev, P., Charif, Y., et al. (2011) E-Tourism Portal: A Case Study in Ontology-Driven Development. In: E-Technologies: Transformation in a Connected World, Springer, Berlin Heidelberg, 76-99.
[Google Scholar] [CrossRef
[4] 冯欣, 王成良. 本体在旅游信息系统中的应用研究[J]. 计算机与现代化, 2010(3): 128-132.
[5] 李艳, 王重英, 屈正庚. 基于主题词表的旅游政务系统本体构建研究[J]. 信息技术, 2015(3): 53-56.
[6] 奚凡. 基于情景感知的自适应旅游活动与推荐系统研究[D]: [硕士学位论文]. 上海: 东华大学, 2013.
[7] 杨青云, 尹鹏飞. 基于语义网的旅游信息服务平台研究[J]. 邵阳学院学报(自然科学版), 2014, 11(2): 19-24.
[8] Yuan, H., Qian, Y., Yang, R. and Ren, M. (2014) Human Mobility Discovering and Movement Intention Detection with GPS Trajectories. Decision Support Systems, 63, 39-51.
[Google Scholar] [CrossRef
[9] Lu, X., Wang, C., Yang, J.M., et al. (2010) Photo2Trip: Generating Travel Routes from Geo-Tagged Photos for Trip Planning. In: ACM International Conference on Multimedia, ACM, New York, 143-152.
[Google Scholar] [CrossRef
[10] 赵振斌, 党娇. 基于网络文本内容分析的太白山背包旅游行为研究[J]. 人文地理, 2011, 26(1): 134-139.
[11] 吴恒, 陈燕翎. 基于UGC文本挖掘的游客目的地选择信息研究——以携程蜜月游记为例[J]. 情报科学, 2017, 35(1): 101-105.
[12] Banyai, M.T.D. (2012) Evaluating Research Methods on Travel Blogs. Journal of Travel Research, 51, 267-277.
[Google Scholar] [CrossRef
[13] 胡乔楠. 基于旅游文记的旅游景点推荐及行程路线规划系统[D]: [硕士学位论文]. 杭州: 浙江大学, 2015.
[14] Hao, Q., Cai, R., Wang, X.J., et al. (2009) Generating Loca-tion Overviews with Images and Tags by Mining User-Generated Travelogues. Proceedings of the 17th In-ternational Conference on Multimedia 2009, Vancouver, British Columbia, Canada, October 19-24 2009, 801-804.
[15] 梁柱, 曾绍玮. 知识表示技术研究[J]. 科学咨询(决策管理), 2010(1): 52.
[16] 廖开际, 叶东海, 闫健峻, 等. 基于加权语义网的专家知识发现及表示方法[J]. 情报学报, 2012, 31(1): 60-64.
[17] 陈祖国, 李勇刚, 卢明, 陈超洋, 刘端. 基于贝叶斯概率语义网的铝电解槽况知识表示模型与约简方法[J/OL]. 控制与决策, 1-16.