Factors Influencing Passenger Satisfaction Based on Text Data Mining
Abstract:
With the rapid growth of the Internet and aviation industries, more and more people buy airline tickets online, and many passengers review their flights afterwards. Using this text data, this paper studies which airline service characteristics affect passenger satisfaction, in order to help airlines improve the corresponding services and enhance the passenger flight experience. A Python crawler is used to collect reviews written by China Eastern Airlines passengers on the CAPSE website. The data are first preprocessed. Second, high-frequency words in the review text are counted. Third, an LDA topic model is applied to extract topic keywords, identifying from the users' perspective the service characteristics that passengers care about. Fourth, the TF-IDF method is used to convert the review text into a word-vector matrix based on these service characteristics. Finally, the correlation coefficient method and a decision-tree-based feature importance analysis show that the key factors affecting passenger satisfaction with airline service include flight punctuality, the level of flight attendant service, and the cabin environment.
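As a rough illustration of the last two steps described above (TF-IDF weighting of service-feature terms, then correlation with satisfaction), the following minimal sketch uses only the Python standard library. The reviews, scores, and feature terms are invented for illustration and are not the paper's data; the real reviews are Chinese text that would first need word segmentation, and the feature terms here merely stand in for LDA topic keywords. The decision-tree importance step is not shown.

```python
import math
from collections import Counter

# Hypothetical, pre-tokenized toy reviews paired with a satisfaction score (1-5).
# These are made-up examples, not data from the CAPSE website.
reviews = [
    (["flight", "delay", "delay", "crew", "rude"], 1),
    (["delay", "seat", "cramped"], 2),
    (["crew", "friendly", "seat", "clean"], 4),
    (["punctual", "crew", "friendly", "meal"], 5),
    (["punctual", "seat", "clean", "meal"], 5),
]

# Hypothetical service-feature terms, standing in for LDA topic keywords.
features = ["delay", "punctual", "friendly", "clean"]


def tfidf_matrix(docs, vocab):
    """Return one row per document with a TF-IDF weight for each vocab term."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    doc_freq = {term: sum(1 for doc in docs if term in doc) for term in vocab}
    rows = []
    for doc in docs:
        counts = Counter(doc)
        row = []
        for term in vocab:
            tf = counts[term] / len(doc)
            idf = math.log(n_docs / doc_freq[term]) if doc_freq[term] else 0.0
            row.append(tf * idf)
        rows.append(row)
    return rows


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0


docs = [tokens for tokens, _ in reviews]
scores = [score for _, score in reviews]
matrix = tfidf_matrix(docs, features)

# Correlate each feature's TF-IDF column with the satisfaction scores.
correlations = {
    term: pearson([row[j] for row in matrix], scores)
    for j, term in enumerate(features)
}

for term, r in sorted(correlations.items(), key=lambda item: item[1]):
    print(f"{term:10s} {r:+.2f}")
```

On this toy data, terms that appear mostly in low-scoring reviews receive a negative coefficient and terms from high-scoring reviews a positive one, which is the kind of signal the correlation step relies on.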