基于细粒度情感的文本挖掘及可视化分析
Text Mining and Visualization Analysis Based on Fine-Grained Sentiment
摘要:
针对当前文本挖掘情感分析缺乏系统性分析研究,建立综合评价指标,系统的分析产品间优劣势,明确改进方向。构建基于细粒度情感分析的模型,首先通过对评论文本进行预处理及分词,再运用LDA主题模型构建属性词典,运用知网情感词词库结合网络新词构建情感词典;接着建立评论有用性规则与情感打分规则,对有用短语打分,获取情感数据集;最后建立四大评价指标,对三款手机进行综合评价及可视化分析。模型数据结果表明,四大指标能够显著突出产品间优劣势,可以帮助生产者更快更准确的了解重点发展方向,也可以帮助消费者更便利的选择钟爱的产品。
Abstract:
For the current text mining sentiment analysis, there is a lack of systematic analysis and research, this paper establishes comprehensive evaluation indicators, systematically analyzes the advantages and disadvantages of products, clarifies the direction of improvement and constructs a model based on fine-grained sentiment analysis. Firstly, the comment text is preprocessed and segmented, and then the LDA topic model is used to build the attribute dictionary, and the HowNet sentiment vocabulary is used to build the sentiment dictionary with new words on the Internet; then, in order to obtain the sentiment data set, use the comment usefulness rule and sentiment score rules, scoring useful phrases; finally, four major evaluation indicators are established to conduct comprehensive evaluation and visual analysis of three mobile phones. The results of the model data show that the four indicators can significantly highlight the advantages and disadvantages of the products, which can help the producers to understand the key development direction more quickly and accurately, and also help consumers to choose the products they like more conveniently.
参考文献
|
[1]
|
中国互联网络信息中心. 第44次《中国互联网络发展状况统计报告》[EB/OL].
http://www.cac.gov.cn/2019-08/30/c_1124938750.htm, 2019-8-30.
|
|
[2]
|
张玉峰, 朱莹. 基于Web文本挖掘的企业竞争情报获取方法研究[J]. 情报理论与实践, 2006(5): 563-566.
|
|
[3]
|
薛为民, 陆玉昌. 文本挖掘技术研究[J]. 北京联合大学学报(自然科学版), 2005(4): 59-63.
|
|
[4]
|
张振华, 许柏鸣. 基于在线评论文本挖掘的商业竞争情报分析模型构建及应用[J]. 情报科学, 2019, 37(2): 149-153+ 160.
|
|
[5]
|
王克勤, 毋凤君. 面向产品设计改进的在线评论挖掘[J]. 计算机工程与应用, 2019, 55(19): 235-245+252.
|
|
[6]
|
林崇德, 杨治良, 黄希庭. 心理学大辞典[M]. 上海: 上海教育出版社, 2003.
|
|
[7]
|
Wilson, T., Wiebe, J. and Hoffmann, P. (2005) Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis, Vancouver, British Columbia, Canada, F oct, 2005. Association for Computational Linguistics.
|
|
[8]
|
唐晓波, 刘广超. 细粒度情感分析研究综述[J]. 图书情报工作, 2017, 61(5): 132-140.
|
|
[9]
|
杨东红, 吴邦安, 陈天鹏, 等. 基于京东商城评价数据的在线商品好评、中评、差评比较研究[J]. 情报科学, 2019, 37(2): 125-132.
|
|
[10]
|
杨程, 谭昆, 俞春阳. 基于评论大数据的手机产品改进[J/OL]. 计算机集成制造系统, 1-19.
http://kns.cnki.net/kcms/detail/11.5946.TP.20190606.1053.008.html, 2020-10-03.
|