基于词注意力的BiLSTM和CNN集成模型的中文情感分析
Word Attention-Based BiLSTM and CNN Ensemble for Chinese Sentiment Analysis
DOI: 10.12677/CSA.2020.102032, PDF,  被引量   
作者: 孙 凯*:云南财经大学统计与数学学院,云南 昆明
关键词: 情感分析Word2vecWABCSM词注意力机制Sentiment Analysis Word2vec WABCSM Word Attention
摘要: 电影评论作为观众观影的重要情感流露方式,是情感挖掘研究的一个重要领域。本文提出了一种基于词注意力机制的BiLSTM和CNN集成模型(Word Attention-Based BiLSTM & CNN Stacking Model, WABCSM),以豆瓣平台的电影中文评论为研究对象,分析影迷对电影的情感倾向,来论证本文模型的可行性。首先利用word2vec训练文本,得到文本的词向量表示,然后使用基于词注意力机制的BiLSTM和CNN集成模型来进行情感挖掘以更好的提取文本中的情感词,达到正确分类的目的。本文模型相比较传统的LSTM模型和CNN模型,实验结果显示在分类精准率和召回率上都有很大提升。尤其是注意力机制的添加,使得本文模型对情感特征也可以进行有效的提取,从而有效克服了口语化短文本情感极性判断的难点问题。
Abstract: Film comments as an important way of expressing emotions in audience viewing, and an im-portant field of emotional mining research. This paper proposes a stacking model based on word attention mechanism BiLSTM and CNN (Word Attention-Based BiLSTM and CNN Stacking Model, WABCSM), and uses the Chinese commentary of the Douban platform as the research object to analyze the emotional tendency of fans to the film. The feasibility of the model. Firstly, the word vector is used to train the text, and the word vector representation is obtained. Then, the fusion model based on word attention mechanism BiLSTM and CNN is used to perform emotion mining to better extract the emotional words in the text to achieve the purpose of correct classification. The model of this paper is valid. Compared with the traditional LSTM model and CNN model, the experimental results show that the classification accuracy and recall are improved. In particular, the addition of the attention mechanism can make the model effectively extract the emotional features, thus effectively overcoming the difficult problem of colloquial short text emotional polarity judgment.
文章引用:孙凯. 基于词注意力的BiLSTM和CNN集成模型的中文情感分析[J]. 计算机科学与应用, 2020, 10(2): 312-324. https://doi.org/10.12677/CSA.2020.102032

参考文献

[1] 周纯洁, 黎巎, 徐翼龙, 等, 文本情感分析研究[J]. 计算机科学, 2018, 10(45): 296-299.
[2] 肖江, 丁星, 何荣杰. 基于领域情感词典的中文微博情感分析[J]. 电子设计工程, 2015, 6(12): 18-21.
[3] Paltoglou, G. and Thelwall, M. (2012) Twitter, My Space, Digg: Unsupervised Sentiment Analysis in Social Media. ACM Transactions on Intelligent Systems & Technology, 3, 1-19. [Google Scholar] [CrossRef
[4] Jo, Y. and Oh, A.H. (2011) Aspect and Sentiment Unification Model for Online Review Analysis. In: ACM International Conference on Web Search and Data Mining, ACM, New York, 815-824. [Google Scholar] [CrossRef
[5] Pang, B., Lee, L. and Vaithyanathan, S. (2002) Thumbs up? Sentiment Classification Using Machine Learning Techniques. Proceedings of Annual Conference of the Association for Computational Linguistics, July 2002, 79-86. [Google Scholar] [CrossRef
[6] Liu, S., Li, F., et al. (2013) Adaptive Co-Training SVM for Sentiment Classification on Tweets. In: Proceeding of the 22nd ACM International Conference on Information & Knowledge Management, ACM, New York, 2079-2088. [Google Scholar] [CrossRef
[7] Berger, A.L., Dellapietra, V.J., Pietra, S.A.D., et al. (1996) A Maximum Entropy Approach to Natural Language Processing. Computational Linguistics, 22, 39-71.
[8] Hinton, G.E. (1986) Learning Distributed Representations of Concepts. Proceedings of the Eighth Annual Conference of the Cognitive Science Society, 1, 12.
[9] Le, Q. and Mikolov, T. (2014) Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning, 14, 1188-1196.
[10] Collobert, R., Weston, J., Bottou, L., et al. (2011) Natural Language Processing (Almost) from Scratch. Journal of Machine Learning Research, 12, 2493-2537.
[11] 王煜涵, 张春云, 赵宝林, 等. 卷积神经网络下的Twitter文本情感分析[J]. 数据采集与处理, 2018, 33(5): 921-927.
[12] You, Q.Z., Chen, Y.X., Yuan, J.B. and Luo, J.B. (2018) Twitter Sentiment Analysis via Bi-Sense Emoji Embedding and Attention-Based LSTM. Computer and Language, 8, 117-125.
[13] 关鹏飞, 李宝安, 吕学强, 等. 注意力增强的双向LSTM情感分析[J]. 中文信息学报, 2019, 33(2): 105-111.
[14] 王盛玉, 曾碧卿, 商齐, 等. 基于词注意力卷积神经网络模型的情感分析研究[J]. 中文信息学报, 2018, 32(9): 123-130.
[15] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[16] Bahdanau, D., Cho, K. and Bengio, Y. (2014) Neural Machine Translation by Jointly Learning to Align and Translate. Computer Science.
[17] Wang, Y., Huang, M., Zhu, X. and Zhao, L. (2016) Attention-Based LSTM for Aspect-Level Sentiment Classification. Proceedings of 2016 Conference on Empirical Methods in Nature Language Processing, Austin, TX, 1-5 November 2016, 606-615. [Google Scholar] [CrossRef
[18] Kim, Y. (2014) Convolutional Neural Networks for Sentence Classification. [Google Scholar] [CrossRef
[19] Dos Santos, C.N. and Gatti, M. (2014) Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts. Proceedings of the 25th International Conference on Computational Linguistics (COLING), Dublin, Ireland, 23-29 August 2014.
[20] 李昊璇, 张华洁. 基于词向量和CNN的书籍评论情感分析[J]. 测试技术学报, 2019, 33(2): 165-171.
[21] 李慧, 柴亚青. 基于卷积神经网络的细粒度情感分析方法[J]. 数据分析与知识实现, 2019, 3(1): 95-103.