基于双向LSTM的文本情感倾向分类
Text Sentiment Classification Based on Bilateral LSTM
DOI: 10.12677/CSA.2021.115143, PDF,  被引量    科研立项经费支持
作者: 李金宇, 王晓晔, 彭 宪, 田 昊, 吉智豪, 罗一宁, 李金泳:天津理工大学计算机科学与工程系,天津
关键词: 情感分析PaddlePaddleLSTMEmotion Analysis PaddlePaddle LSTM
摘要: 随着互联网普及,在网络上出现了大量带有个人主观性的文本,这些文本含有大量情感相关信息和个人的主观观点,目前通常的卷积网络处理这些文本无法将信息进行关联,处理起来无法达到想要的效果,判断文本情感倾向不够准确,所以本文使用国内百度开发的PaddlePaddle框架,构建双向LSTM (Long Short-Term Memory)网络从众多文本信息和数据中准确而高效地分析出文本中所蕴含的情感,并判断情感极性,对情感倾向做出分类。实验中对美食评论信息进行情感预测,首先利用Embedding来计算出词向量,通过双向LSTM提取特征和融合,借助softmax函数构建分类器,获得文本信息的情感倾向,实验结果较为理想。
Abstract: With the popularity of the Internet, a large number of personally subjective texts appear on the Internet. These text messages contain a large amount of emotional related information and personal subjective opinions. Some of these information are useless and may cause information explosion. At present, the usual convolutional network processing these texts cannot associate the information, and the processing cannot achieve the desired effect, so we use the PaddlePaddle developed by Baidu in China based on the bidirectional LSTM (Long Short-Term Memory) network built by us to obtain a large amount of text information, to analyze the emotion contained in the text accurately and efficiently from the data and judge the polarity of the emotion, classify the emotion, and apply it in practice. The model first uses Embedding to calculate the word vector, then uses the two-way LSTM to extract features and fusion, and finally uses the softmax function to construct a classifier to obtain the emotional tendency of the text information.
文章引用:李金宇, 王晓晔, 彭宪, 田昊, 吉智豪, 罗一宁, 李金泳. 基于双向LSTM的文本情感倾向分类[J]. 计算机科学与应用, 2021, 11(5): 1401-1410. https://doi.org/10.12677/CSA.2021.115143

参考文献

[1] Xin, K. and Bublé, M. (2021) On Extracting Keywords from Long-and-Difficult English Sentences for Smart Sentiment Analysis. Internet Technology Letters, 4, e226. [Google Scholar] [CrossRef
[2] Ahmad, S.R., Yusop, N., Ba-kar, A.A., et al. (2017) Statistical Analysis for Validating ACO-KNN Algorithm as Feature Selection in Sentiment Anal-ysis. AIP Conference Proceedings 1891, Article ID: 020018. [Google Scholar] [CrossRef
[3] Saif, H., et al. (2014) SentiCircles for Contextual and Conceptual Semantic Sentiment Analysis of Twitter. The Semantic Web: Trends and Challenges: 11th International Conference, ESWC 2014, Anissaras, Crete, Greece, 25-29 May 2014, 83-98. [Google Scholar] [CrossRef
[4] Dyer, C., Ballesteros, M., Ling, W., et al. (2015) Transition-Based Dependency Parsing with Stack Long Short-Term Memory. Computer Science, 37, 321-332.
[5] Dey, R. and Salemt, F.M. (2017) Gate-Variants of Gated Recurrent Unit (GRU) Neural Networks. 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS), Boston, MA, 6-9 August 2017, 1597-1600. [Google Scholar] [CrossRef
[6] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[7] Rong, X. (2014) word2vec Parameter Learning Explained. arXiv:1411.2738 [cs.CL]
[8] Srivastava, N., Hinton, G., Krizhevsky, A., et al. (2014) Dropout: A Simple Way to Pre-vent Neural Networks from Overfitting. Journal of Machine Learning Research, 15, 1929-1958.
[9] 祝永志, 荆静. 基于Python语言的中文分词技术的研究[J]. 通信技术, 2019, 52(7): 1612-1619.