基于RoBERTa的新闻评论可解释性情感分析
Interpretable Sentiment Analysis of News Commentary Based on RoBERTa
DOI: 10.12677/sa.2025.144086, PDF,   
作者: 张月月:燕山大学理学院,河北 秦皇岛;刘立佳:燕山大学信息科学与工程学院(软件学院),河北 秦皇岛
关键词: RoBERTa卷积神经网络可解释性情感分析RoBERTa Convolutional Neural Network Interpretability Sentiment Analysis
摘要: 在数字化信息高速传播的当下,实现对新闻评论精准且具备可解释性的情感分析,对舆情洞察与舆论引导极为关键。本研究运用RoBERTa模型,结合多尺度卷积神经网络(MSCNN),深度剖析新闻评论中的情感倾向。同时,采用局部可解释的模型无关解释方法(LIME),实现对模型预测结果的深度解释,可视化展示模型在处理新闻评论时对词汇和短语的关注重点,为模型决策提供清晰依据。实验结果表明,RoBERTa-MSCNN在新闻评论情感分析任务上取得了更优的性能,准确率达到83.34%,精确率为82.6%,召回率为84.67%,F1值提升至83.62%。同时,可解释性分析为用户理解模型输出提供了清晰的视角,有助于新闻媒体更有效地进行舆情监测与引导,为相关领域的研究与应用提供了有力支持。
Abstract: In the current era of high-speed digital information dissemination, accurate and interpretable emotional analysis of news comments is crucial for public opinion insight and guidance. In this study, RoBERTa model and multi-scale convolutional neural network (MSCNN) are used to analyze the emotional tendency of news commentary. At the same time, Local Interpretable Model-agnostic Explanations (LIME) is used to realize the in-depth interpretation of the model prediction results and visually display the model’s focus on words and phrases when processing news comments, providing a clear basis for the model’s decision-making. The experimental results show that RoBERTa-MSCNN has achieved superior performance in the task of sentiment analysis of news comments. Its accuracy rate reaches 83.34%, the precision rate is 82.6%, the recall rate is 84.67%, and the F1 score has been increased to 83.62%. At the same time, interpretability analysis provides a clear perspective for users to understand the model output, helps news media to monitor and guide public opinion more effectively, and provides strong support for research and application in related fields.
文章引用:张月月, 刘立佳. 基于RoBERTa的新闻评论可解释性情感分析[J]. 统计学与应用, 2025, 14(4): 34-46. https://doi.org/10.12677/sa.2025.144086

参考文献

[1] 宁益民. 基于深度学习的新闻评论情感分析研究[D]: [硕士学位论文]. 北京: 北京邮电大学, 2023.
[2] 惠调艳, 王智, 何振华, 等. 基于词典-TextCNN-Word2Vec组合模型的在线评价细粒度情感分析[J]. 情报理论与实践, 2025, 48(2): 168-177.
[3] Srivats Athindran, N., Manikandaraj, S. and Kamaleshwar, R. (2018) Comparative Analysis of Customer Sentiments on Competing Brands Using Hybrid Model Approach. 2018 3rd International Conference on Inventive Computation Technologies (ICICT), Coimbatore, 15-16 November 2018, 348-353. [Google Scholar] [CrossRef
[4] Vanaja, S. and Belwal, M. (2018) Aspect-Level Sentiment Analysis on E-Commerce Data. 2018 International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, 11-12 July 2018, 1275-1279. [Google Scholar] [CrossRef
[5] 程顺正, 胡楠. 基于BERT模型的短视频平台评论信息情感分析研究[J]. 辽宁科技学院学报, 2024, 26(6): 30-33.
[6] Kamruzzaman, M., Hossain, M., Imran, M.R.I. and Bakchy, S.C. (2021) A Comparative Analysis of Sentiment Classification Based on Deep and Traditional Ensemble Machine Learning Models. 2021 International Conference on Science & Contemporary Technologies (ICSCT), Dhaka, 5-7 August 2021, 1-5. [Google Scholar] [CrossRef
[7] Janardhana, D.R., Vijay, C.P., Swamy, G.B.J. and Ganaraj, K. (2020. Feature Enhancement Based Text Sentiment Classification Using Deep Learning Model. 2020 5th International Conference on Computing, Communication and Security (ICCCS), Patna, 14-16 October 2020, 1-6. [Google Scholar] [CrossRef
[8] Tan, K.L., Lee, C.P., Anbananthen, K.S.M. and Lim, K.M. (2022) RoBERTa-LSTM: A Hybrid Model for Sentiment Analysis with Transformer and Recurrent Neural Network. IEEE Access, 10, 21517-21525. [Google Scholar] [CrossRef
[9] 杨明. 基于BERT-CBA方法的中文新闻情感分类[J]. 长江信息通信, 2022, 35(5): 170-172.
[10] 林奕欧. 深度情感分析模型的建模方法和可解释性研究[D]: [硕士学位论文]. 成都: 电子科技大学, 2021.
[11] 蒋建洪, 李梦欣. 融合LGBM和SHAP的企业负面事件下微博用户极端情感可解释模型[J]. 数据分析与知识发现, 2024, 8(7): 103-117.
[12] Jain, R., Kumar, A., Nayyar, A., Dewan, K., Garg, R., Raman, S., et al. (2023) Explaining Sentiment Analysis Results on Social Media Texts through Visualization. Multimedia Tools and Applications, 82, 22613-22629. [Google Scholar] [CrossRef] [PubMed]
[13] Go, A. Bhayani, R. and Huang, L. (2009) Twitter Sentiment Classification Using Distant Supervision. CS224N Project Report, Stanford.