基于BERT模型的网络舆情情感分析——以上海疫情为例
Sentiment Analysis of Network Public Opinion Based on BERT Model—Taking the Shanghai Epidemic as an Example
DOI: 10.12677/AAM.2022.118530, PDF,  被引量   
作者: 孙 洋*:上海工程技术大学管理学院,上海;冷冠男:上海工程技术大学电子电气工程学院,上海
关键词: 网络舆情情感分析上海疫情BERT模型Internet Public Opinion Sentiment Analysis Shanghai Epidemic BERT Model
摘要: 随着互联网时代的不断演进,各种社交软件为网络舆情的传播提供了主要平台。疫情发生后对网络舆情信息的搜集、分析和引导,对于相关部门开展舆情解析和稳定公众情绪具有现实意义。本研究基于新浪微博数据,以此次上海疫情为例,对疫情期间的微博相关数据信息进行提取和挖掘,首先用TF-IDF词频统计处理方法统计出关键节点日期的主题词,然后使用BERT模型对此次疫情的舆情情感进行分析。结果表明:在时间分布上,博文的发布数量整体呈“n”字形分布。在舆情情感上,负面感情前期一直占了较大的比例,中期无明显变化,后期民众的感情虽依旧起伏不定,但消极情绪总体上有了明显的下降趋势。本文研究结果可为日后疫情防控中网络舆情的引导和处置提供借鉴参考。
Abstract: With the continuous evolution of the Internet era, various social softwares provide the main plat-form for the dissemination of network public opinion. The collection, analysis and guidance of online public opinion information after the outbreak of the COVID-19 is of practical significance for rele-vant departments to carry out public opinion analysis and stabilize public sentiment. This study is based on Sina Weibo data, taking the Shanghai epidemic as an example, to extract and mine Weibo-related data information during the epidemic. First, the TF-IDF word frequency statistical processing method is used to count the subject words of the key node dates, then use the BERT model to analyze the public sentiment of the COVID-19. The results show that: in terms of time dis-tribution, the number of blog posts is distributed in the shape of “n” as a whole. In terms of public opinion and emotion, negative emotions have always accounted for a large proportion in the early stage, and there was no significant change in the mid-term. Although the people’s emotions in the later period still fluctuated, negative emotions showed a clear downward trend in general. The re-sults of this paper can provide reference for the guidance and disposal of network public opinion in the future epidemic prevention and control.
文章引用:孙洋, 冷冠男. 基于BERT模型的网络舆情情感分析——以上海疫情为例[J]. 应用数学进展, 2022, 11(8): 5053-5061. https://doi.org/10.12677/AAM.2022.118530

参考文献

[1] 邢云菲, 王晰巍, 韩雪雯, 张长亮. 基于信息熵的新媒体环境下网络节点影响力研究——以微信公众号为例[J]. 图书情报工作, 2018, 62(5): 76-86. [Google Scholar] [CrossRef
[2] 邢鹏飞, 李鑫鑫. 重大疫情防控中网络舆情形成机制及引导策略研究——基于新冠肺炎疫情期间网络舆情文本的质性分析[J]. 情报杂志, 2020, 39(7): 67-74+158.
[3] 黄仕靖, 吴川徽, 袁勤俭, 夏镜然. 基于情感分析的突发公共卫生事件舆情时空演化差异研究[J]. 情报科学, 2022, 40(6), 149-159. [Google Scholar] [CrossRef
[4] 陈娟, 刘燕平, 邓胜利. 政府辟谣信息的用户评论及其情感倾向的影响因素研究[J]. 情报科学, 2017, 35(12): 61-65+72. [Google Scholar] [CrossRef
[5] 凌海彬, 缪裕青, 张万桢, 周明, 武继刚. 多特征融合的图文微博情感分析[J]. 计算机应用研究, 2020, 37(7): 1935-1939+1951. [Google Scholar] [CrossRef
[6] 胡思才, 孙界平, 琚生根, 王霞, 龙彬, 廖强. 基于扩展的情感词典和卡方模型的中文情感特征选择方法[J]. 四川大学学报(自然科学版), 2019, 56(1): 37-44.
[7] 薄涛, 李小军, 陈苏, 王玉婷, 祁国良. 基于社交媒体数据的地震烈度快速评估方法[J]. 地震工程与工程振动, 2018, 38(5): 206-215. [Google Scholar] [CrossRef
[8] Zhao, Y. and Fan, B. (2018) Exploring Open Government Data Capacity of Government Agency: Based on the Resource-Based Theory. Gov-ernment Information Quarterly, 35, 1-12. [Google Scholar] [CrossRef
[9] Mergel, I., Kleibrink, A. and Sörvik, J. (2018) Open Data Outcomes: US Cities between Product and Process Innovation. Government Infor-mation Quarterly, 35, 622-632. [Google Scholar] [CrossRef
[10] 李宗敏, 张琪, 杜鑫雨. 基于辟谣微博的互动及热门评论情感倾向的辟谣效果研究——以新冠疫情相关辟谣微博为例[J]. 情报杂志, 2020, 39(11): 90-95+110.
[11] Li, S., Wang, Y., Xue, J., Zhao, N. and Zhu, T. (2020) The Impact of COVID-19 Epidemic Declara-tion on Psychological Consequences: A Study on Active Weibo Users. International Journal of Environmental Research and Public Health, 17, 2032. [Google Scholar] [CrossRef] [PubMed]
[12] Gao, J., Zheng, P., Jia, Y., Chen, H., Mao, Y., Chen, S., Dai, J., et al. (2020) Mental Health Problems and Social Media Exposure during COVID-19 Out-break. PLOS ONE, 15, e0231924. [Google Scholar] [CrossRef] [PubMed]
[13] 齐珉, 齐文华, 苏桂武. 基于新浪微博的2017年四川九寨沟7.0级地震舆情情感分析[J]. 华北地震科学, 2020, 38(1): 57-63.
[14] 张娜, 张琨, 张先国, 张佳慧, 蒋彤彤, 方悦. 基于主题词与信息熵编码的文本零水印算法[J]. 计算机与数字工程, 2021, 49(8): 1612-1618.
[15] Lin, B., Zampetti, F., Bavota, G., Di Penta, M., Lanza, M. and Oliveto, R. (2018) Sentiment Analysis for Software Engineering: How Far Can We Go? Proceedings of the 40th International Conference on Software Engineering, Gothenburg, 27 May-3 June 2018, 94-104. [Google Scholar] [CrossRef
[16] Devlin, J., Chang, M.W., Lee, K., et al. (2018) BERT: Pre-Training of Deep Bidirectional Transformers for Language Understand-ing. Proceedings of NAACL-HLT 2019, Minneapolis, 2-7 June 2019, 4171-4186.
[17] Vaswani, A., Shazeer, N., Par-mar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Polosukhin, I., et al. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, 4-9 December 2017, 5998-6008.
[18] Song, K., Tan, X., Qin, T., Lu, J. and Liu, T.Y. (2020) Mpnet: Masked and Permuted Pre-Training for Language Understanding. Advances in Neural Information Processing Systems 33: Annu-al Conference on Neural Information Processing Systems 2020, NeurIPS 2020, 6-12 December 2020, 16857-16867.