Sentiment Analysis Research Based on IMDb Movie Reviews

doi:10.12677/ml.2026.144261

Author: Yang Ju, College of Foreign Languages, Shanghai Maritime University, Shanghai
Keywords: Natural Language Processing; Sentiment Analysis; Cross-Lingual Sentiment Analysis; Naïve Bayes; Support Vector Machine

Abstract: Sentiment analysis mines the sentiment expressed in text and classifies its polarity, and it is an important problem in Natural Language Processing (NLP). This study builds a sentiment analysis system, using the IMDb movie review dataset for training and testing, and walks through the complete pipeline from data collection and text preprocessing to model training. To address the challenges of multilingual sentiment analysis, it also investigates how translation affects sentiment classification. Experiments show that methods based on traditional machine learning models (such as Naïve Bayes and Support Vector Machines) perform well given high-quality preprocessing and feature extraction, while the effect of translation on classification results depends on the characteristics of the language and the accuracy of the translation tool.
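The preprocessing-then-training flow described in the abstract can be illustrated with a minimal sketch of one of the named models, multinomial Naïve Bayes with add-one smoothing, written in plain Python. This is not the author's implementation: the abstract does not specify the tokenizer, features, or libraries used, so the `tokenize` helper, the `NaiveBayes` class, and the four toy reviews below are all assumptions made up for illustration (they are not IMDb data). The Support Vector Machine variant is not shown.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens (a deliberately simple preprocessing step)."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    """Hypothetical multinomial Naive Bayes classifier with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        # Count documents per class and word occurrences per class.
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            for t in tokens:
                smoothed = (self.word_counts[label][t] + 1) / (total_words + len(self.vocab))
                score += math.log(smoothed)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data invented for this sketch; not taken from the IMDb dataset.
train_texts = [
    "a wonderful film with a brilliant cast",
    "great story and wonderful acting",
    "boring plot and terrible acting",
    "a dull, terrible waste of time",
]
train_labels = ["pos", "pos", "neg", "neg"]

model = NaiveBayes().fit(train_texts, train_labels)
prediction_pos = model.predict("brilliant and wonderful")
prediction_neg = model.predict("terrible and boring")
print(prediction_pos, prediction_neg)
```

On real review data the same structure would apply, but the tokenizer and feature extraction would need to be considerably more careful, which is the point the abstract makes about preprocessing quality.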

Cite this article: Yang, J. (2026) Sentiment Analysis Research Based on IMDb Movie Reviews. Modern Linguistics, 14(4), 10-22. https://doi.org/10.12677/ml.2026.144261

