基于文本分类的新能源汽车满意度研究——以比亚迪与特斯拉为例
A Study on Satisfaction of New Energy Vehicles Based on Text Categorization—Taking BYD and Tesla as an Example
DOI: 10.12677/mos.2025.142173, PDF,   
作者: 赵嘉麒, 罗鄂湘:上海理工大学管理学院,上海
关键词: 新能源汽车满意度集成学习New Energy Vehicles Satisfaction Ensemble Learning
摘要: 量化评估新能源汽车满意度对于推动新能源汽车产业发展、未来能源转型具有重要意义。本文使用python爬取汽车之家上比亚迪与特斯拉两品牌的新能源汽车口碑评论数据,对非结构化数据进行LDA主题建模分析;对结构化数据提出一种基于Stacking集成学习的文本情感分类算法以了解各属性的满意度情况,在对比近十种同质算法后,选择精度最佳的Catboost、神经网络(ANN)和随机森林(RF)构建多基分类器,并利用逻辑回归(LR)作为元分类器,实现对汽车结构化维度的情感分类,最终准确度高达到95%。研究发现,购车体验、车型选择与噪音问题是结构化数据中被忽视的用户关注维度。而就满意度来看,比亚迪的整体满意度高于特斯拉,且均在内饰方面表现较差。未来应关注内饰部分的设计改进,以提升用户体验并改善用户满意度。
Abstract: Quantitatively assessing the satisfaction of new energy vehicles is of great significance for promoting the development of new energy vehicle industry and future energy transition. In this paper, we use python to crawl the review data of new energy vehicles of BYD and Tesla brands on Autohome.com, and perform LDA topic modeling analysis on the unstructured data; and propose a text sentiment classification algorithm based on Stacking integrated learning for the structured data to understand the satisfaction situation of each attribute. After comparing nearly ten homogeneous algorithms, Catboost, neural network (ANN) and random forest (RF) with the best accuracy are selected to construct a multi-base classifier, and logistic regression (LR) is used as a meta-classifier to realize sentiment classification of structured dimensions of cars, with a high final accuracy of 95%. It was found that car-buying experience, model selection and noise problem are neglected user concern dimensions in structured data. In terms of satisfaction, BYD’s overall satisfaction is higher than Tesla’s, and both perform poorly in the interior. In the future, attention should be paid to the design improvement of the interior part to enhance user experience and improve user satisfaction.
文章引用:赵嘉麒, 罗鄂湘. 基于文本分类的新能源汽车满意度研究——以比亚迪与特斯拉为例[J]. 建模与仿真, 2025, 14(2): 532-542. https://doi.org/10.12677/mos.2025.142173

参考文献

[1] 张芳, 包先建. 我国新能源汽车市场推广问题与对策研究[J]. 价格理论与实践, 2011(5): 85-86.
[2] 高仲敏. 中国新能源汽车产业存在的问题和解决策略[J]. 汽车维修技师, 2023(8): 110.
[3] 郑舒允. 高质量绿色发展下中国新能源汽车产业发展现状及其问题分析[J]. 科技和产业, 2022, 22(3): 132-137.
[4] Lukyanenko, R., Parsons, J. and Wiersma, Y.F. (2014) The IQ of the Crowd: Understanding and Improving Information Quality in Structured User-Generated Content. Information Systems Research, 25, 669-689. [Google Scholar] [CrossRef
[5] 赵妍妍, 秦兵, 刘挺. 文本情感分析[J]. 软件学报, 2010, 21(8): 1834-1848.
[6] Taboada, M., Brooke, J., Tofiloski, M., Voll, K. and Stede, M. (2011) Lexicon-Based Methods for Sentiment Analysis. Computational Linguistics, 37, 267-307. [Google Scholar] [CrossRef
[7] 卫青蓝, 何雨, 宋金宝. 基于语义规则的自适应情感词典自动构建算法[J/OL]. 北京航空航天大学学报, 1-11. 2024-12-24.[CrossRef
[8] 尹丽春, 王悦. 基于在线评论的图书消费者满意度影响因素与作用机理[J]. 图书情报工作, 2019, 63(22): 106-117.
[9] 康雁, 李浩, 梁文韬, 宁浩宇, 霍雯. 针对文本情感分类任务的textSE-ResNeXt集成模型[J]. 计算机工程与应用, 2020, 56(7): 205-209.
[10] 闵洁, 吉秉彧. 基于主题相关性和深度学习的文本情感分析[J]. 信阳师范学院学报(自然科学版), 2023, 36(3): 490-494.
[11] 范黄健, 叶楠, 张梦婷. 基于多源数据的新能源汽车用户满意度分析[J]. 经营与管理, 2024(2): 32-39.
[12] Shu, T., Wang, Z., Lin, L., Jia, H. and Zhou, J. (2022) Customer Perceived Risk Measurement with NLP Method in Electric Vehicles Consumption Market: Empirical Study from China. Energies, 15, Article 1637. [Google Scholar] [CrossRef
[13] Ikonomakis, M., Kotsiantis, S. and Tampakas, V. (2005) Text Classification Using Machine Learning Techniques. WSEAS Transactions on Computers, 4, 966-974.
[14] Smagulova, K. and James, A.P. (2019) A Survey on LSTM Memristive Neural Network Architectures and Applications. The European Physical Journal Special Topics, 228, 2313-2324. [Google Scholar] [CrossRef
[15] 陈红阳, 黄正洪, 何盈盈, 周也力. 融合内容特征与传播特征的微博文本情感分类[J]. 重庆理工大学学报(自然科学), 2023, 37(7): 245-255.