基于BiLSTM-CAM的新闻文本分类研究
A Study on News Text Classification Based on BiLSTM-CAM
DOI: 10.12677/airr.2026.153078, PDF,   
作者: 黄盼望, 郑 浩:西京学院计算机学院,陕西 西安
关键词: 文本分类新闻文本BiLSTM-CAMText Classification News Text BiLSTM-CAM
摘要: 随着信息技术的迅速发展,新闻文本的数量急剧增加,怎么有效地对这些文本进行分类已成为一个重要的研究课题,新闻文本的多样性和复杂性使得传统的文本分类方法面临诸多挑战。本文提出一种BiLSTM-CAM模型旨在解决深度学习算法存在的问题,其在处理序列数据方面的优势,能够有效捕捉文本中的上下文信息,更好地识别新闻文本中的重要特征。在实验中,我们首先对数据集进行了预处理,包括文本清洗、分词和向量化等步骤,对BiLSTM-CAM模型进行了训练和调优。进行了对比实验与消融实验,研究结果表明,所提出的模型在处理数据时,能够有效提升分类性能,改善了传统方法的不足。
Abstract: With the rapid development of information technology, the volume of news texts has surged dramatically. How to effectively classify these texts has become a significant research topic; however, the diversity and complexity of news texts pose numerous challenges to traditional text classification methods. This paper proposes a BiLSTM-CAM model designed to address the limitations of deep learning algorithms. Leveraging the advantages of deep learning in processing sequential data, this model can effectively capture contextual information within the text and better identify key features in news texts. In the experiments, we first preprocessed the dataset, including steps such as text cleaning, word segmentation, and vectorization, and then trained and fine-tuned the BiLSTM-CAM model. We conducted comparative and ablation experiments. The results indicate that the proposed model effectively improves classification performance when processing data, addressing the shortcomings of traditional methods.
文章引用:黄盼望, 郑浩. 基于BiLSTM-CAM的新闻文本分类研究[J]. 人工智能与机器人研究, 2026, 15(3): 844-852. https://doi.org/10.12677/airr.2026.153078

参考文献

[1] 李晓英, 杨名, 全睿, 等. 基于深度学习的不均衡文本分类方法[J]. 吉林大学学报(工学版), 2022, 52(8): 1889-1895.
[2] 刘晓琳, 宋营营, 李卓. 基于增强逐点图卷积网络的民航短文本组合分类方法[J/OL]. 北京航空航天大学学报: 1-18. 2024-10-30.[CrossRef
[3] 王驰宇. 基于变分贝叶斯的小样本新闻文本分类方法[J]. 中国传媒科技, 2026(1): 149-153.
[4] 徐朋, 沈子宁. 基于孪生神经网络的新闻文本分类方法研究[J]. 计算机与数字工程, 2025, 53(10): 2831-2836.
[5] 乔京, 常承伟, 王哲, 等. 融合语义增强的新闻文本分类研究[J]. 计算机仿真, 2025, 42(6): 72-77.
[6] 郝婷, 冯赛赛. 基于深度学习的文本分类模型研究[J]. 信息记录材料, 2025, 26(6): 114-116+119.
[7] Shi, J., Wei, T. and Li, Y. (2024) Residual Diverse Ensemble for Long-Tailed Multi-Label Text Classification. Science China Information Sciences, 67, 92-105. [Google Scholar] [CrossRef
[8] 季天瑶, 王挺韶. 基于词嵌入与卷积神经网络的建筑能耗预测[J]. 华南理工大学学报(自然科学版), 2021, 49(6): 40-48.
[9] Chen, G.Z., Liu, S. and Xu, J.T. (2023) Memory-Boosting RNN with Dynamic Graph for Event-Based Action Recognition. Optoelectronics Letters, 19, 629-634. [Google Scholar] [CrossRef
[10] Li, O., Lei, J., Qin, C., Zhang, Z., Tao, J. and Liu, C. (2024) A Novel Multi-Channel CNN-LSTM and Transformer-Based Network for Diesel Engine Misfire Diagnosis under Different Noise Conditions. Science China Technological Sciences, 67, 2965-2967. [Google Scholar] [CrossRef
[11] Gong, J., Liu, X., Zhang, Y., Zhu, F. and Hu, G. (2024) Prediction of Single Cell Mechanical Properties in Microchannels Based on Deep Learning. Applied Mathematics and Mechanics, 45, 1857-1874. [Google Scholar] [CrossRef
[12] 孙刘成, 黄润才. 融合LSTM和注意力机制的新闻文本分类模型[J]. 传感器与微系统, 2022, 41(9): 38-41.