基于自注意力机制改进GCNN模型的图书标签分类研究
Research on Book Label Classification Based on Improved GCNN Model Based on Self-Attention Mechanism
摘要: 针对卷积神经网络聚焦于局部特征,不足以捕捉文本中长程依赖关系的问题,本文提出了一种基于CNN和自注意力机制改进的双通道图书标签分类模型(Gate Convolution Neural Network based on self- attention mechanism, GCNN-SAM)。该模型使用skip-gram将词嵌入成稠密低纬的向量,得到文本嵌入矩阵,分别输入到门卷积神经网络和自注意力机制,再经过逐点卷积,将两个通道中经过特征提取层得到的特征进行融合用于图书标签分类。在复旦大学中文文本分类数据集上进行对比实验,相较于SCNN、GCNN和其它改进的模型,测试集准确率达到96.21%,表明了GCNN-SAM模型在图书标签分类上具有优越性。同时,为验证GCNN-SAM模型的有效性,消融实验结果表明GCNN-SAM模型相较于CNN、GCNN和CNN-SAM在分类准确率上分别提升了5.9%、3.19%和3.66%。
Abstract: Aiming at the problem that convolutional neural networks focus on local features and are not enough to capture long-range dependencies in text, in order to improve this problem, this paper proposes an improved dual-channel book label classification model based on CNN and self-attention mechanism. The model uses skip-gram to embed words into dense low-latitude vectors to obtain a text embedding matrix, which is input into the gate convolutional neural network and self-attention mechanism respectively. After point-by-point convolution, the features obtained by the feature ex-traction layer in the two channels are fused for book label classification. A comparative experiment was conducted on the Fudan University Chinese text classification dataset. Compared with SCNN, GCNN and other improved models, the accuracy of the test set is 96.21%, which shows that the GCNN-SAM model has advantages in book label classification. At the same time, in order to verify the effectiveness of the GCNN-SAM model, ablation experiments were carried out. The results showed that the GCNN-SAM model improved the classification accuracy by 5.9%, 3.19% and 3.66% respec-tively compared with CNN, GCNN and CNN-SAM.
文章引用:张健. 基于自注意力机制改进GCNN模型的图书标签分类研究[J]. 建模与仿真, 2024, 13(2): 1322-1332. https://doi.org/10.12677/MOS.2024.132124

参考文献

[1] 李蕾, 王冕, 章成志. 区分标签类型的社会化标签质量测评研究[J]. 图书情报工作, 2013, 57(23): 11.
[2] 潘薇, 喻浩. 文献信息知识组织与内容揭示方法探究[J]. 图书馆研究, 2019, 39(3): 46-48.
[3] Cui, J.W, and Li, C.J. (2017) Identifying Semantic Relations of Clusters Based on Linked Data. Data Analysis and Knowledge Discovery, 1, 57-66.
[4] Zeng, Z.Y., Wang, H., Zhang, Z.B., et al. (2021) A Study on Construction and Application of Text Classification Model Integrated with Associated Information from GCN. Data Analysis and Knowledge Discovery, 3, 12-20.
[5] 朱靖波, 王会珍, 张希娟. 面向文本分类的混淆类判别技术[J]. 软件学报, 2008, 19(3): 630-639.
[6] 何嘉欣, 张涛, 陈旭岚, 等. 基于数据挖掘的P2P网贷个人信用评价模型研究[J]. 建模与仿真, 2021, 10(4): 991-1002.
[7] Yang, L., Hang, X.S. and Wang, J.Y. (2022) Identifying Subtypes of Clinical Trial Diseases with BERT-TextCNN. Data Analysis and Knowledge Discovery, 6, 69-81.
[8] Wang, S., Liu, C., Wu, J. and Yao, X. (2019) Multi-Label Text Classification with Deep Learning. IEEE Access, 7, 174906-174917.
[9] Gao, S., Ramanathan, A. and Tourassi, G. (2018) Hierarchical Convolutional Attention Networks for Text Classification. In: Proceedings of the Third Workshop on Representation Learning for NLP, Association for Computation-al Linguistics, Melbourne, 11-23. [Google Scholar] [CrossRef
[10] Ma, Y.F., Peng, Y.X. and Cambria, E. (2018) GCN-Based Joint Learning for Community Sentiment Analysis. Proceedings of the 27th International Conference on Computational Linguistics: Technical Papers, New Mexico, 20-26 August 2018, 1288-1297.
[11] Alshubaily, I. (2021) TextCNN with Attention for Text Classifi-Cation. ArXiv: 2108.01921.
[12] Chen, W. and Fan, J.S. (2022) MechPerformer: A General Deep Learning Model for History-Dependent Response Prediction in Structural Engineering. Journal of Building Structures, 43, 209-219.
[13] Mikolov, T., Sutskever, I., Chen, K., et al. (2013) Distributed Representations of Words and Phrases and Their Compositionality. arXiv: 1310.4546.
[14] 李琳, 段围, 周栋,袁景凌. 基于深度语义匹配的法律条文推荐方法[J]. 软件学报, 2022, 33(7): 2618-2632.
[15] 杨灿, 董海龙. 基于国家标准学科分类的统计学科体系研究[J]. 统计研究, 2010, 27(1): 50-57.
[16] Cen, Y.H., Han, Z. and Ji, P.P. (2008) Chinese Term Recognition Based on Hidden Markov Model. 2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application, Wuhan, 19-20 December 2008, 219-224.
[17] Yan, C.M. and Wang, C. (2021) Development and Application of Convolutional Neural Network Model. Journal of Frontiers of Computer Science & Technology, 15, 27.
[18] Zagoruyko, S. and Komodakis, N. (2015) Learning to Compare Image Patches via Convolutional Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 4353-4361. [Google Scholar] [CrossRef
[19] Hai, H., Guo, Z.L., Cai, T.T. and He, Z.C. (2022) A Text Classification Method Based on a Convolutional and Bidirectional Long Short-Term Memory Model. Connection Science, 34, 2108-2124. [Google Scholar] [CrossRef