|
[1]
|
Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
|
|
[2]
|
Schuster, M. and Paliwal, K.K. (2002) Bidirectional Recurrent-neural Networks. IEEE Transactions on Signal Processing, 45, 2673-2681. [Google Scholar] [CrossRef]
|
|
[3]
|
Kim, Y. (2014) Convolutional Neural Networks for Sentence Classifica-tion. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, 25-29 October 2014, 1746-1751. [Google Scholar] [CrossRef]
|
|
[4]
|
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 5998-6008.
|
|
[5]
|
Bengio, Y., Ducharme, R., Vincent, P. and Jauvin, C. (2003) A Neural Probabilistic Language Model. The Journal of Machine Learning Research, 3, 1137-1155.
|
|
[6]
|
Mikolov, T., Chen, K., Corrado, G. and Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space. Computer Science. arXiv: 1301.3781.
|
|
[7]
|
Zhang, J., Li, Y., Tian, J. and Li, T. (2018) LSTM-CNN Hybrid Model for Text Classification. 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, 12-14 October 2018, 1675-1680. [Google Scholar] [CrossRef]
|
|
[8]
|
吴汉瑜, 严江, 黄少滨, 李熔盛, 姜梦奇. 用于文本分类的CNN_BiLSTM_Attention混合模型[J]. 计算机科学, 2020, 47(z2): 23-27+34.
|
|
[9]
|
梁顺攀, 豆明明, 于洪涛, 郑智中. 基于混合神经网络的文本分类方法[J]. 计算机工程与设计, 2022, 43(2): 573-579.
|
|
[10]
|
张小川, 刘连喜, 戴旭尧, 刘璐. 基于词性特征的 CNN_BiGRU文本分类模型[J]. 计算机应用与软件, 2021, 38(11): 155-161.
|
|
[11]
|
陶志勇, 李小兵, 刘影, 刘晓芳. 基于双向长短时记忆网络的改进注意力短文本分类方法[J]. 数据分析与知识发现, 2019, 3(12): 21-29.
|
|
[12]
|
蒲相忠, 梁春燕, 李鑫鑫, 赵磊, 王栋. 基于Self-Attention的多语言语义角色标注联合学习方法[J]. 计算机应用与软件, 2021, 38(12): 174-178.
|
|
[13]
|
邓朝阳, 仲国强, 王栋. 基于注意力门控图神经网络的文本分类[J]. 计算机科学, 2022, 49(6): 326-334.
|
|
[14]
|
陈农田, 李俊辉, 满永政. 基于改进CNN-BiGRU-att模型的文本分类研究[J/OL]. 昆明理工大学学报(自然科学版), 2022, 47(1): 30-37. 2021-09-28. [Google Scholar] [CrossRef]
|
|
[15]
|
陈可嘉, 刘惠. 基于改进BiGRU-CNN的中文文本分类方法[J/OL]. 计算机工程, 2022, 48(5): 59-66+73.
2021-12-11. [Google Scholar] [CrossRef]
|
|
[16]
|
Hinton, G.E., Ba, J.L. and Kiros, J.R. (2016) Layer Normalization. arXiv Preprint, arXiv: 1607.06450.
|
|
[17]
|
Ioffe, S. and Szegedy, C. (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv Preprint arXiv: 1502.03167.
|
|
[18]
|
Diganta, M. (2020) Mish: A Self Regularized Non-Monotonic Neural Activation Function. arXiv Preprint, arXiv: 1908.08681. https://arxiv.org/pdf/1908.08681.pdf
|
|
[19]
|
THUCTC: 一个高效的中文文本分类工具包[OL]. http://thuctc.thunlp.org/, 2020-11-11.
|