基于Transformer增强架构的中文文本纠错研究
Research on Chinese Text Error Correction Based on Transformer Enhanced Architecture
摘要: 本文将Transformer模型应用于中文文本自动校正领域,并将Transformer模型中不同神经模块的输出动态结合,同时在模型训练时引入课程学习策略,以加快模型收敛速度。实验结果表明,本文所提出的增强模型及在训练中引入的课程学习策略对校正结果的准确率、召回率、纠错F0.5值有较大提升。
Abstract: In this paper, the transformer model is applied to the field of Chinese text automatic correction, and the outputs of different neural modules in the transformer model are dynamically combined. At the same time, the curriculum learning strategy is introduced in the model training to speed up the convergence speed of the model. The experimental results show that the proposed enhancement model and the curriculum learning strategy introduced in the training can greatly improve the accuracy, recall rate and error correction F0.5 value of the correction results.
文章引用:杨靖翔, 赵曙光. 基于Transformer增强架构的中文文本纠错研究[J]. 计算机科学与应用, 2022, 12(3): 565-571. https://doi.org/10.12677/CSA.2022.123057

参考文献

[1] 段娜, 杨妍, 赵军民. 基于规则的短文本识别算法[J]. 计算机产品与流通, 2019(2): 173-174.
[2] 王琼, 旷文珍, 许丽. 基于改进的N-gram模型和知识库的文本查错算法[J]. 计算机应用与软件, 2021, 38(10): 310-315.
[3] 高印权. 基于深度学习的文本语法自动纠错模型研究与实现[D]: [硕士学位论文]. 成都: 电子科技大学, 2020.[CrossRef
[4] Zhao, W. (2021) Machine Translation and Global Re-search: Towards Improved Machine Translation Literacy in the Scholarly Community. Lynne Bowker and Jairo Buitrago Ciro. Digital Scholarship in the Humanities, 36, 256-260. [Google Scholar] [CrossRef
[5] Fu, K., Huang, J. and Duan, Y. (2018) Youdao’s Winning Solution to the NLPCC-2018 Task 2 Challenge: A Neural Machine Translation Approach to Chinese Grammatical Error Correction. Springer, Cham, 879-884. [Google Scholar] [CrossRef
[6] Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. arXiv:1706.03762 [cs.CL]
[7] Bengio, Y., Louradour, J., Collobert, R., et al. (2009) Curriculum Learning. Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, Montreal, 14-18 June 2009, 41-48. [Google Scholar] [CrossRef