基于元学习模型优化器的小样本手写西波文字识别
Meta-Learned Optimizer for Few-Shot Handwritten Xibo Characters Recognition
摘要: 本文针对手写西波文字因样本稀缺而难以使用传统神经网络进行有效识别的问题,提出了一种基于元学习的优化框架。该框架以长短期记忆神经网络(LSTM)作为元优化器,来学习如何优化一个基础的卷积神经网络(CNN)或深度神经网络(DNN)。其核心创新在于结合了两种策略:1) 通过在MNIST数据集上进行预训练,实现跨任务的知识迁移;2) 通过LSTM优化器实现动态的梯度调控策略,自适应地调整基础网络的参数更新。实验结果显示,该方法在10类手写西波文字的小样本测试集上,将CNN的识别准确率从24%显著提升至73%,并将训练时间缩短至368秒,远优于传统的数据增强方法。研究旨在为西波文等少数民族濒危文字的数字化保护提供一种高效、可行的小样本学习技术方案。
Abstract: This paper proposes a meta-learning-based optimization framework to address the challenge of effectively recognizing handwritten Xibo characters using traditional neural networks under extreme sample scarcity. The framework employs a Long Short-Term Memory (LSTM) network as a meta-optimizer to learn how to optimize a base Convolutional Neural Network (CNN) or Deep Neural Network (DNN). Its core innovation lies in combining two strategies: 1) Cross-task knowledge transfer achieved through pre-training on the MNIST dataset; 2) A dynamic gradient regulation strategy implemented by the LSTM optimizer to adaptively adjust the parameter updates of the base network. Experimental results demonstrate that on a few-shot test set of 10-class handwritten Xibo characters, this method significantly boosts the recognition accuracy of a CNN from 24% to 73%, while reducing the training time to 368 seconds, far outperforming traditional data augmentation methods. This research aims to provide an efficient and feasible few-shot learning technical solution for the digital preservation of endangered minority scripts like Xibo.
文章引用:何翠玲, 梅自艳, 顾玉碟. 基于元学习模型优化器的小样本手写西波文字识别[J]. 人工智能与机器人研究, 2026, 15(3): 903-912. https://doi.org/10.12677/airr.2026.153083

参考文献

[1] Wang, Y.Q., Yao, Q.M., Kwok, J., et al. (2020) Generalizing from a Few Examples: A Survey on Few-Shot Learning. ACM Computing Surveys (CSUR), 53, Article No. 63. [Google Scholar] [CrossRef
[2] Gregory, K., Zemel, R. and Salakhutdinov, R. (2015) Siamese Neural Networks for One-Shot Image Recognition. The 32nd International Conference on Machine Learning (ICML 2015), Lille, 6-11 July 2015.
[3] Snell, J., Swersky, K. and Zemel, R. (2017) Prototypical Networks for Few-Shot Learning. [Google Scholar] [CrossRef
[4] Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P.H.S. and Hospedales, T.M. (2018) Learning to Compare: Relation Network for Few-Shot Learning. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 1199-1208. [Google Scholar] [CrossRef
[5] Santoro, A., Bartunov, S. and Botvinick, M. (2016) One-Shot Learning with Memory-Augmented Neural Networks.
https://arxiv.org/abs/1605.06065
[6] Munkhdalai, T. and Yu, H. (2017) Meta Network.
https://arxiv.org/abs/1703.00837
[7] Ravi, S. and Larochelle, H. (2017) Optimization as a Model for Few-Shot Learning. 2017 5th International Conference on Learning Representations, Toulon, 24-26 April 2017.
[8] Antoniou, A., Edwards, H. and Storkey, A.J. (2019) How to Train Your MAML. 2019 International Conference on Learning Representations, New Orleans, 6-9 May 2019.
[9] Yu, M. (2018) Diverse Few-Shot Text Classification with Multiple Metrics.
https://arxiv.org/abs/1606.04474
[10] Andrychowicz, M., Denil, M., Gomez, S., Hoffman, M.W., Pfau D., Schaul, T. and de Freitas, N. (2016) Learning to Learn by Gradient Descent by Gradient Descent.
https://arxiv.org/abs/1606.04474
[11] 王嘉梅, 文永华, 李燕青, 高雅莉. 基于图像分割的古彝文字识别系统研究[J]. 云南民族大学学报: 自然科学版, 2008, 17(1): 76-79.
[12] 朱龙华, 王嘉梅. 基于组合特征的多分类器集成的脱机手写体彝文字识别[J]. 云南民族大学学报自然科学版, 2010, 19(5): 329-333.
[13] 朱宗晓, 吴显礼. 脱机印刷体彝族文字识别系统的原理与实现[J]. 计算机技术与发展, 2012, 22(2): 85-88+92.
[14] 沙马拉毅. 计算机彝文操作系统的研制[J]. 西南民族学院学报(自然科学版), 2003, 29(1): 1-8.
[15] 陈善雄, 王小龙, 韩旭, 刘云, 王明贵. 一种基于深度学习的古彝文识别方法[J]. 浙江大学学报: 理学版, 2019, 46(3): 261-269.
[16] 伊恩古德费洛, 约书亚本吉奥, 亚伦库维尔. 神经网络与深度学习[M]. 北京: 人民邮电出版社, 2017.
[17] 邱锡鹏. 神经网络与深度学习[M]. 北京: 机械工业出版社, 2020.