基于YOLOv8n的甲骨文图像识别技术研究
Research on Oracle Bone Script Image Recognition Technology Based on YOLOv8n
DOI: 10.12677/jisp.2025.144043, PDF,    科研立项经费支持
作者: 张振文*, 黄柏锋*, 韦易霖, 覃 军:桂林信息科技学院机电工程学院,广西 桂林;刘映伶#:桂林信息科技学院基础教研部,广西 桂林
关键词: YOLOv8n甲骨文物体检测深度学习图像识别YOLOv8n Oracle Bone Script Object Detection Deep Learning Image Recognition
摘要: 随着深度学习技术的发展,物体检测算法在图像识别领域取得了显著进展。针对甲骨文图像识别技术的研究,本文基于YOLOv8n框架构建了一种高效的甲骨文物体检测与分类方法。通过数据预处理、数据增强、模型训练、模型预测等环节,结合CSPDarknet骨干网络、PAFPN特征金字塔和CIoU损失函数,提升了检测精度。该验证结果表明目标检测模型的P值和召回率R分别达到0.855和0.807,其中mAP@0.5的指标达到0.876,mAP@[0.5:0.95]的值为0.53。分类模型的准确率高达93.6%,比YOLOv5提高了6.6%。本研究为甲骨文的数字化与智能分析提供了新的技术方案,探索了提高识别与分类精度及泛化能力的方法。
Abstract: With the development of deep learning technology, object detection algorithms have achieved significant progress in the field of image recognition. Focusing on the recognition of oracle bone script images, this paper proposes an efficient object detection and classification method based on the YOLOv8n framework. Through stages such as data preprocessing, data augmentation, model training, and prediction, and by integrating the CSPDarknet backbone, PAFPN feature pyramid, and CIoU loss function, the detection accuracy is effectively improved. Experimental results show that the object detection model achieves a precision (P) of 0.855 and a recall (R) of 0.807, with an mAP@0.5 of 0.876 and an mAP@[0.5:0.95] of 0.53. The classification model attains an accuracy of 93.6%, representing a 6.6% improvement over YOLOv5. This study provides a novel technical solution for the digitization and intelligent analysis of oracle bone inscriptions, and explores methods to enhance recognition accuracy, classification performance, and model generalization capability.
文章引用:张振文, 黄柏锋, 韦易霖, 覃军, 刘映伶. 基于YOLOv8n的甲骨文图像识别技术研究[J]. 图像与信号处理, 2025, 14(4): 467-485. https://doi.org/10.12677/jisp.2025.144043

参考文献

[1] Tang, H., Tang, R. and Wang, H. (2024) Oracle Bone Script Intelligent Recognition: Automatic Segmentation and Recognition of Original Rubbing Single Characters. 2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI), Shenzhen, 31 May-2 June 2024, 414-418. [Google Scholar] [CrossRef
[2] Liu, J., Li, R., Fu, T., Yin, Z. and Yang, Z. (2022) Interpretation Study of Unknown Oracle Bones Based on Self-Training Learning. Proceedings of the 2022 4th International Conference on Video, Signal and Image Processing, Shanghai, 25-27 November 2022, 7-11. [Google Scholar] [CrossRef
[3] Zhen, Q., Wu, L. and Liu, G. (2024) An Oracle Bone Inscriptions Detection Algorithm Based on Improved YOLOv8. Algorithms, 17, Article No. 174. [Google Scholar] [CrossRef
[4] Xu, H., Zhang, Z., Liang, Q. and Lin, Z. (2024) Research on the Intelligent Oracle Bone Script Recognition Model Based on OTSU and Improved YOLOv8. 2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI), Changchun, 24-26 May 2024, 972-976. [Google Scholar] [CrossRef
[5] Li, D. and Du, B. (2024) Research on Oracle Bone Inscription Segmentation and Recognition Model Based on Deep Learning. 2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI), Changchun, 24-26 May 2024, 1309-1314. [Google Scholar] [CrossRef
[6] Sun, W., Zhai, G., Gao, Z., Chen, T., Zhu, Y. and Wang, Z. (2020) Dual-View Oracle Bone Script Recognition System via Temporal-Spatial Psychovisual Modulation. 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), Shenzhen, 6-8 August 2020, 193-198. [Google Scholar] [CrossRef
[7] Huang, S., Wang, H., Liu, Y., Shi, X. and Jin, L. (2019) OBC306: A Large-Scale Oracle Bone Character Recognition Dataset. 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, 20-25 September 2019, 681-688. [Google Scholar] [CrossRef
[8] Zhu, M., Wu, D., Xu, B., Cao, S., Cao, J. and Wang, K. (2024) Oracle Bone Inscriptions Image Recognition and Classification Method Based on Deep Learning. 2024 IEEE 7th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chongqing, 20-22 September 2024, 1960-1965. [Google Scholar] [CrossRef
[9] Song, L., Hong, C., Gao, T. and Yu, J. (2024) Lightweight Facial Landmark Detection Network Based on Improved MobileViT. Signal, Image and Video Processing, 18, 3123-3131. [Google Scholar] [CrossRef
[10] Jiang, H., Pan, Y., Chen, J., Liu, Z., Zhou, Y., Shu, P., Li, Y., Zhao, H., Mihm, S., Howe, L. and Liu, T. (2024) OracleSage: Towards Unified Visual-Linguistic Understanding of Oracle Bone Scripts through Cross-Modal Knowledge Fusion.
[11] Zheng, Q., Saponara, S., Tian, X., Yu, Z., Elhanashi, A. and Yu, R. (2023) A Real-Time Constellation Image Classification Method of Wireless Communication Signals Based on the Lightweight Network MobileViT. Cognitive Neurodynamics, 18, 659-671. [Google Scholar] [CrossRef
[12] Zhao, L., Li, Q., Zhang, L., Luo, Z., Jiang, H., Jiang, C., et al. (2020) A Virtual Teaching Machine System Based on Oracle-Bone-Script Learning. 2020 IEEE International Conference on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), Exeter, 17-19 December 2020, 1144-1149. [Google Scholar] [CrossRef
[13] Mehta, S. and Rastegari, M. (2021) MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transform-er.
[14] Fu, X., Zhou, R., Yang, X. and Li, C. (2024) Detecting Oracle Bone Inscriptions via Pseudo-Category Labels. Heritage Science, 12, Article No. 107. [Google Scholar] [CrossRef