基于YOLOv8改进的牙齿X光医学图像分割模型
Dental X-Ray Medical Image Segmentation Model Improved Based on YOLOv8
DOI: 10.12677/mos.2025.146504, PDF,   
作者: 杜一族, 张学典*:上海理工大学光电信息与计算机工程学院,上海
关键词: 牙齿分割注意力机制实例分割YOLOv8Tooth Segmentation Attention Mechanism Instance Segmentation YOLOv8
摘要: 医学牙齿图像对于帮助医生根据图像中的器官、组织和病变视觉表现,来快速进行诊断和临床干预。其中牙齿分割和编号是口腔分析诊断的重要开始。在牙科治疗中,X光片是一种常见的诊断工具,在诊断中被医生用于检查牙齿、牙龈、颚骨和口腔骨骼结构的状态。自动准确地分割牙齿位置和区域,对牙齿分割模型非常重要,由于牙科X射线图像的对比度差,特征提取困难,导致预测掩膜质量较低。本文为了提高牙齿X光图像分割的质量提出了YDBD模型。该模型基于YOLOv8改进的EfficientNet骨干网络中加入了空间注意力机制,增加了牙齿位置的识别的准确性。在其Neck结构中,使用了动态上采样来减少边缘伪影,并加入高层特征图拼接,对边缘特征进行更好的提取,提高预测掩膜贴合度。实验结果表明准确性和平均精度均值方面对比原模型有部分提升。
Abstract: Medical dental images play a crucial role in helping doctors make rapid diagnoses and clinical interventions based on the visual manifestations of organs, tissues, and lesions in the images. Among them, tooth segmentation and numbering are important starting points for oral analysis and diagnosis. In dental treatment, X-rays are a common diagnostic tool, which are used by doctors to examine the condition of teeth, gums, jawbones and oral skeletal structures during diagnosis. Accurate and automatic segmentation of tooth positions and regions is of great significance for the tooth segmentation model. Due to the poor contrast of dental X-ray images and the difficulty in feature extraction, the quality of the prediction mask is relatively low. In this paper, to improve the quality of dental X-ray image segmentation, the YDBD model is proposed. This model is based on the improved EfficientNet backbone network of YOLOv8 and incorporates a spatial attention mechanism, which enhances the accuracy of tooth position recognition. During upsampling, dynamic upsampling is used to reduce edge artifacts and high-level feature maps are concatenated to better extract edge features, thereby improving the fitting degree of the prediction mask. Experimental results show that there is a partial improvement in terms of accuracy and mean average precision compared to the original model.
文章引用:杜一族, 张学典. 基于YOLOv8改进的牙齿X光医学图像分割模型[J]. 建模与仿真, 2025, 14(6): 370-382. https://doi.org/10.12677/mos.2025.146504

参考文献

[1] World Health Organization (2022) Global Oral Health Status Report: Towards Universal Health Coverage for Oral Health by 2030. World Health Organization.
[2] Zhang, T., Zhang, Q., Peng, Y., Zheng, K., Yang, L., Xiao, Q., et al. (2022) Retrospective Summary Analysis on the Results of Oral Health Epidemiological Investigations in China. Journal of Oral Biology and Craniofacial Research, 12, 809-817. [Google Scholar] [CrossRef] [PubMed]
[3] Vimala, R. and Preethi, D.M.D. (2024) Maxillary Sinus Disease Detection and Analysis Approaches in Deep Learning: Survey. 2024 International Conference on Cognitive Robotics and Intelligent Systems (ICCROBINS), Coimbatore, 17-19 April 2024, 173-181. [Google Scholar] [CrossRef
[4] Tandon, D., Rajawat, J. and Banerjee, M. (2020) Present and Future of Artificial Intelligence in Dentistry. Journal of Oral Biology and Craniofacial Research, 10, 391-396. [Google Scholar] [CrossRef] [PubMed]
[5] Wu, D., Jiang, J., Wang, J., Bi, Z. and Yu, G. (2025) Segmentation Algorithms of Dental CT Images: A Comprehensive Review from Classical to Deep Learning Trend. Expert Systems with Applications, 275, Article ID: 126853. [Google Scholar] [CrossRef
[6] Zhang, L., Li, W., Lv, J., Xu, J., Zhou, H., Li, G., et al. (2023) Advancements in Oral and Maxillofacial Surgery Medical Images Segmentation Techniques: An Overview. Journal of Dentistry, 138, Article ID: 104727. [Google Scholar] [CrossRef] [PubMed]
[7] Suganyadevi, S., Seethalakshmi, V. and Balasamy, K. (2021) A Review on Deep Learning in Medical Image Analysis. International Journal of Multimedia Information Retrieval, 11, 19-38. [Google Scholar] [CrossRef] [PubMed]
[8] He, K., Gkioxari, G., Dollar, P. and Girshick, R. (2017) Mask R-CNN. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef
[9] Wang, X., Zhang, R., Kong, T., et al. (2020) Solov2: Dynamic and Fast Instance Segmentation. Advances in Neural Information Processing Systems, 33, 17721-17732.
[10] Tuzoff, D.V., Tuzova, L.N., Bornstein, M.M., Krasnov, A.S., Kharchenko, M.A., Nikolenko, S.I., et al. (2019) Tooth Detection and Numbering in Panoramic Radiographs Using Convolutional Neural Networks. Dentomaxillofacial Radiology, 48, Article ID: 20180051. [Google Scholar] [CrossRef] [PubMed]
[11] Koch, T.L., Perslev, M., Igel, C. and Brandt, S.S. (2019) Accurate Segmentation of Dental Panoramic Radiographs with U-NETS. 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, 8-11 April 2019, 15-19. [Google Scholar] [CrossRef
[12] Lee, J., Kim, D., Jeong, S. and Choi, S. (2018) Detection and Diagnosis of Dental Caries Using a Deep Learning-Based Convolutional Neural Network Algorithm. Journal of Dentistry, 77, 106-111. [Google Scholar] [CrossRef] [PubMed]
[13] Chung, M., Lee, J., Park, S., Lee, M., Lee, C.E., Lee, J., et al. (2021) Individual Tooth Detection and Identification from Dental Panoramic X-Ray Images via Point-Wise Localization and Distance Regularization. Artificial Intelligence in Medicine, 111, Article ID: 101996. [Google Scholar] [CrossRef] [PubMed]
[14] Jader, G., Fontineli, J., Ruiz, M., Abdalla, K., Pithon, M. and Oliveira, L. (2018) Deep Instance Segmentation of Teeth in Panoramic X-Ray Images. 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Parana, 29 October-1 November 2018, 400-407. [Google Scholar] [CrossRef
[15] Silva, B., Pinheiro, L., Oliveira, L. and Pithon, M. (2020) A Study on Tooth Segmentation and Numbering Using End-To-End Deep Neural Networks. 2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Porto de Galinhas, 7-10 November 2020, 164-171. [Google Scholar] [CrossRef
[16] Zhao, Y., Li, P., Gao, C., Liu, Y., Chen, Q., Yang, F., et al. (2020) TSASNet: Tooth Segmentation on Dental Panoramic X-Ray Images by Two-Stage Attention Segmentation Network. Knowledge-Based Systems, 206, Article ID: 106338. [Google Scholar] [CrossRef
[17] 丁状状, 侯俊, 梁善淇, 等. 牙齿图像分割算法研究[J]. 软件工程与应用, 2022, 11(6): 1282-1287.
[18] Varghese, R. and M., S. (2024) YOLOv8: A Novel Object Detection Algorithm with Enhanced Performance and Robustness. 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), Chennai, 18-19 April 2024, 1-6. [Google Scholar] [CrossRef
[19] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Springer, 234-241. [Google Scholar] [CrossRef
[20] Kamencay, P., Zachariasova, M., Hudec, R., Benco, M. and Radil, R. (2014) 3D Image Reconstruction from 2D CT Slices. 2014 3DTV-Conference: The True VisionCapture, Transmission and Display of 3D Video (3DTV-CON), Budapest, 2-4 July 2014, 1-4. [Google Scholar] [CrossRef
[21] Li, Q., Chen, K., Han, L., Zhuang, Y., Li, J. and Lin, J. (2020) Automatic Tooth Roots Segmentation of Cone Beam Computed Tomography Image Sequences Using U-Net and RNN. Journal of X-Ray Science and Technology: Clinical Applications of Diagnosis and Therapeutics, 28, 905-922. [Google Scholar] [CrossRef] [PubMed]
[22] Christ, P.F., Elshaer, M.E.A., Ettlinger, F., Tatavarty, S., Bickel, M., Bilic, P., et al. (2016) Automatic Liver and Lesion Segmentation in CT Using Cascaded Fully Convolutional Neural Networks and 3D Conditional Random Fields. In: Ourselin, S., Joskowicz, L., Sabuncu, M., Unal, G. and Wells, W., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2016, Springer, 415-423. [Google Scholar] [CrossRef
[23] Xia, Z., Pan, X., Song, S., Li, L.E. and Huang, G. (2022) Vision Transformer with Deformable Attention. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 4784-4793. [Google Scholar] [CrossRef
[24] Tan, M., Pang, R. and Le, Q.V. (2020) EfficientDet: Scalable and Efficient Object Detection. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 10778-10787. [Google Scholar] [CrossRef
[25] Liu, W., Lu, H., Fu, H. and Cao, Z. (2023) Learning to Upsample by Learning to Sample. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 1-6 October 2023, 6004-6014. [Google Scholar] [CrossRef
[26] Mehla, N., Ishita, Talukdar, R. and Sharma, D.K. (2023) Object Detection in Autonomous Maritime Vehicles: Comparison between YOLO V8 and EfficientDet. In: Namasudra, S., Trivedi, M.C., Crespo, R.G. and Lorenz, P., Eds., Data Science and Network Engineering, Springer, 125-141. [Google Scholar] [CrossRef