基于改进U-Net模型的咽拭子图像分割研究
Research on Pharyngeal Swab Image Segmentation Based on an Improved U-Net Model
摘要: 为提高咽拭子图像中口腔M区域的分割效果,本文提出一种改进U-Net模型。针对传统U-Net的浅层特征冗余与边缘敏感性不足的问题,引入循环残差模块增强边缘特征传递能力,并结合注意力机制抑制背景噪声,优化多尺度特征融合策略。实验表明,相比先前研究所采用的U-Net方法,改进模型的交并比(IoU)、Dice相似系数(DSC)和像素精确率(PA)分别提高3.8%、2.1%和3.5%,可精准分割复杂背景下低对比度的口腔M目标区域,为咽拭子采样机器人提供高效的视觉支持,降低人工采样风险。
Abstract: This paper proposes an improved U-Net model to enhance the segmentation performance of the oral M region in pharyngeal swab images. To address the issues of redundant shallow features and insufficient edge sensitivity in traditional U-Net models, a recurrent residual module is introduced to enhance edge feature propagation. Additionally, an attention mechanism is incorporated to suppress background noise and optimize the multi-scale feature fusion strategy. Experimental results demonstrate that compared to the U-Net method used in previous studies, the improved model achieves increases of 3.8%, 2.1%, and 3.5% in Intersection over Union (IoU), Dice Similarity Coefficient (DSC), and Pixel Accuracy (PA), respectively, which can enable precise segmentation of the low-contrast oral M region in complex backgrounds, providing efficient visual support for pharyngeal swab sampling robots and reducing the risks associated with manual sampling.
文章引用:贾劼. 基于改进U-Net模型的咽拭子图像分割研究[J]. 人工智能与机器人研究, 2025, 14(3): 605-611. https://doi.org/10.12677/airr.2025.143059

参考文献

[1] Li, S., Guo, W., Liu, H., Wang, T., Zhou, Y., Yu, T., et al. (2020) Clinical Application of an Intelligent Oropharyngeal Swab Robot: Implication for the COVID-19 Pandemic. European Respiratory Journal, 56, Article ID: 2001912. [Google Scholar] [CrossRef] [PubMed]
[2] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[3] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Lecture Notes in Computer Science, Springer International Publishing, 234-241. [Google Scholar] [CrossRef
[4] Litjens, G., Kooi, T., Bejnordi, B.E., Setio, A.A.A., Ciompi, F., Ghafoorian, M., et al. (2017) A Survey on Deep Learning in Medical Image Analysis. Medical Image Analysis, 42, 60-88. [Google Scholar] [CrossRef] [PubMed]
[5] Roy, A.G., Navab, N. and Wachinger, C. (2018) Concurrent Spatial and Channel ‘Squeeze & Excitation’ in Fully Convolutional Networks. In: Frangi, A., Schnabel, J., Davatzikos, C., Alberola-López, C. and Fichtinger, G., Eds., Lecture Notes in Computer Science, Springer International Publishing, 421-429. [Google Scholar] [CrossRef
[6] 邱纯乾, 陈建森, 郑茜颖. 基于改进CENet的新冠肺炎CT图像感染区域分割[J]. 传感器与微系统, 2023, 42(11): 139-142+146.
[7] 刘柏男. 视觉引导咽拭子机器人智能采集关键技术研究[D]: [硕士学位论文]. 哈尔滨: 哈尔滨工业大学, 2021.