基于SAM注意力机制的头颈部危及器官 CT图像分割
Automatic Segmentation of Organs-at-Risk in Head-and-Neck Carcinoma from Radiation Therapy Using Multi-Scale Fusion and Attention-Based Mechanisms
摘要: 目的:本文提出了一种基于残差和注意力机制U-Net的头颈部医学影像危及器官分割新方法。方法:在nnU-Net编码阶段,提出将SAM注意力模块与残差方法相结合的残差注意力模块,以增强特征表达能力;在解码阶段引入残差注意力模块,根据分割任务提高特征加权的相关性。结果:该方法基于包含22个头颈部危及器官的真实医学影像数据集进行评估,实验结果表明,与现有方法相比,所提方法的平均分割准确率提升了11.4%。对22种头颈部危及器官分割的平均骰子相似系数(Dice Similarity Coefficient, DSC)为87.2%。结论:基于可分离卷积和注意力机制的U-Net卷积神经网络对鼻咽癌靶区达到了更好的分割精度,表明该方法有望帮助临床医生提高放射治疗的准确性和效率。
Abstract: We proposed a new organs-at-risk segmentation method for medical images of heads and necks based on the U-Net with residuals and attention mechanism. A SAM block combined with a residual method is proposed as Residual-SAM Block in the nnU-Net encoding stage in order to enhance the feature expression ability. SAM are introduced in the decoding stage to increase the relevance of feature weighting in accord with segmentation tasks. The proposed method has been evaluated through a set of real world medical images including 22 organs at risk in the head and neck. Experimental results showed that our proposed method improved the average segmentation accuracy by 11.4% compared with current existing methods. The average Dice Similarity Coefficient (DSC) score on the segmentation of the 22 types of head-and-neck related organs at risk is 87.2%. It demonstrates that this approach is promising for clinical doctors to improve their accuracy and efficiency in radiotherapy.
文章引用:林小惟, 张福全. 基于SAM注意力机制的头颈部危及器官 CT图像分割[J]. 临床医学进展, 2026, 16(3): 2644-2650. https://doi.org/10.12677/acm.2026.1631064

参考文献

[1] Wei, W.I. and Sham, J.S. (2005) Nasopharyngeal Carcinoma. The Lancet, 365, 2041-2054. [Google Scholar] [CrossRef] [PubMed]
[2] Chang, E.T. and Adami, H. (2006) The Enigmatic Epidemiology of Nasopharyngeal Carcinoma. Cancer Epidemiology, Biomarkers & Prevention, 15, 1765-1777. [Google Scholar] [CrossRef] [PubMed]
[3] Han, X., Hoogeman, M.S., Levendag, P.C., Hibbard, L.S., Teguh, D.N., Voet, P., et al. (2008) Atlas-Based Auto-Segmentation of Head and Neck CT Images. In: Metaxas, D., Axel, L., Fichtinger, G. and Székely, G., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2008, Springer, 434-441. [Google Scholar] [CrossRef] [PubMed]
[4] Daisne, J. and Blumhofer, A. (2013) Atlas-Based Automatic Segmentation of Head and Neck Organs at Risk and Nodal Target Volumes: A Clinical Validation. Radiation Oncology, 8, Article No. 154. [Google Scholar] [CrossRef] [PubMed]
[5] Gollmer, S.T. and M. Buzug, T. (2013) Relaxed Statistical Shape Models for 3D Image Segmentation—Application to Mandible Bone in Cone-Beam CT Data. Current Medical Imaging Reviews, 9, 129-137. [Google Scholar] [CrossRef
[6] Fitton, I., Cornelissen, S.A.P., Duppen, J.C., Steenbakkers, R.J.H.M., Peeters, S.T.H., Hoebers, F.J.P., et al. (2011) Semi‐automatic Delineation Using Weighted CT‐MRI Registered Images for Radiotherapy of Nasopharyngeal Cancer. Medical Physics, 38, 4662-4666. [Google Scholar] [CrossRef] [PubMed]
[7] Zhou, J., Chan, K.L., Xu, P. and Chong, V.F.H. (2006) Nasopharyngeal Carcinoma Lesion Segmentation from MR Images by Support Vector Machine. 3rd IEEE International Symposium on Biomedical Imaging: Macro to Nano, 2006, Arlington, 6-9 April 2006, 1364-1367. [Google Scholar] [CrossRef
[8] Lei, W., Wang, H., Gu, R., Zhang, S., Zhang, S. and Wang, G. (2019) DeepIGeoS-V2: Deep Interactive Segmentation of Multiple Organs from Head and Neck Images with Lightweight CNNs. In: Zhou, L., et al., Eds., Large-Scale Annotation of Biomedical Data and Expert Label Synthesis and Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention, Springer, 61-69. [Google Scholar] [CrossRef
[9] Shin, H., Roth, H.R., Gao, M., Lu, L., Xu, Z., Nogues, I., et al. (2016) Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE Transactions on Medical Imaging, 35, 1285-1298. [Google Scholar] [CrossRef] [PubMed]
[10] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[11] Ibragimov, B. and Xing, L. (2017) Segmentation of Organs‐At‐Risks in Head and Neck CT Images Using Convolutional Neural Networks. Medical Physics, 44, 547-557. [Google Scholar] [CrossRef] [PubMed]
[12] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Springer, 234-241. [Google Scholar] [CrossRef
[13] Kirillov, A., Heitz, G., Girshick, R., Rother, C. and Dollár, P. (2023) Segment Anything. arXiv: 2304.02643.
https://arxiv.org/abs/2304.02643
[14] Gao, Y., Huang, R., Chen, M., Wang, Z., Deng, J., Chen, Y., et al. (2019) FocusNet: Imbalanced Large and Small Organ Segmentation with an End-To-End Deep Neural Network for Head and Neck CT Images. In: Shen, D., et al., Eds., Medical Image Computing and Computer Assisted InterventionMICCAI 2019, Springer, 829-838. [Google Scholar] [CrossRef
[15] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[16] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 3-19. [Google Scholar] [CrossRef
[17] Roy, A.G., Navab, N. and Wachinger, C. (2018) Concurrent Spatial and Channel ‘Squeeze & Excitation’ in Fully Convolutional Networks. In: Frangi, A., Schnabel, J., Davatzikos, C., Alberola-López, C. and Fichtinger, G., Eds., Medical Image Computing and Computer Assisted InterventionMICCAI 2018, Springer, 421-429. [Google Scholar] [CrossRef