基于Transformer和双注意力机制的颈动脉超声造影图像斑块分割方法
Segmentation of Carotid Plaque in Contrast-Enhanced Ultrasound Image Based on Transformer and Dual Attention Mechanism
摘要: 脑卒中是一项重大的公共卫生挑战,同时也是全球导致死亡人数最多的疾病之一。颈动脉粥样硬化斑块与脑卒中等缺血性疾病密切相关。颈动脉斑块的早发现和早治疗对预防未来缺血性脑卒中疾病的发生具有重要意义。超声造影(CEUS)已经成为常见的诊断颈动脉斑块的成像方式,因此从CEUS图像中准确分割动脉粥样硬化斑块对于预防和治疗缺血性脑卒中至关重要。然而,由于斑块边界模糊和图像噪声强烈等原因,CEUS图像颈动脉斑块自动分割面临巨大挑战。因此,如何提高颈动脉斑块分割性能仍然是迫切需要解决的问题。本文提出了一种创新的医学图像分割框架,称为DATU-Net,该框架将Swin Transformer模块和双注意力机制集成到U形架构中,以实现CEUS图像颈动脉斑块自动分割。DATU-Net采用基于Swin Transformer模块构建的编码器,可以有效地建模远程依赖关系和多尺度上下文信息。为了获得更丰富的特征表示,我们在编码器–解码器的跳跃连接中引入了双级注意力(Dual-Level Attention)模块,以增强图像特定的位置特征和通道特征,从而有效提高了斑块分割性能。此外,我们在解码器中引入了Swin Transformer模块,用于进一步探索上采样过程中的全局上下文信息,同时逐步恢复特征图。我们利用实际的临床数据集对提出的框架性能进行了评估。广泛的实验仿真结果显示,本文提出的方法在Dice系数(0.8548)、交并比(0.7632)、精确度(0.8746)和召回率(0.8863)等方面始终优于其他分割网络。这些实验证明了DATU-Net的有效性,为颈动脉超声造影图像斑块自动分割问题提供了一种可行的解决方案。
Abstract: Stroke is a major public health challenge and one of the leading causes of death around the world. Carotid atherosclerotic plaque closely correlates with ischemic diseases such as stroke. Early detection and treatment of carotid plaques are important for preventing future ischemic stroke diseases. Contrast-enhanced ultrasound (CEUS) has emerged as a prevalent imaging modality for the diagnosis of carotid plaques, so accurate segmentation of atherosclerotic plaques from CEUS images is crucial for the prevention and treatment of ischemic stroke. However, automatic segmentation of carotid plaques from CEUS images is tremendously challenging due to blurred plaque boundaries and strong noise in images. Therefore, how to improve the performance of carotid plaque segmentation remains an urgent problem. In this paper, we propose an innovative medical image segmentation framework, called DATU-Net, which aims to integrate the Swin Transformer block and the dual-attention mechanism into a U-shaped architecture for automatic carotid plaque segmentation of CEUS images. DATU-Net employs an encoder built upon the Swin Transformer block, proficient in effectively modelling long-range dependencies and multi-scale contextual information. In order to obtain richer feature representations, we introduce a Dual-Level Attention module in the encoder-decoder skip connection to enhance the image-specific positional and channel features, which effectively improves the performance of plaque segmentation. In addition, we introduce the Swin Transformer block in the decoder for further exploring the global contextual information during the up-sampling process while progressively recovering feature maps. We evaluate the performance of the proposed framework using real clinical datasets. Extensive experimental simulation results consistently show that the method proposed in this paper outperforms other segmentation networks in terms of Dice coefficient (0.8548), intersection over union (0.7632), precision (0.8746) and recall (0.8863). These experiments demonstrate the effectiveness of DATU-Net and provide a viable solution to the problem of automatic plaque segmentation in carotid CEUS images.
文章引用:王金生, 孙占全. 基于Transformer和双注意力机制的颈动脉超声造影图像斑块分割方法[J]. 建模与仿真, 2024, 13(2): 1577-1591. https://doi.org/10.12677/mos.2024.132149

参考文献

[1] Organization, W.H. (2019) World Health Statistics Overview 2019: Monitoring Health for the SDGs, Sustainable Development Goals. World Health Organization, Geneva.
[2] Feigin, V.L., Brainin, M., Norrving, B., et al. (2022) World Stroke Organization (WSO): Global Stroke Fact Sheet 2022. International Journal of Stroke, 17, 18-29. [Google Scholar] [CrossRef] [PubMed]
[3] Xie, M., Li, Y., Xue, Y., et al. (2020) Two-Stage and Dual-Decoder Convolutional U-Net Ensembles for Reliable Vessel and Plaque Segmentation in Carotid Ultrasound Images. Proceedings of the 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA), Miami, 14-17 December 2020, 1376-1381. [Google Scholar] [CrossRef
[4] 张蕴, 敖梦. 超声评价颈动脉易损斑块各病理特征的研究进展[J]. 临床医学进展, 2023, 13(2): 2717-2723.
[5] 孙雪, 李伟, 秦志平, 等. 脑梗死患者颈动脉斑块超声造影特征与梗死特征的相关性[J]. 中国实用神经疾病杂志, 2024, 27(1): 59-63.
[6] Chung, Y.E. and Kim, K.W. (2015) Contrast-Enhanced Ultrasonography: Advance and Current Status in Abdominal Imaging. Ultrasonography, 34, 3-18. [Google Scholar] [CrossRef] [PubMed]
[7] Sumathi, K., Mahesh, V. and Ramakrishnan, S. (2014) Analysis of Intima Media Thickness in Ultrasound Carotid Artery Images Using Level Set Segmentation without Re-Initialization. Proceedings of the 2014 International Conference on Informatics, Electronics & Vision (ICIEV), Dhaka, 23-24 May 2014, 1-4. [Google Scholar] [CrossRef
[8] Nagaraj, Y., Hema Sai Teja, A. and Narasimhadhan, A. (2019) Automatic Segmentation of Intima Media Complex in Carotid Ultrasound Images Using Support Vector Machine. Arabian Journal for Science and Engineering, 44, 3489-3496. [Google Scholar] [CrossRef
[9] Destrempes, F., Soulez, G., Giroux, M.-F., et al. (2009) Segmentation of Plaques in Sequences of Ultrasonic B-Mode Images of Carotid Arteries Based on Motion Estimation and Nakagami Distributions. Proceedings of the 2009 IEEE International Ultrasonics Symposium, Rome, 20-23 September 2009, 2480-2483. [Google Scholar] [CrossRef
[10] Hassan, M., Chaudhry, A., Khan, A., et al. (2012) Carotid Artery Image Segmentation Using Modified Spatial Fuzzy C-Means and Ensemble Clustering. Computer Methods and Programs in Biomedicine, 108, 1261-1276. [Google Scholar] [CrossRef] [PubMed]
[11] Loizou, C.P., Petroudi, S., Pattichis, C.S., et al. (2012) Segmentation of Atherosclerotic Carotid Plaque in Ultrasound Video. Proceedings of the 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, San Diego, 28 August-1 September 2012, 53-56. [Google Scholar] [CrossRef
[12] Hoogi, A., Adam, D., Hoffman, A., et al. (2011) Carotid Plaque Vulnerability: Quantification of Neovascularization on Contrast-Enhanced Ultrasound with Histopathologic Correlation. American Journal of Roentgenology, 196, 431-436. [Google Scholar] [CrossRef
[13] Carvalho, D.D., Akkus, Z., Van Den Oord, S.C., et al. (2014) Lumen Segmentation and Motion Estimation in B-Mode and Contrast-Enhanced Ultrasound Images of the Carotid Artery in Patients with Atherosclerotic Plaque. IEEE Transactions on Medical Imaging, 34, 983-993. [Google Scholar] [CrossRef
[14] Azzopardi, C., Camilleri, K.P. and Hicks, Y.A. (2020) Bimodal Automated Carotid Ultrasound Segmentation Using Geometrically Constrained Deep Neural Networks. IEEE Journal of Biomedical and Health Informatics, 24, 1004-1015. [Google Scholar] [CrossRef
[15] Zhou, R., Fenster, A., Xia, Y., et al. (2019) Deep Learning-Based Carotid Media-Adventitia and Lumen-Intima Boundary Segmentation from Three-Dimensional Ultrasound Images. Medical Physics, 46, 3180-3193. [Google Scholar] [CrossRef] [PubMed]
[16] Mi, S., Bao, Q., Wei, Z., et al. (2021) Mbff-Net: Multi-Branch Feature Fusion Network for Carotid Plaque Segmentation in Ultrasound. Proceedings of the Medical Image Computing and Computer Assisted Intervention-MICCAI 2021: 24th International Conference, Strasbourg, 27 September-1 October 2021, 313-322. [Google Scholar] [CrossRef
[17] Meshram, N.H., Mitchell, C.C., Wilbrand, S., et al. (2020) Deep Learning for Carotid Plaque Segmentation Using a Dilated U-Net Architecture. Ultrasonic Imaging, 42, 221-230. [Google Scholar] [CrossRef] [PubMed]
[18] Jain, P.K., Dubey, A., Saba, L., et al. (2022) Attention-Based UNet Deep Learning Model for Plaque Segmentation in Carotid Ultrasound for Stroke Risk Stratification: An Artificial Intelligence Paradigm. Journal of Cardiovascular Development and Disease, 9, Article No. 326. [Google Scholar] [CrossRef] [PubMed]
[19] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[20] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. Proceedings of the Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, 5-9 October 2015, 234-241.
[21] Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., et al. (2018) Unet : A Nested U-Net Architecture for Medical Image Segmentation. Proceedings of the Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, 20 September 2018, 3-11.
[22] Oktay, O., Schlemper, J., Folgoc, L.L., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas.
[23] Ibtehaz, N. and Rahman, M.S. (2020) MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation. Neural Networks, 121, 74-87. [Google Scholar] [CrossRef] [PubMed]
[24] Jha, D., Riegler, M.A., Johansen, D., et al. (2020) Doubleu-Net: A Deep Convolutional Neural Network for Medical Image Segmentation. Proceedings of the 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), Rochester, 28-30 July 2020, 558-564. [Google Scholar] [CrossRef
[25] Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. (2020) An Image Is Worth 16x16 Words: Transformers for Image Recognition At Scale.
[26] Liu, Z., Lin, Y., Cao, Y., et al. (2021) Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. The Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, 10-17 October 2021, 9992-10002. [Google Scholar] [CrossRef
[27] Chen, J., Lu, Y., Yu, Q., et al. (2021) Transunet: Transformers Make Strong Encoders for Medical Image Segmentation.
[28] Zhang, Y., Liu, H. and Hu, Q. (2021) Transfuse: Fusing Transformers and CNNs for Medical Image Segmentation. Proceedings of the Medical Image Computing and Computer Assisted Intervention-MICCAI 2021: 24th International Conference, Strasbourg, 27 September-1 October 2021, Proceedings, 14-24. [Google Scholar] [CrossRef
[29] Cao, H., Wang, Y., Chen, J., et al. (2021) Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation.
[30] Lin, A., Chen, B., Xu, J., et al. (2022) Ds-Transunet: Dual Swin Transformer U-Net for Medical Image Segmentation. IEEE Transactions on Instrumentation and Measurement, 71, 1-15. [Google Scholar] [CrossRef
[31] Fu, J., Liu, J., Tian, H., et al. (2019) Dual Attention Network for Scene Segmentation. The Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 3141-3149. [Google Scholar] [CrossRef
[32] He, K., Zhang, X., Ren, S., et al. (2016) Deep Residual Learning for Image Recognition. The Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[33] De Boer, P.-T., Kroese, D.P., Mannor, S., et al. (2005) A Tutorial on the Cross-Entropy Method. Annals of Operations Research, 134, 19-67. [Google Scholar] [CrossRef
[34] Máttyus, G., Luo, W. and Urtasun, R. (2017) DeepRoadMapper: Extracting Road Topology from Aerial Images. The Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 3458-3466. [Google Scholar] [CrossRef
[35] Zhao, H., Shi, J., Qi, X., et al. (2017) Pyramid Scene Parsing Network. The Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 21-26 July 2017, 6230-6239. [Google Scholar] [CrossRef
[36] Chen, L.-C., Zhu, Y., Papandreou, G., et al. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. The Proceedings of the European Conference on Computer Vision (ECCV), Munich, 8-14 September 2018, 833-851. [Google Scholar] [CrossRef
[37] Yang, M., Yu, K., Zhang, C., et al. (2018) DenseASPP for Semantic Segmentation in Street Scenes. The Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 3684-3692. [Google Scholar] [CrossRef