融合注意力机制的残差双通道多尺度的胰腺囊性肿瘤分割模型
Pancreatic Cystic Tumor Segmentation with Fusion Attention Mechanism and Residual Dual-Channel Multi-Scale Approach
DOI: 10.12677/mos.2024.134373, PDF,    国家自然科学基金支持
作者: 何 聪, 戴俊龙, 武 杰*:上海理工大学健康科学与工程学院,上海;边 云*:海军军医大学第一附属医院放射诊疗科,上海
关键词: 胰腺囊性肿瘤注意力机制图像分割多尺度双通道Pancreatic Cystic Tumor Attention Mechanism Image Segmentation Multi-Scale Dual-Channel
摘要: 本文针对胰腺肿瘤的大小形状差异大、区域边界不清晰等问题,提出了一种融合注意力机制的残差双通道多尺度的胰腺囊性肿瘤分割模型ARDM-Net (Attention Residual Dual Channel and Multi Scale-UNet)。首先,对长海医院提供的数据集进行肿瘤区域的裁剪;其次,在U-Net网络模型上,将基础的3 × 3卷积模块替换为残差双通道多尺度卷积模块,增强网络特征提取的能力;最后在跳跃连接中加入注意力模块,调整学习特征的权重。本文的方法性能表现,Dice相似系数为89.50%,豪斯多夫距离(Hausdorff Distance, HD)为2.80 mm,交并比(Intersection-over-Union, IoU)为82.46%,均优于普通的U-Net网络。该试验结果充分表现了本文方法在胰腺囊性肿瘤分割任务中的显著价值。
Abstract: This paper proposes a fusion Attention Mechanism and Residual Dual-Channel Multi-Scale model (ARDM-Net) for the segmentation of pancreatic cystic tumors, addressing the challenges of significant differences in size and shape, as well as unclear regional boundaries. Initially, tumor regions were cropped from the dataset provided by Changhai Hospital. Subsequently, the basic 3x3 convolution module in the U-Net network model was replaced with a Dual-Channel and multi-scale convolution module to enhance feature extraction capability. Finally, an attention mechanism module was incorporated into the skip connection to adjust the weight of learning features. The proposed method in this paper achieves a performance with the Dice similarity coefficient of 89.50%, Hausdorff Distance (HD) of 2.80 mm, and Intersection-over-Union (IoU) of 82.46%, all of which outperformed the standard U-Net network. These experimental results demonstrate the significant value of our approach in the segmentation of pancreatic cystic tumors.
文章引用:何聪, 戴俊龙, 武杰, 边云. 融合注意力机制的残差双通道多尺度的胰腺囊性肿瘤分割模型[J]. 建模与仿真, 2024, 13(4): 4120-4127. https://doi.org/10.12677/mos.2024.134373

参考文献

[1] 徐佳薇, 武杰, 雷宇, 等. 用于烟雾病检测的Faster RCNN改进算法[J]. 数据采集与处理, 2022, 37(6): 1391-1400.
[2] Buda, M., Wildman-Tobriner, B., Castor, K., Hoang, J.K. and Mazurowski, M.A. (2020) Deep Learning-Based Segmentation of Nodules in Thyroid Ultrasound: Improving Performance by Utilizing Markers Present in the Images. Ultrasound in Medicine & Biology, 46, 415-421. [Google Scholar] [CrossRef] [PubMed]
[3] Wang, L., Yang, S., Yang, S., Zhao, C., Tian, G., Gao, Y., et al. (2019) Automatic Thyroid Nodule Recognition and Diagnosis in Ultrasound Imaging with the YOLOv2 Neural Network. World Journal of Surgical Oncology, 17, Article No. 12. [Google Scholar] [CrossRef] [PubMed]
[4] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation.
[5] Oktay, O., Schlemper, J., Folgoc, L.L., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas.
[6] Ibtehaz, N. and Rahman, M.S. (2020) MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation. Neural Networks, 121, 74-87. [Google Scholar] [CrossRef] [PubMed]
[7] Yang, X., Chen, Y., Yue, X., Ma, C. and Yang, P. (2021) Local Linear Embedding Based Interpolation Neural Network in Pancreatic Tumor Segmentation. Applied Intelligence, 52, 8746-8756. [Google Scholar] [CrossRef
[8] Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. and Wojna, Z. (2016) Rethinking the Inception Architecture for Computer Vision. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 2818-2826. [Google Scholar] [CrossRef
[9] Huang, G., Liu, Z., Van Der Maaten, L. and Weinberger, K.Q. (2017). Densely Connected Convolutional Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 2261-2269.[CrossRef
[10] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[11] 侯向丹, 赵一浩, 刘洪普, 郭鸿湧, 于习欣, 丁梦园. 融合残差注意力机制的UNet视盘分割[J]. 中国图象图形学报, 2020, 25(9): 15.
[12] Schlemper, J., Oktay, O., Schaap, M., Heinrich, M., Kainz, B., Glocker, B., et al. (2019) Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images. Medical Image Analysis, 53, 197-207. [Google Scholar] [CrossRef] [PubMed]