胰腺病变图像分割模型研究
Research on Image Segmentation Model of Pancreatic Lesions
DOI: 10.12677/mos.2025.144274, PDF,   
作者: 黄梦滨, 顾春华:上海理工大学光电信息与计算机工程学院,上海
关键词: 胰腺病变分割注意力机制MedSAM高效通道注意力Segmentation of Pancreatic Lesions Attention Medsam ECA
摘要: 胰腺病变分割是医学图像分析中的一项重要任务,但由于胰腺的大小、形状不规则以及位置不固定,导致该任务充满挑战。胰腺病变的早期检测对于患者的治疗和预后至关重要,因此受到了广泛关注。本文提出了一种改进的U-Net网络模型,通过引入MedSAM编码器和高效通道注意力(ECA)模块,显著提升了胰腺病变分割的精度和鲁棒性。MedSAM编码器对输入图像进行预处理,提取多层次、多尺度的特征。MedSAM编码器能够有效捕捉胰腺的复杂结构和病变区域的细节信息,为后续分割提供高质量的特征表达。其次,在上采样过程中引入高效通道注意力(ECA)模块,增强通道间的依赖关系,防止降维导致的通道信息丢失。在MSD_Pancreas数据集上的实验结果表明,本文方法在Dice相似系数(DSC)和豪斯多夫距离(HD95)两个评价指标上均取得了显著提升。具体而言,本文方法的DSC分数达到82.7%,HD95值为10.2 mm,相比基线U-Net分别提高了4.4%和2.3 mm。实验结果表明,本文方法在胰腺病变分割任务中显著优于基线模型,尤其是在处理复杂病变和边界模糊区域时表现突出。
Abstract: Pancreatic lesion segmentation is an important task in medical image analysis, but it is challenging due to the irregular size and shape of the pancreas as well as its location. Early detection of pancreatic lesions is crucial for patient treatment and prognosis, and thus has received much attention. In this paper, we propose an improved U-Net network model that significantly improves the accuracy and robustness of pancreatic lesion segmentation by introducing the MedSAM encoder and the Efficient Channel Attention (ECA) module. The MedSAM encoder preprocesses the input image and extracts multi-level and multi-scale features. The MedSAM encoder is able to efficiently capture the complex structure of the pancreas and the lesion region’s detail information, providing high-quality feature representation for subsequent segmentation. Second, the Efficient Channel Attention (ECA) module is introduced in the up-sampling process to enhance the inter-channel dependencies and prevent the loss of channel information due to dimensionality reduction. The experimental results on the MSD_Pancreas dataset show that this paper’s method achieves significant improvement in both the evaluation metrics of Dice Similarity Coefficient (DSC) and Hausdorff Distance (HD95). Specifically, the DSC score of this paper’s method reaches 82.7% and the HD95 value is 10.2 mm, which are improved by 4.4% and 2.3 mm, respectively, compared with the baseline U-Net. The experimental results show that this paper’s method significantly outperforms the baseline model in the task of pancreatic lesion segmentation, especially when dealing with complex lesions and regions with fuzzy boundaries.
文章引用:黄梦滨, 顾春华. 胰腺病变图像分割模型研究[J]. 建模与仿真, 2025, 14(4): 159-166. https://doi.org/10.12677/mos.2025.144274

参考文献

[1] 杜楠, 魏妙艳, 徐近. 胰腺癌的靶向治疗进展及前沿[J]. 中国临床药理学与治疗学, 2025, 30(2): 183-192.
[2] Ma, J., He, Y., Li, F., Han, L., You, C. and Wang, B. (2024) Segment Anything in Medical Images. Nature Communications, 15, Article No. 654. [Google Scholar] [CrossRef] [PubMed]
[3] Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7132-7141. [Google Scholar] [CrossRef
[4] Woo, S., Park, J., Lee, J. and Kweon, I.S. (2018) CBAM: Convolutional Block Attention Module. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 3-19. [Google Scholar] [CrossRef
[5] Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W. and Hu, Q. (2020. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 11531-11539. [Google Scholar] [CrossRef
[6] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, 5-9 October 2015, 234-241.
[7] Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Misawa, K., Rueckert, D., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas. arXiv: 1804.03999.
[8] 李昊轩, 窦凤岐. 基于深度学习的胰腺肿瘤分割算法研究[J]. 电脑编程技巧与维护, 2023(8): 119-121.
[9] 张国栋, 唐晓艺, 鞠蓉晖, 等. 融合双注意力机制的多尺度胰腺分割方法[J]. 计算机工程与设计, 2024, 45(4): 1189-1194.
[10] 生琳, 王朝立. 基于改进MaxViT的胰腺肿瘤图像分割[J]. 软件, 2024, 45(6): 13-15.
[11] Milletari, F., Navab, N. and Ahmadi, S. (2016) V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. 2016 Fourth International Conference on 3D Vision (3DV), Stanford, 25-28 October 2016, 565-571. [Google Scholar] [CrossRef
[12] Chen, J.N., et al. (2021) TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation. arXiv: 2102.04306.
[13] Diakogiannis, F.I., Waldner, F., Caccetta, P. and Wu, C. (2020) ResUNet-a: A Deep Learning Framework for Semantic Segmentation of Remotely Sensed Data. ISPRS Journal of Photogrammetry and Remote Sensing, 162, 94-114. [Google Scholar] [CrossRef
[14] Chen, L., Zhu, Y., Papandreou, G., Schroff, F. and Adam, H. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y., Eds., Computer VisionECCV 2018, Springer, 833-851. [Google Scholar] [CrossRef