基于改进UNet模型的眼球超声图像分割算法研究
Research on Segmentation Algorithm of Eye Ultrasound Image Based on Improved UNet Model
DOI: 10.12677/mos.2024.136529, PDF,   
作者: 赵 兵:上海理工大学机械工程学院,上海
关键词: 图像分割残差网络UNet注意力机制Image Segmentation Residual Network UNet Attention Mechanism
摘要: 在医学图像分割领域,提高分割性能一直是一个具有挑战性的任务。超声图像具有边缘模糊、噪声污染等缺点,为了解决眼球超声图像分割结果不理想这一难题,本文提出了一种基于UNet的改进分割算法。首先,本文采用了残差网络(ResNet)结合UNet,有效地解决了模型退化的问题,进一步提高了模型的精度和泛化能力;其次,在主干特征提取部分引入高效多尺度注意力(EMA)机制,以增强分割模型的特征表示能力;最后,通过RAVIR数据集进行泛化性实验,证明了所提出模型的泛化能力。实验结果显示,改进的UNet算法在超声眼球图像数据集上获得的MIoU和Dice的值分别达到了82.5%和82.3%,相比UNet模型分别提升了1.1%和1.4%,具有更好的医学图像分割效果。
Abstract: In the field of medical image segmentation, improving segmentation performance has been a challenging task. Ultrasound images have disadvantages such as blurred edges and noise pollution; in order to solve the complex problem of unsatisfactory segmentation results of eye ultrasound images, this paper proposes an improved segmentation algorithm based on UNet. Firstly, the residual network (ResNet) combined with UNet is used in this paper to effectively solve the problem of model degradation and further improve the accuracy and generalization ability of the model; secondly, the efficient multi-scale attention (EMA) mechanism is introduced in the central feature extraction part to enhance the feature representation ability of the segmentation model; finally, generalizability experiments are carried out with the RAVIR dataset, which proves the generalization ability of the proposed model’s generalization ability. The experimental results show that the improved UNet algorithm achieves 82.5% and 82.3% values of MIoU and Dice on the ultrasound eye image dataset, which are 1.1% and 1.4% higher than the UNet model, and it has better medical image segmentation results.
文章引用:赵兵. 基于改进UNet模型的眼球超声图像分割算法研究[J]. 建模与仿真, 2024, 13(6): 5808-5816. https://doi.org/10.12677/mos.2024.136529

参考文献

[1] Yildiz, S., Memis, A. and Varl, S. (2022) Nuclei Segmentation in Colon Histology Images by Using the Deep CNNs: A U-Net Based Multi-Class Segmentation Analysis. 2022 Medical Technologies Congress, Antalya, 31 October-2 November 2022, 1-4. [Google Scholar] [CrossRef
[2] Yin, P., Yuan, R., Cheng, Y. and Wu, Q. (2020) Deep Guidance Network for Biomedical Image Segmentation. IEEE Access, 8, 116106-116116. [Google Scholar] [CrossRef
[3] Yang, J., Jiao, L., Shang, R., Liu, X., Li, R. and Xu, L. (2023) EPT-Net: Edge Perception Transformer for 3D Medical Image Segmentation. IEEE Transactions on Medical Imaging, 42, 3229-3243. [Google Scholar] [CrossRef] [PubMed]
[4] Ling, Y., Wang, Y., Dai, W., Yu, J., Liang, P. and Kong, D. (2024) MTANet: Multi-Task Attention Network for Automatic Medical Image Segmentation and Classification. IEEE Transactions on Medical Imaging, 43, 674-685. [Google Scholar] [CrossRef] [PubMed]
[5] Hu, X., Li, X., Huang, Z., Chen, Q. and Lin, S. (2024) Detecting Tea Tree Pests in Complex Backgrounds Using a Hybrid Architecture Guided by Transformers and Multi-Scale Attention Mechanism. Journal of the Science of Food and Agriculture, 104, 3570-3584. [Google Scholar] [CrossRef] [PubMed]
[6] Ronneberger, O., Philipp, F. and Thomas, B. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation.
http://arxiv.org/abs/1505.04597
[7] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[8] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[9] Jin, K., Huang, X., Zhou, J., Li, Y., Yan, Y., Sun, Y., et al. (2022) FIVES: A Fundus Image Dataset for Artificial Intelligence Based Vessel Segmentation. Scientific Data, 9, Article No. 475. [Google Scholar] [CrossRef] [PubMed]
[10] Li, Z. and Guo, Y. (2020) Semantic Segmentation of Landslide Images in Nyingchi Region Based on PSPNet Network. 2020 7th International Conference on Information Science and Control Engineering, Changsha, 18-20 December 2020, 1269-1273. [Google Scholar] [CrossRef
[11] Zheng, X., Zhang, S., Li, X., Li, G. and Li, X. (2021) Lightweight Bridge Crack Detection Method Based on SEGNet and Bottleneck Depth-Separable Convolution with Residuals. IEEE Access, 9, 161649-161668. [Google Scholar] [CrossRef
[12] Cheng, L., Xiong, R., Wu, J., Yan, X., Yang, C., Zhang, Y., et al. (2024) Fast Segmentation Algorithm of USV Accessible Area Based on Attention Fast Deeplabv3. IEEE Sensors Journal, 24, 24168-24177. [Google Scholar] [CrossRef
[13] Hatamizadeh, A., Hosseini, H., Patel, N., Choi, J., Pole, C.C., Hoeferlin, C.M., et al. (2022) RAVIR: A Dataset and Methodology for the Semantic Segmentation and Quantitative Analysis of Retinal Arteries and Veins in Infrared Reflectance Imaging. IEEE Journal of Biomedical and Health Informatics, 26, 3272-3283. [Google Scholar] [CrossRef] [PubMed]