基于利尿肾图的新型肾实质自动分割算法
A Novel Automatic Renal Parenchyma Segmentation Algorithm Based on the Diuretic Renal Image
DOI: 10.12677/mos.2024.132158, PDF,    国家自然科学基金支持
作者: 朱郭慧, 孙占全:上海理工大学光电信息与计算机工程学院,上海
关键词: 医学图像分割语义分割注意力机制空间金字塔感受野膨胀卷积Medical Image Segmentation Semantic Segmentation Attention Mechanisms Spatial Pyramids Sensory Fields Inflated Convolution
摘要: 尽管深度学习图像分割技术在医学图像处理中取得了良好的效果,但由于利尿肾动态显像具有噪声明显、对比度低、图像质量差和边界不清晰的特点,大多数现有的分割方法在获取感受野和提取图像特征信息方面仍然面临很大的挑战。为了解决上述问题,本文提出了一种新的肾实质自动分割算法,该算法采用的是Swin-Transformer编码器与解码器,并且结合了特征融合模块、连续扩张卷积模块与深度注意力模块。特征增强模块中的空间金字塔池模块,可以弥补空间表征并产生多尺度表征。连续扩张卷积能通过获取多尺度上下文聚合来扩大感受野。注意力模块通过顺序捕获多尺度编码器特征之间的关系来解决编码器和解码器特征之间的语义差距。该算法应用于私人的利尿肾造影数据集。仿真实验结果表明,与其他深度学习分割方法比较,该方法可以显著提高肾脏分割性能。
Abstract: Although deep learning image segmentation techniques have achieved good results in medical image processing, most of the existing segmentation methods still face great challenges in acquiring the sensory field and extracting the image feature information due to the fact that diuretic renal dynamic imaging is characterised by significant noise, low contrast, poor image quality, and unclear boundaries. In order to solve the above problems, this paper proposes a new automatic renal parenchyma segmentation algorithm, which employs a Swin-Transformer encoder and decoder, and combines a feature Enhancement module, a successive dilation convolution module, and a deep attention module. The spatial pyramid pooling module in the feature Enhancement module compensates for spatial representations and produces multi-scale representations. Sequential Expansion Convolution can expand the sensory field by capturing multiscale context aggregation. The Attention module addresses the semantic gap between encoder and decoder features by sequentially capturing relationships between multiscale encoder features. The algorithm is applied to a private diuretic nephrography dataset. The results of simulation experiments show that the method can significantly improve the performance of renal segmentation compared to other deep learning segmentation methods.
文章引用:朱郭慧, 孙占全. 基于利尿肾图的新型肾实质自动分割算法[J]. 建模与仿真, 2024, 13(2): 1673-1683. https://doi.org/10.12677/mos.2024.132158

参考文献

[1] Gordon, I., Piepsz, A. and Sixt, R. (2011) Guidelines for Standard and Diuretic Renogram in Children. European Journal of Nuclear Medicine and Molecular Imaging, 38, 1175-1188. [Google Scholar] [CrossRef] [PubMed]
[2] Taha, A., Lo, P., Li, J. and Zhao, T. (2018) Kid-Net: Convolution Networks for Kidney Vessels Segmentation from Ct-Volumes. Medical Image Computing and Computer Assisted Intervention-MICCAI 2018: 21st International Conference, Granada, 16-20 September 2018, 463-471. [Google Scholar] [CrossRef
[3] Koff, S.A., Thrall, J.H. and Keyes Jr., J.W. (1980) Assessment of Hydroureteronephrosis in Children Using Diuretic Radionuclide Urography. The Journal of Urology, 123, 531-534. [Google Scholar] [CrossRef
[4] Da Cruz, L.B., Araújo, J.D.L., Ferreira, J.L., Diniz, J.O.B., Silva, A.C., De Almeida, J.D.S., De Paiva, A.C. and Gattass, M. (2020) Kidney Segmentation from Computed Tomography Images Using Deep Neural Network. Computers in Biology and Medicine, 123, Article ID: 103906. [Google Scholar] [CrossRef] [PubMed]
[5] Sharma, K., Rupprecht, C., Caroli, A., Aparicio, M.C., Remuzzi, A., Baust, M. and Navab, N. (2017) Automatic Segmentation of Kidneys Using Deep Learning for Total Kidney Volume Quantification in Autosomal Dominant Polycystic Kidney Disease. Scientific Reports, 7, Article No. 2049. [Google Scholar] [CrossRef] [PubMed]
[6] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[7] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, 5-9 October 2015, 234-241. [Google Scholar] [CrossRef
[8] Schlemper, J., Oktay, O., Schaap, M., Heinrich, M., Kainz, B., Glocker, B. and Rueckert, D. (2019) Attention Gated Networks: Learning to Leverage Salient Regions in Medical Images. Medical Image Analysis, 53, 197-207. [Google Scholar] [CrossRef] [PubMed]
[9] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł. and Polosukhin, I. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, 4-9 December 2017, 6000-6010.
[10] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G. and Gelly, S. (2020) An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale.
[11] Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., Lu, L., Yuille, A.L. and Zhou, Y. (2021) Transunet: Transformers Make Strong Encoders for Medical Image Segmentation.
[12] Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S. and Guo, B. (2021) Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, 10-17 October 2021, 10012-10022. [Google Scholar] [CrossRef
[13] Halkar, R., Chrem, Y., Galt, J., Rothfusz, B., Grant, S., Jones, M., Folks, R., Alazraki, N. and Taylor, A. (1996) Interoperator Variability in Quantitating the MAG3 Renal Uptake Based on Semiautomated and Manual Regions of Interest. Journal of Nuclear Medicine, 37, 285-293.
[14] Tomaru, Y., Inoue, T., Oriuchi, N., Takahashi, K. and Endo, K. (1998) Semi-Automated Renal Region of Interest Selection Method Using the Double-Threshold Technique: Inter-Operator Variability in Quantitating 99m Tc-MAG3 Renal Uptake. European Journal of Nuclear Medicine, 25, 55-59. [Google Scholar] [CrossRef] [PubMed]
[15] Inoue, Y., Yoshikawa, K., Yoshioka, N., Watanabe, T., Saegusa, S., Kaneko, Y., Yokoyama, I. and Ohtomo, K. (2000) Evaluation of Renal Function with 99mTc-MAG3 Using Semiautomated Regions of Interest. Journal of Nuclear Medicine, 41, 1947-1954.
[16] Sun, P., Mo, Z., Hu, F., Song, X., Mo, T., Yu, B., Zhang, Y. and Chen, Z. (2022) Segmentation of Kidney Mass Using AgDenseU-Net 2.5D Model. Computers in Biology and Medicine, 150, Article ID: 106223. [Google Scholar] [CrossRef] [PubMed]
[17] Jing, X., Zheng, X., Song, S. and Liu, K. (2017) A Computer Tomography Assisted Method for the Automatic Detection of Region of Interest in Dynamic Kidney Images. Journal of Biomedical Engineering, 34, 928-933.
[18] Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Misawa, K., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas.
[19] Wu, Z., Hai, J., Zhang, L., Chen, J., Cheng, G. and Yan, B. (2019) Cascaded Fully Convolutional DenseNet for Automatic Kidney Segmentation in Ultrasound Images. 2019 2nd International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, 25-28 May 2019, 384-388. [Google Scholar] [CrossRef
[20] Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. and Adam, H. (2018) Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, 8-14 September 2018, 801-818. [Google Scholar] [CrossRef
[21] Hsiao, C.H., Lin, P.C., Chung, L.A., Lin, F.Y., Yang, F.J., Yang, S.Y., et al. (2022) A Deep Learning-Based Precision and Automatic Kidney Segmentation System Using Efficient Feature Pyramid Networks in Computed Tomography Images. Computer Methods and Programs in Biomedicine, 221, Article ID: 106854. [Google Scholar] [CrossRef] [PubMed]
[22] Azad, R., Heidari, M., Shariatnia, M., Aghdam, E.K., Karimijafarbigloo, S., Adeli, E. and Merhof, D. (2022) Transdeeplab: Convolution-Free Transformer-Based Deeplab V3 for Medical Image Segmentation. In: International Workshop on Predictive Intelligence in Medicine, Springer, Berlin, 91-102. [Google Scholar] [CrossRef
[23] Hu, H., Gu, J., Zhang, Z., Dai, J. and Wei, Y. (2018) Relation Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 3588-3597. [Google Scholar] [CrossRef
[24] Hu, H., Zhang, Z., Xie, Z. and Lin, S. (2019) Local Relation Networks for Image Recognition. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, 27 October-2 November 2019, 3464-3473. [Google Scholar] [CrossRef
[25] Hendrycks, D. and Gimpel, K. (2016) Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units.