多尺度融合编码与自注意力的肺部CT分割算法
Lung CT Image Segmentation Algorithm with Multi-Scale Fusion Encoder and Self-Attention
DOI: 10.12677/MOS.2023.125421, PDF,    科研立项经费支持
作者: 彭佑菊*, 蒋诗怡, 熊举举:贵州大学大数据与信息工程学院,贵州 贵阳;徐 杨#:贵州大学大数据与信息工程学院,贵州 贵阳;贵阳铝镁设计研究院有限公司,贵州 贵阳
关键词: 肺炎图像分割多尺度融合编码器自注意力机制深度学习Pneumonia Image Segmentation Multi-Scale Fusion Encoder Self-Attention Mechanism Deep Learning
摘要: 计算机断层扫描(CT)在当前是一种辅助检测肺炎的有效手段,但病理表征的复杂性给医生诊断时带来不便,难以准确地对图像进行分割。为进一步辅助医生根据病理表征诊断病情,本文基于U-net提出了一种多尺度融合编码网络,并结合自注意力机制,力图在辅助医生判断的角度提供可行性方案。为了获取不同尺度的语义信息,首先在编码器部分设计了一种多尺度融合编码器模块,提取不同尺度的特征,充分感知语义信息;同时在编码器和解码器之间的跳连部分引入了改进的自注意力机制,使得网络更好地关注不同语义特征的相关性;最后,采用融合Dice损失函数,Focal损失函数,交叉熵损失函数构建的多级损失函数,更好地约束训练。通过训练公开的数据集,得到分割结果表明Dice相似系数、精确率(Precision)、召回率(Recall)分别为75.37%、77.03%、71.87%,优于其他的模型。我们验证了改进的网络能够在图像分割过程中提升网络性能的可能性。
Abstract: Computed tomography (CT) is currently an effective means to assist in the detection of new coro-nary pneumonia, but the complexity of pathological representations brings inconvenience to doc-tors when diagnosing lesions, and it is difficult to accurately segment the images. In order to further assist doctors in diagnosis based on pathological representations, we propose a multi-scale fusion coding network based on U-net, combined with the self-attention mechanism, trying to provide a feasible solution from the perspective of assisting the doctor’s judgment. In order to obtain seman-tic information of different scales, firstly, a multi-scale fusion encoder module is designed in the en-coder part to extract features of different scales and fully perceive the semantic information; at the same time, a skip connection between the encoder and decoder is introduced. The improved self-attention mechanism makes the network pay more attention to the correlation of different se-mantic features; finally, the multi-level loss function of the fusion Dice loss function, Focal loss func-tion, and cross-entropy loss function is used to better constrain training. Through training the pub-lic data set, the results show that the Dice similarity coefficient, precision rate and recall rate are 75.37%, 77.03%, and 71.87%, respectively, which are better than other existing network models. We have verified the feasibility of the improved network being able to improve network perfor-mance during image segmentation.
文章引用:彭佑菊, 徐杨, 蒋诗怡, 熊举举. 多尺度融合编码与自注意力的肺部CT分割算法[J]. 建模与仿真, 2023, 12(5): 4616-4630. https://doi.org/10.12677/MOS.2023.125421

参考文献

[1] Thompson, R. (2020) Pandemic Potential of 2019-nCoV. The Lancet Infectious Diseases, 20, 280. [Google Scholar] [CrossRef
[2] 唐江平, 周晓飞, 贺鑫, 等. 基于深度学习的新型冠状病毒肺炎诊断研究综述[J]. 计算机工程, 2021, 47(5): 1-15.
[3] Zhang, Z., Romero, A., Muckley, M.J., et al. (2019) Reducing Uncertainty in Undersampled MRI Reconstruction with Active Acquisition. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 2049-2058. [Google Scholar] [CrossRef
[4] Rubin, G.D., Ryerson, C.J., Haramati, L.B., et al. (2020) The Role of Chest Imaging in Patient Management during the COVID-19 Pandemic: A Multinational Consensus Statement from the Fleischner Society. Radiology, 296, 172-180. [Google Scholar] [CrossRef] [PubMed]
[5] Ai, T., Yang, Z., Hou, H., et al. (2020) Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology, 296, E32-E40. [Google Scholar] [CrossRef] [PubMed]
[6] Girshick, R., Donahue, J., Darrell, T., et al. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587. [Google Scholar] [CrossRef
[7] Long, J., Shelhamer, E. and Darrell, T. (2015) Fully Convolutional Networks for Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 3431-3440. [Google Scholar] [CrossRef
[8] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-net: Con-volutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W.M. and Frangi, A.F., Eds., International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Cham, 234-241. [Google Scholar] [CrossRef
[9] Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., et al. (2018) Unet++: A Nested U-Net Architecture for Medical Image Segmentation. In: Stoyanov, D., et al., Eds., Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer, Cham, 3-11. [Google Scholar] [CrossRef] [PubMed]
[10] Oktay, O., Schlemper, J., Folgoc, L.L., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas.
[11] Alom, M.Z., Hasan, M., Yakopcic, C., et al. (2018) Recurrent Residual Convolutional Neural Network Based on U-Net (R2U-Net) for Medical Image Segmentation.
[12] Fan, D.P., Zhou, T., Ji, G.P., et al. (2020) Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images. IEEE Transactions on Medical Imaging, 39, 2626-2637. [Google Scholar] [CrossRef
[13] 钱宝鑫, 肖志勇, 宋威. 改进的卷积神经网络在肺部图像上的分割应用[J]. 计算机科学与探索, 2020, 14(8): 1358-1367.
[14] 王磐, 强彦, 杨晓棠, 等. 基于双注意力3D-UNet的肺结节分割网络模型[J]. 计算机工程, 2021, 47(2): 307-313.
[15] 张桃红, 郭徐徐, 张颖. LRSAR-Net语义分割模型用于肺炎CT图片辅助诊断[J]. 电子与信息学报, 2022, 44(1): 48-58.
[16] Qiu, Y., Liu, Y., Li, S., et al. (2020) Miniseg: An Extremely Minimum Network for Efficient COVID-19 Segmentation.
[17] 郭宇, 李瑞冰, 刘莎, 等. 基于机器学习的肺癌图像辅助诊断应用研究[J]. 中国医学装备, 2021, 18(3): 124-128.
[18] 杨莉, 万旺根. 基于预训练-微调策略的肺炎预测模型[J]. 计算机工程, 2022, 48(3): 17-22.
[19] Raj, A.N.J., Zhu, H., Khan, A., et al. (2021) ADID-UNET—A Segmentation Model for COVID-19 Infection from Lung CT Scans. PeerJ Computer Science, 7, e349. [Google Scholar] [CrossRef] [PubMed]
[20] He, K., Zhang, X., Ren, S., et al. (2016) Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[21] 王雪, 李占山, 吕颖达. 基于多尺度感知和语义适配的医学图像分割算法[J]. 吉林大学学报(工学版), 2022, 52(3): 640-647.
[22] Wang, X., Girshick, R., Gupta, A., et al. (2018) Non-Local Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7794-7803. [Google Scholar] [CrossRef
[23] Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[24] 刘锐, 丁辉, 尚媛园, 等. 肺炎医学影像数据集及研究进展[J]. 计算机工程与应用, 2021, 57(22): 15-27.
[25] Ma, J., Wang, Y., An, X., et al. (2021) Toward Data-Efficient Learning: A Benchmark for COVID-19 CT Lung and Infection Segmentation. Medical Physics, 48, 1197-1210. [Google Scholar] [CrossRef] [PubMed]
[26] Badrinarayanan, V., Kendall, A. and Cipolla, R. (2017) Segnet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Transactions on Pattern Analysis and Ma-chine Intelligence, 39, 2481-2495. [Google Scholar] [CrossRef
[27] Xu, X., Wen, Y., Zhao, L., et al. (2021) CARes-UNet: Con-tent-Aware Residual UNet for Lesion Segmentation of COVID-19 from Chest CT Images. Medical Physics, 48, 7127-7140. [Google Scholar] [CrossRef] [PubMed]