基于频率特征增强的结直肠息肉分割模型
Colon Polyp Segmentation Model Based on Frequency Feature Enhancement
DOI: 10.12677/mos.2024.133327, PDF,   
作者: 刘峻昊, 瑚 琦:上海理工大学光电信息与计算机工程学院,上海
关键词: 结直肠癌息肉分割深度学习傅里叶变换Colorectal Cancer Polyp Segmentation Deep Learning Fourier Transform
摘要: 早期息肉检查是防范结直肠癌发病的重要手段,针对现有基于深度学习方法依旧不能准确辨别息肉位置和边缘信息的问题,提出了一种利用傅里叶变换增强频率特征(FFENet)的息肉分割方法。具体地,在FFENet中设计了一个细节特征增强注意力模块和一个全局频率特征学习模块,前者重耦合不同深度的特征并计算三种显著性特征图来细化息肉区域及其边缘;后者在频域中引入可学习的滤波核,以增强息肉与其边缘间的连贯性并捕捉图像像素之间的长距离依赖关系。结合改善的部分解码器和自适应特征选择模块,大量实验结果表明所提出FFENet在五类息肉数据上更具优势。尤其是在ETIS数据集上,对比其他最先进的模型,大模型版本FFENet-L在Dice和IoU指标上分别提升了4%和5.5%,而小模型版本FFENet-S在保持精度相当的同时,仅仅使用了6.2M参数。
Abstract: Colorectal cancer mainly originates from mutated polyp tissues, so early polyp examination can effectively reduce the incidence of colorectal cancer. The ability to automatically and accurately assist doctors in screening polyps is of great significance in the clinical diagnosis of colorectal cancer. However, existing deep learning-based methods cannot fully address the challenges posed by the color, brightness, and complexity of polyps in polyp images, making it difficult to accurately distinguish the location and edge information of polyps. Considering these problems, we innovatively utilize the Fourier Transform in traditional digital image processing to propose a frequency feature enhancement network (FFENet) for polyp segmentation. In FFENet, a detail feature enhancement attention module and a global frequency feature learning module are presented based on the frequency features. DFEA recouples features from different depths and calculates three types of saliency feature maps to refine the polyp region and polyp boundary. GFFLM aims to enhance the coherence of the polyp body and boundary by incorporating a learnable filtering kernel in the frequency domain to capture the long-range relationships between image pixels. Combined with an enhanced partial decoder and an adaptive feature selection module, our method excels in small polyp segmentation and polyp segmentation in complex environments. Extensive experiments are conducted on five public datasets and demonstrate the superiority of our proposed method for polyp segmentation compared with several state-of-the-art methods. Especially on the ETIS dataset, compared to other SOTA models, our large model version FFENet-L has improved by 4% and 5.5% in terms of Dice and IoU metrics respectively, while our small model version FFENet-S has only used 6.2M parameters while maintaining similar accuracy.
文章引用:刘峻昊, 瑚琦. 基于频率特征增强的结直肠息肉分割模型[J]. 建模与仿真, 2024, 13(3): 3593-3606. https://doi.org/10.12677/mos.2024.133327

参考文献

[1] Sawicki, T., Ruszkowska, M., Danielewicz, A., Niedźwiedzka, E., Arłukowicz, T. and Przybyłowicz, K.E. (2021) A Review of Colorectal Cancer in Terms of Epidemiology, Risk Factors, Development, Symptoms and Diagnosis. Cancers, 13, Article No. 2025. [Google Scholar] [CrossRef] [PubMed]
[2] Simon, K. (2016) Colorectal Cancer Development and Advances in Screening. Clinical Interventions in Aging, 11, 967-976. [Google Scholar] [CrossRef
[3] Muhammad, Z.-U.-D., Huang, Z., Gu, N. and Muhammad, U. (2022) DCANet: Deep Context Attention Network for Automatic Polyp Segmentation. Visual Computer, 39, 5513-5525. [Google Scholar] [CrossRef
[4] Zimmermann-Fraedrich, K., et al. (2019) Right-Sided Location Not Associated with Missed Colorectal Adenomas in an Individual-Level Reanalysis of Tandem Colonoscopy Studies. Gastroenterology, 157, 660-671. [Google Scholar] [CrossRef] [PubMed]
[5] Djinbachian, R., et al. (2023) Comparing Size Measurement of Colorectal Polyps Using a Novel Virtual Scale Endoscope, Endoscopic Ruler or Forceps: A Preclinical Randomized Trial. Endoscopy International Open, 11, E128-E135. [Google Scholar] [CrossRef] [PubMed]
[6] Lou, A., Guan, S. and Loew, M.H. (2023) Caranet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects. Journal of Medical Imaging, 10, Article ID: 014005. [Google Scholar] [CrossRef
[7] Zhang, R., Li, G., Li, Z., Cui, S., Qian, D. and Yu, Y. (2020) Adaptive Context Selection for Polyp Segmentation. Medical Image Computing and Computer Assisted InterventionMICCAI 2020, Lima, 4-8 October 2020, 253-262. [Google Scholar] [CrossRef
[8] Kim, T., Lee, H. and Kim, D. (2021) UACANet: Uncertainty Augmented Context Attention for Polyp Segmentation. Proceedings of the 29th ACM International Conference on Multimedia (MM ‘21), 20-24 October 2021, 2167-2175. [Google Scholar] [CrossRef
[9] Zhu, J., Ge, M., Chang, Z. and Dong, W. (2023) CRCNet: Global-Local Context and Multi-Modality Cross Attention for Polyp Segmentation. Biomedical Signal Processing and Control, 83, Article ID: 104593. [Google Scholar] [CrossRef
[10] Fan, D.-P., Ji, G.-P., Zhou, T., Chen, G., Fu, H., Shen, J. and Shao, L. (2020) PraNet: Parallel Reverse Attention Network for Polyp Segmentation. Medical Image Computing and Computer Assisted InterventionMICCAI 2020, Lima, 4-8 October 2020, 263-273. [Google Scholar] [CrossRef
[11] Song, P., Li, J. and Fan, H. (2022) Attention Based Multi-Scale Parallel Network for Polyp Segmentation. Computers in Biology and Medicine, 146, Article ID: 105476. [Google Scholar] [CrossRef] [PubMed]
[12] Shan, L., Li, X. and Wang, W. (2021) Decouple the High-Frequency and Low-Frequency Information of Images for Semantic Segmentation. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto, 6-11 June 2021, 1805-1809. [Google Scholar] [CrossRef
[13] Su, Y., et al. (2023) FeDNet: Feature Decoupled Network for Polyp Segmentation from Endoscopy Images. Biomedical Signal Processing and Control, 83, Article ID: 104699. [Google Scholar] [CrossRef
[14] Tang, X., Peng, J., Zhong, B., Li, J. and Yan, Z. (2021) Introducing Frequency Representation into Convolution Neural Networks for Medical Image Segmentation via Twin-Kernel Fourier Convolution. Computer Methods and Programs in Biomedicine, 205, Article ID: 106110. [Google Scholar] [CrossRef] [PubMed]
[15] Chi, L., Jiang, B. and Mu, Y. (2020) Fast Fourier Convolution. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, 6-12 December 2020, 4479-4488.
[16] Li, C., Li, L., Geng, Y., Jiang, H., Cheng, M., Zhang, B., Ke, Z., Xu, X. and Chu, X. (2023) YOLOv6 V3.0: A Full-Scale Reloading.
[17] Ouyang, D., He, S., Zhan, J., Guo, H., Huang, Z., Luo, M. and Zhang, G. (2023) Efficient Multi-Scale Attention Module with Cross-Spatial Learning. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 4-10 June 2023, 1-5. [Google Scholar] [CrossRef
[18] Bernal, J., Sánchez, F.J., Fernández-Esparrach, G., Gil, D., Rodríguez, C. and Vilariño, F. (2015) WM-DOVA Maps for Accurate Polyp Highlighting in Colonoscopy: Validation vs. Saliency Maps from Physicians. Computerized Medical Imaging and Graphics, 43, 99-111. [Google Scholar] [CrossRef] [PubMed]
[19] Bernal, J., Sánchez, F.J. and Vilariño, F. (2012) Towards Automatic Polyp Detection with a Polyp Appearance Model. Pattern Recognition, 45, 3166-3182. [Google Scholar] [CrossRef
[20] Vázquez, D., et al. (2017) A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images. Journal of Healthcare Engineering, 2017, Article ID: 4037190. [Google Scholar] [CrossRef] [PubMed]
[21] Silva, J., Histace, A., Romain, O., Dray, X. and Granado, B. (2014) Toward Embedded Detection of Polyps in WCE Images for Early Diagnosis of Colorectal Cancer. International Journal of Computer Assisted Radiology and Surgery, 9, 283-293. [Google Scholar] [CrossRef] [PubMed]
[22] Jha, D., Smedsrud, P.H., Riegler, M., Halvorsen, P., De Lange, T., Johansen, D. and Johansen, H.D. (2019) Kvasir-SEG: A Segmented Polyp Dataset. Conference on Multimedia Modeling, Daejeon, 5-8 January 2020, 451-462. [Google Scholar] [CrossRef
[23] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. [Google Scholar] [CrossRef
[24] Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N. and Liang, J. (2018) UNet : A Nested U-Net Architecture for Medical Image Segmentation. In: Stoyanov, D., et al., Eds., Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer, Berlin, 3-11. [Google Scholar] [CrossRef] [PubMed]
[25] Jha, D., Smedsrud, P.H., Riegler, M., Johansen, D., De Lange, T., Halvorsen, P. and Johansen, H.D. (2019) ResUNet : An Advanced Architecture for Medical Image Segmentation. 2019 IEEE International Symposium on Multimedia (ISM), San Diego, 9-11 December 2019, 225-2255. [Google Scholar] [CrossRef
[26] Fang, Y.Q., Chen, C., Yuan, Y.X. and Tong, K.-Y. (2019) Selective Feature Aggregation Network with Area-Boundary Constraints for Polyp Segmentation. In: Medical Image Computing and Computer Assisted Intervention MICCAI 2019, Springer-Verlag, Berlin, 302-310. [Google Scholar] [CrossRef
[27] Wei, J., Hu, Y., Zhang, R., Li, Z., Zhou, S. and Cui, S. (2021) Shallow Attention Network for Polyp Segmentation. [Google Scholar] [CrossRef
[28] Zhao, X., Zhang, L. and Lu, H. (2021) Automatic Polyp Segmentation via Multi-Scale Subtraction Network. International Conference on Medical Image Computing and Computer-Assisted Intervention, Strasbourg, 27 September-1 October 2021, 120-130. [Google Scholar] [CrossRef
[29] Zhao, X., Jia, H., Pang, Y., Lv, L., Tian, F., Zhang, L., Sun, W. and Lu, H. (2023) M2SNet: Multi-Scale in Multi-Scale Subtraction Network for Medical Image Segmentation.
[30] Zhang, R., Lai, P., Wan, X., Fan, D., Gao, F., Wu, X. and Li, G. (2023) Lesion-Aware Dynamic Kernel for Polyp Segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Singapore, 18-22 September 2022, 99-109. [Google Scholar] [CrossRef
[31] Shi, J., Zhang, Q., Tang, Y. and Zhang, Z. (2023) Polyp-Mixer: An Efficient Context-Aware MLP-Based Paradigm for Polyp Segmentation. IEEE Transactions on Circuits and Systems for Video Technology, 33, 30-42. [Google Scholar] [CrossRef
[32] Zhou, T., Zhou, Y., He, K., Gong, C., Yang, J., Fu, H. and Shen, D. (2023) Cross-Level Feature Aggregation Network for Polyp Segmentation. Pattern Recognition, 140, Article ID: 109555. [Google Scholar] [CrossRef
[33] Muhammad, Z., Usman, M., Huang, Z. and Gu, N. (2024) MMFIL-Net: Multi-Level and Multi-Source Feature Interactive Lightweight Network for Polyp Segmentation. Displays, 81, Article ID: 102600. [Google Scholar] [CrossRef