SC-SENet:一种基于阴道镜图像的宫颈上皮内瘤变分类诊断模型
SC-SENet: A Classification Diagnosis Model for Cervical Intraepithelial Neoplasia Based on Colposcopic Images
DOI: 10.12677/mos.2025.144264, PDF,   
作者: 徐志扬, 谷雪莲*, 邹任玲, 楚胜轩, 方庆斌:上海理工大学健康科学与工程学院,上海;管 睿:海军军医大学第一附属医院长海医院妇产科,上海
关键词: 深度学习宫颈上皮内瘤变阴道镜图像卷积神经网络Swin TransformerDeep Learning Cervical Intraepithelial Neoplasia Colposcopic Images CNN Swin Transformer
摘要: 宫颈上皮内瘤变(Cervical Intraepithelial Neoplasia, CIN)是宫颈浸润癌的癌前病变阶段。为了提高诊断的效率与准确性,本研究使用深度学习技术对阴道镜图像进行三分类识别,搭建了一个结合了Swin Transformer与卷积神经网络(CNN)的分类诊断模型Swin-Conv Squeeze-and-Excitation Network (SC-SENet)。该模型包括Swin Transformer分支与结合了通道注意力机制的卷积分支,通过并联的方式结合分别用于图像全局特征与局部特征的提取,结合两种特征实现病灶区域信息的特征识别以提高诊断准确度。本研究使用临床采集的阴道镜图像作为原始数据集,SC-SENet模型在CIN三分类诊断任务的总体准确率为90.81%。实验结果表明,本研究提出SC-SENet模型能有效诊断宫颈上皮内瘤变的病变等级。
Abstract: Cervical Intraepithelial Neoplasia (CIN) is a precancerous stage of invasive cervical cancer. In order to improve the efficiency and accuracy of diagnosis, this study used deep learning technology to classify colposcopy images into three categories. In this paper, a classification diagnosis model Swin-Conv Squeeze-and-Excitation Network (SC-SENet) combining Swin Transformer and Convolutional neural Network (CNN) is built. The model includes the Swin Transformer branch and the convolution branch combined with the channel attention mechanism, which are combined in parallel to extract the global features and local features of the image respectively. The two features are combined to realize the feature recognition of the lesion area information to improve the diagnostic accuracy. In this study, clinically collected colposcopy images were used as the original data set, and the overall accuracy of SC-SENet model in CIN three-classification diagnosis task was 90.81%. The experimental results show that the SC-SENet model proposed in this study can effectively diagnose the lesion grade of cervical intraepithelial neoplasia.
文章引用:徐志扬, 谷雪莲, 管睿, 邹任玲, 楚胜轩, 方庆斌. SC-SENet:一种基于阴道镜图像的宫颈上皮内瘤变分类诊断模型[J]. 建模与仿真, 2025, 14(4): 50-60. https://doi.org/10.12677/mos.2025.144264

参考文献

[1] Mattiuzzi, C. and Lippi, G. (2019) Cancer Statistics: A Comparison between World Health Organization (WHO) and Global Burden of Disease (GBD). European Journal of Public Health, 30, 1026-1027. [Google Scholar] [CrossRef] [PubMed]
[2] Rowland, A.G. and Brady, J. (2020) Colposcopy and Cervical Intraepithelial Neoplasia. Obstetrics, Gynaecology & Reproductive Medicine, 30, 133-138. [Google Scholar] [CrossRef
[3] Waghe, T. and Acharya, N. (2024) Advancements in the Management of Cervical Intraepithelial Neoplasia: A Comprehensive Review. Cureus, 16, e58645. [Google Scholar] [CrossRef] [PubMed]
[4] Xue, P., Ng, M.T.A. and Qiao, Y. (2020) The Challenges of Colposcopy for Cervical Cancer Screening in LMICs and Solutions by Artificial Intelligence. BMC Medicine, 18, Article No. 169. [Google Scholar] [CrossRef] [PubMed]
[5] Dong, S., Wang, P. and Abbas, K. (2021) A Survey on Deep Learning and Its Applications. Computer Science Review, 40, Article 100379. [Google Scholar] [CrossRef
[6] Kather, J.N., Weis, C., Bianconi, F., Melchers, S.M., Schad, L.R., Gaiser, T., et al. (2016) Multi-Class Texture Analysis in Colorectal Cancer Histology. Scientific Reports, 6, Article No. 27988. [Google Scholar] [CrossRef] [PubMed]
[7] Lundervold, A.S. and Lundervold, A. (2019) An Overview of Deep Learning in Medical Imaging Focusing on MRI. Zeitschrift für Medizinische Physik, 29, 102-127. [Google Scholar] [CrossRef] [PubMed]
[8] Nogales, A., García-Tejedor, Á.J., Monge, D., Vara, J.S. and Antón, C. (2021) A Survey of Deep Learning Models in Medical Therapeutic Areas. Artificial Intelligence in Medicine, 112, Article 102020. [Google Scholar] [CrossRef] [PubMed]
[9] Zhang, T., Luo, Y., Li, P., Liu, P., Du, Y., Sun, P., et al. (2020) Cervical Precancerous Lesions Classification Using Pre-Trained Densely Connected Convolutional Networks with Colposcopy Images. Biomedical Signal Processing and Control, 55, Article 101566. [Google Scholar] [CrossRef
[10] Miyagi, Y., Takehara, K., Nagayasu, Y. and Miyake, T. (2019) Application of Deep Learning to the Classification of Uterine Cervical Squamous Epithelial Lesion from Colposcopy Images Combined with HPV Types. Oncology Letters, 75, Article 103589. [Google Scholar] [CrossRef] [PubMed]
[11] Saini, S.K., Bansal, V., Kaur, R. and Juneja, M. (2020) Colponet for Automated Cervical Cancer Screening Using Colposcopy Images. Machine Vision and Applications, 31, Article No. 15. [Google Scholar] [CrossRef
[12] Liu, L., Wang, Y., Liu, X., Han, S., Jia, L., Meng, L., et al. (2021) Computer-aided Diagnostic System Based on Deep Learning for Classifying Colposcopy Images. Annals of Translational Medicine, 9, 1045-1045. [Google Scholar] [CrossRef] [PubMed]
[13] Chen, J., Li, P., Xu, T., Xue, H., Wang, X., Li, Y., et al. (2022) Detection of Cervical Lesions in Colposcopic Images Based on the Retinanet Method. Biomedical Signal Processing and Control, 75, Article 103589. [Google Scholar] [CrossRef
[14] Zhang, X. and Zhao, S. (2018) Cervical Image Classification Based on Image Segmentation Preprocessing and a Capsnet Network Model. International Journal of Imaging Systems and Technology, 29, 19-28. [Google Scholar] [CrossRef
[15] Xu, T., Li, P. and Wang, X. (2021) Cervical Lesions Classification Based on Pre-Trained Mobilenet Model. 2021 IEEE 15th International Conference on Anti-counterfeiting, Security, and Identification (ASID), Xiamen, 29-31 October 2021, 93-96. [Google Scholar] [CrossRef
[16] Moghtaderi, S., Yaghoobian, O., Wahid, K.A. and Lukong, K.E. (2024) Endoscopic Image Enhancement: Wavelet Transform and Guided Filter Decomposition-Based Fusion Approach. Journal of Imaging, 10, Article 28. [Google Scholar] [CrossRef] [PubMed]
[17] Al-Stouhi, S. and Reddy, C.K. (2015) Transfer Learning for Class Imbalance Problems with Inadequate Data. Knowledge and Information Systems, 48, 201-228. [Google Scholar] [CrossRef] [PubMed]
[18] Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., et al. (2021). Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 9992-10002. [CrossRef
[19] Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I. and Salakhutdinov, R.R. (2012) Improving Neural Networks by Preventing Co-Adaptation of Feature Detectors. [Google Scholar] [CrossRef
[20] Srivastava, N. (2013) Improving Neural Networks with Dropout.
https://api.semanticscholar.org/CorpusID:17084851
[21] Hu, J., Shen, L. and Sun, G. (2018) Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7132-7141. [Google Scholar] [CrossRef
[22] Lin, T., Goyal, P., Girshick, R., He, K. and Dollar, P. (2020) Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 318-327. [Google Scholar] [CrossRef] [PubMed]
[23] Mao, A., Mohri, M. and Zhong, Y. (2023) Cross-Entropy Loss Functions: Theoretical Analysis and Applications. [Google Scholar] [CrossRef
[24] Smith, L.N. (2017) Cyclical Learning Rates for Training Neural Networks. 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, 24-31 March 2017, 464-472. [Google Scholar] [CrossRef
[25] Fawcett, T. (2006) An Introduction to ROC Analysis. Pattern Recognition Letters, 27, 861-874. [Google Scholar] [CrossRef
[26] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[27] Simonyan, K. and Zisserman, A. (2015) Very Deep Convolutional Networks for Large-Scale Image Recognition.
http://arxiv.org/abs/1409.1556
[28] Xie, S., Girshick, R., Dollar, P., Tu, Z. and He, K. (2017) Aggregated Residual Transformations for Deep Neural Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 5987-5995. [Google Scholar] [CrossRef
[29] Huang, G., Liu, Z., Van Der Maaten, L. and Weinberger, K.Q. (2017) Densely Connected Convolutional Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 2261-2269. [Google Scholar] [CrossRef
[30] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., et al. (2021) An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. [Google Scholar] [CrossRef