CF-ResNet：融合注意力与多尺度特征的糖尿病视网膜病变分级诊断模型研究

doi:10.12677/csa.2026.163082

期刊菜单

CF-ResNet：融合注意力与多尺度特征的糖尿病视网膜病变分级诊断模型研究
CF-ResNet: Research on Grading Diagnosis Model of Diabetic Retinopathy Fused with Attention and Multi-Scale Features

DOI: 10.12677/csa.2026.163082, PDF,
作者: 刘新雨, 陈俊, 刘俊男, 姚乙铮, 王梓成, 崔旭旭^*：辽宁科技大学计算机与软件工程学院，辽宁鞍山
关键词: 糖尿病视网膜病变；深度学习；注意力机制；数据增强；损失函数优化；Diabetic Retinopathy； Deep Learning； Attention Mechanism； Data Augmentation； Loss Function Optimization

摘要: 针对糖尿病视网膜病变(DR)自动诊断中存在的病灶特征识别不精准、数据分布不均衡及模型泛化能力不足等问题，我们提出一种基于ResNet50的改进模型CF-ResNet。以Kaggle公开眼底图像数据集为研究对象，通过多维度优化策略提升模型诊断性能：引入CBAM注意力机制强化对微小病灶的特征聚焦能力；采用Focal Loss损失函数缓解数据类别不平衡带来的训练偏差；结合多种针对性数据增强方法扩充有效样本并提升模型鲁棒性；新增多尺度特征融合模块(MSFM)适配不同尺寸病变的特征提取需求。实验结果表明，CF-ResNet模型在测试集上的准确率达90.3%、召回率为90.6%、特异性为92.3%、F1分数为90.4%，各项指标均优于原始ResNet50及主流对比模型。消融实验验证了各改进模块的有效性，模型在普通设备上单张图像推理耗时仅0.06秒，具备临床辅助诊断与大规模筛查的应用潜力。

Abstract: Aiming at the problems of inaccurate lesion feature recognition, unbalanced data distribution and insufficient model generalization ability in the automatic diagnosis of Diabetic Retinopathy (DR), an improved model CF-ResNet based on ResNet50 is proposed. Taking the public Kaggle fundus image dataset as the research object, the diagnostic performance of the model is improved through multi-dimensional optimization strategies: introducing the CBAM attention mechanism to strengthen the feature focusing ability on micro lesions; adopting the Focal Loss function to alleviate the training bias caused by unbalanced data categories; combining a variety of targeted data augmentation methods to expand effective samples and improve model robustness; adding a Multi-Scale Feature Fusion Module (MSFM) to adapt to the feature extraction of lesions of different sizes. Experimental results show that the CF-ResNet model achieves an accuracy of 90.3%, a recall rate of 90.6%, a specificity of 92.3%, and an F1 score of 90.4% on the test set, and all indicators are superior to the original ResNet50 and mainstream comparison models. Ablation experiments verify the effectiveness of each improved module, and the average inference time of the model for a single image on ordinary equipment is only 0.06 seconds, which has the potential for clinical auxiliary diagnosis and large-scale screening.

文章引用：刘新雨, 陈俊, 刘俊男, 姚乙铮, 王梓成, 崔旭旭. CF-ResNet：融合注意力与多尺度特征的糖尿病视网膜病变分级诊断模型研究[J]. 计算机科学与应用, 2026, 16(3): 1-10. https://doi.org/10.12677/csa.2026.163082

参考文献

[1]	Ebrahimi, B., Le, D., Abtahi, M., Dadzie, A.K., Lim, J.I. and Yao, X. (2023) OCTA Layer Information Fusion for Deep Learning Classification of Diabetic Retinopathy. Investigative Ophthalmology & Visual Science, 64, 275-275.
[2]	Zong, X., Liang, B., Qin, Y., Ding, X. and Wang, W. (2026) Weakly Supervised Object Detection Network for Diabetic Retinopathy. Medical Physics, 53, e70264. [Google Scholar] [CrossRef]
[3]	Raja, D.S.S., Kumarganesh, S., Sagayam, K.M. and Dang, H. (2026) Diabetic Retinopathy Detection and Grading System Using Deep Learning Approach. Digital Health, 12, Article 20552076251410982. [Google Scholar] [CrossRef]
[4]	Kirubakaran, M. and Vijayarajan, V. (2026) Wavemem-Shapnet: A Transparent Deep Learning Approach to Early Diagnosis of Diabetic Retinopathy. SN Computer Science, 7, Article No. 73. [Google Scholar] [CrossRef]
[5]	Xia, Z., Xu, J., Tan, J., Gu, K., Shen, Y. and Li, W. (2026) Dvdrvit: Dual-View Diabetic Retinopathy Grading Based on Vit Interactive Attention Network. Physica Scripta, 101, Article 025001. [Google Scholar] [CrossRef]
[6]	Khurshid, M., Chiranjeev, C., Singh, R. and Vatsa, M. (2026) Classifying Retinal Images via Vascular-Optic Disc Cross-Segmentation and Attentive Feature Selection. Scientific Reports, 16, Article No. 2398. [Google Scholar] [CrossRef]
[7]	Chitradevi, B., Mathiyalagan, P., Ramachandran, A., Dhanapal, R., Sheikdavood, K. and Gnanamurugan, S. (2026) Conv-Vit: An Improved Discrete Convolution-Based Vision Transformer for Diabetic Retinopathy Detection. Franklin Open, 14, Article 100477. [Google Scholar] [CrossRef]
[8]	Ahmad, I., Singh, V.P. and Gore, M.M. (2026) NGCF-RVFL: Next Generation Convolutional Feature with Random Vector Functional Link for Multi-Grade Diabetic Retinopathy Detection. Computers and Electrical Engineering, 131, Article 110972. [Google Scholar] [CrossRef]
[9]	Kumar, N.A., Madhusudan, D., Ioannou, I., Ghantasala, G.S.P. and Vassiliou, V. (2026) A Self-Supervised Hybrid CNN with Uncertainty-Aware Referral for Diabetic Retinopathy Screening. Biomedical Signal Processing and Control, 116, Article 109482. [Google Scholar] [CrossRef]
[10]	Kamal, E.S. and Sharmin, N. (2026) Retinal Vessel Segmentation Using a Swin Transformer-Based Encoder-Decoder Architecture. Signal, Image and Video Processing, 20, Article No. 27. [Google Scholar] [CrossRef]
[11]	Li, T., Gao, Y., Wang, K., Guo, S., Liu, H. and Kang, H. (2019) Diagnostic Assessment of Deep Learning Algorithms for Diabetic Retinopathy Screening. Information Sciences, 501, 511-522. [Google Scholar] [CrossRef]
[12]	Yu, G., Cao, C., Shu, X. and Yao, L. (2026) Association between Vitamin D Deficiency and the Risk of Diabetic Retinopathy in Patients with Type 2 Diabetes: A Meta‐Analysis. Molecular Genetics & Genomic Medicine, 14, e70157. [Google Scholar] [CrossRef]
[13]	Zhang, J., Hao, J., Chang, D., Zhao, M. and Chen, M. (2026) Associations between Diabetic Retinopathy and Disease Severity of Diabetic Nephropathy in Patients with Type 2 Diabetes. Journal of Diabetes and its Complications, 40, Article 109256. [Google Scholar] [CrossRef]

为你推荐

友情链接