基于ViT与ResNet-18的深度学习模型在肺结节良恶性识别中的性能比较
Comparing ViT and ResNet-18 Deep Learning Models for Pulmonary Nodule Benign-Malignant Classification
DOI: 10.12677/acm.2026.161213, PDF,   
作者: 周世轩:石河子大学第一附属医院影像中心,新疆 石河子
关键词: 肺结节深度学习CT预测模型人工智能Pulmonary Nodules Deep Learning CT Predictive Model Artificial Intelligence
摘要: 目的:比较基于局部纹理特征与全局上下文信息的深度学习模型在肺结节良恶性鉴别中的性能。方法:纳入2023年6月至2025年7月在石河子大学第一附属医院经手术及病理证实的134例肺结节患者(良性51例,恶性83例),共收集1172张含肺结节的CT切片,按7:3比例划分为训练集和内部测试集。训练集用于训练关注局部纹理的ResNet-18模型及关注全局信息的Vision Transformer (ViT)模型。通过受试者工作特征曲线下面积(AUC)和决策曲线分析(DCA)评估不同模型的诊断性能。结果:ViT模型在训练集与测试集上均表现更优,其训练集AUC为0.977,准确率为0.978;测试集AUC为0.901,准确率为0.878,均高于ResNet-18模型(训练集AUC 0.959,准确率0.930;测试集AUC 0.880,准确率0.812)。DCA进一步表明ViT模型具有更高的临床净获益。结论:在肺结节良恶性鉴别中,对全局上下文信息的建模能力优于对局部纹理的分析,是提升鉴别性能的关键。
Abstract: Objective: This paper aims to compare the performance of deep learning models focusing on local texture features versus global contextual information in differentiating benign and malignant pulmonary nodules. Methods: A total of 134 patients with pathologically confirmed pulmonary nodules, treated between June 2023 and July 2025 at the First Affiliated Hospital of Shihezi University (51 benign, 83 malignant), were included. A total of 1172 CT slices containing pulmonary nodules were collected and split into a training set and an internal test set at a 7:3 ratio. The training set was used to train a ResNet-18 model focusing on local texture features and a Vision Transformer (ViT) model emphasizing global contextual information. Model performance was evaluated using the area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA). Results: The ViT model outperformed ResNet-18 on both the training and test sets. In the training set, the ViT achieved an AUC of 0.977 and an accuracy of 0.978, while in the test set, it reached an AUC of 0.901 and an accuracy of 0.878, all higher than those of ResNet-18 (training set AUC 0.959, accuracy 0.930; test set AUC 0.880, accuracy 0.812). DCA further indicated that the ViT model provided greater clinical net benefit. Conclusion: In differentiating benign and malignant pulmonary nodules, modeling global contextual information is superior to analyzing local texture features, representing a key factor in improving diagnostic performance.
文章引用:周世轩. 基于ViT与ResNet-18的深度学习模型在肺结节良恶性识别中的性能比较[J]. 临床医学进展, 2026, 16(1): 1672-1680. https://doi.org/10.12677/acm.2026.161213

参考文献

[1] Lu, H. and Jiang, Z. (2018) Advances in Antibody Therapeutics Targeting Small-Cell Lung Cancer. Advances in Clinical and Experimental Medicine, 27, 1317-1323. [Google Scholar] [CrossRef] [PubMed]
[2] Thawani, R., McLane, M., Beig, N., Ghose, S., Prasanna, P., Velcheti, V., et al. (2018) Radiomics and Radiogenomics in Lung Cancer: A Review for the Clinician. Lung Cancer, 115, 34-41. [Google Scholar] [CrossRef] [PubMed]
[3] Siegel, R.L., Miller, K.D. and Jemal, A. (2020) Cancer Statistics, 2020. CA: A Cancer Journal for Clinicians, 70, 7-30. [Google Scholar] [CrossRef] [PubMed]
[4] Aberle, D.R., Adams, A.M., Berg, C.D., et al. (2011) Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. New England Journal of Medicine, 365, 395-409. [Google Scholar] [CrossRef] [PubMed]
[5] Swensen, S.J., Jett, J.R., Hartman, T.E., Midthun, D.E., Sloan, J.A., Sykes, A., et al. (2003) Lung Cancer Screening with CT: Mayo Clinic Experience. Radiology, 226, 756-761. [Google Scholar] [CrossRef] [PubMed]
[6] 刘芯宇, 兰箭. 实性肺结节的良恶性分析[J]. 临床医学进展, 2025, 15(2): 861-866.
[7] Ather, S., Kadir, T. and Gleeson, F. (2020) Artificial Intelligence and Radiomics in Pulmonary Nodule Management: Current Status and Future Applications. Clinical Radiology, 75, 13-19. [Google Scholar] [CrossRef] [PubMed]
[8] 高杭宇, 杨阳, 李建英. 人工智能技术在肺结节影像学诊断中的应用: 进展、挑战与展望[J]. 临床个性化医学, 2024, 3(4): 1896-1902.
[9] Wulaningsih, W., Villamaria, C., Akram, A., Benemile, J., Croce, F. and Watkins, J. (2024) Deep Learning Models for Predicting Malignancy Risk in CT-Detected Pulmonary Nodules: A Systematic Review and Meta-Analysis. Lung, 202, 625-636. [Google Scholar] [CrossRef] [PubMed]
[10] Zhang, Y., Feng, W., Wu, Z., Li, W., Tao, L., Liu, X., et al. (2023) Deep-Learning Model of Resnet Combined with CBAM for Malignant-Benign Pulmonary Nodules Classification on Computed Tomography Images. Medicina, 59, Article 1088. [Google Scholar] [CrossRef] [PubMed]
[11] Shen, S., Han, S.X., Aberle, D.R., Bui, A.A. and Hsu, W. (2019) An Interpretable Deep Hierarchical Semantic Convolutional Neural Network for Lung Nodule Malignancy Classification. Expert Systems with Applications, 128, 84-95. [Google Scholar] [CrossRef] [PubMed]
[12] de Margerie-Mellon, C. and Chassagnon, G. (2023) Artificial Intelligence: A Critical Review of Applications for Lung Nodule and Lung Cancer. Diagnostic and Interventional Imaging, 104, 11-17. [Google Scholar] [CrossRef] [PubMed]
[13] 田军伟. 基于深度视觉的医学影像辅助诊断关键技术[D]: [硕士学位论文]. 北京: 北京建筑大学, 2022.
[14] 宁显璞, 夏丽波, 孔维双. 实性肺结节良恶性鉴别诊断的研究进展[J]. 临床肺科杂志, 2025, 30(11): 1761-1765.
[15] Wu, W., Pierce, L.A., Zhang, Y., Pipavath, S.N.J., Randolph, T.W., Lastwika, K.J., et al. (2019) Comparison of Prediction Models with Radiological Semantic Features and Radiomics in Lung Cancer Diagnosis of the Pulmonary Nodules: A Case-Control Study. European Radiology, 29, 6100-6108. [Google Scholar] [CrossRef] [PubMed]
[16] 卢梁杰. CT影像组学和深度学习预测肺实性小结节良恶性的初步研究[D]: [硕士学位论文]. 泸州: 西南医科大学, 2023.
[17] Huang, W., Deng, H., Li, Z., Xiong, Z., Zhou, T., Ge, Y., et al. (2023) Baseline Whole-Lung CT Features Deriving from Deep Learning and Radiomics: Prediction of Benign and Malignant Pulmonary Ground-Glass Nodules. Frontiers in Oncology, 13, Article ID: 1255007. [Google Scholar] [CrossRef] [PubMed]
[18] Zhou, Y., Xu, H., Jiang, W., Zhang, J., Chen, S., Yang, S., et al. (2025) A Fusion Model of Resnet and Vision Transformer for Efficacy Prediction of HIFU Treatment of Uterine Fibroids. Academic Radiology, 32, 7294-7306. [Google Scholar] [CrossRef
[19] 黄栎有, 徐璐. 温林春. 等. CT影像组学模型及深度学习技术预测肺腺癌EGFR突变[J]. 放射学实践, 2022, 37(8): 971-976.
[20] 王蕾, 丁明凤. 融合深度学习和影像组学特征的胰腺囊性肿瘤分类模型研究[J]. 医疗卫生装备, 2025, 46(1): 7-12.
[21] Yu, J., Yang, B., Wang, J., Leader, J., Wilson, D. and Pu, J. (2020) 2D CNN versus 3D CNN for False-Positive Reduction in Lung Cancer Screening. Journal of Medical Imaging, 7, Article 051202. [Google Scholar] [CrossRef] [PubMed]