基于加权共表达网络的肺癌的预后预测模型构建
Construction of Lung Cancer Prognostic Pre-diction Model Based on Weighted Co-Expression Network
摘要: 肺癌是我国恶性肿瘤当中发病率第一,死亡率第一的恶性肿瘤。目前我国肺癌患者的五年平均生存率仍不足20%,对于肺癌患者的诊疗效果仍有待提升。所以有必要进一步研究肺癌患者的预后影响因素,建立预后预测模型,预测患者预后风险及生存情况,可以帮助临床医生更好地判断患者的预后情况,并发现新的疾病相关因素。本文基于机器学习算法以及TCGA数据库中肺癌患者的多组学数据,以探究肺癌患者生存期是否超过五年为目标,利用加权共表达网络算法找到影响肺癌患者预后生存的关键特征基因。通过结合分类算法和加权共表达网络算法来构建预后预测模型,并使用AUC值对模型的分类效果进行评估,最终得到使用基于KNN回归构建的预后模型效果最好,能够较为准确地判断肺癌患者生存时间是否超过5年。
Abstract: Lung cancer is the malignant tumor with the highest incidence and the highest mortality rate in my country. At present, the five-year average survival rate of lung cancer patients in my country is still less than 20%, and the diagnosis and treatment of lung cancer patients still need to be improved. Therefore, it is necessary to further study the prognostic factors of lung cancer patients, and estab-lish a prognostic prediction model to predict the prognostic risk and survival of patients, which can help clinicians better judge the prognosis of patients and discover new disease-related factors. Based on the machine learning algorithm and the multi-omics data of lung cancer patients in the TCGA database, this paper aims to explore whether the survival time of lung cancer patients ex-ceeds five years, and uses the weighted co-expression network algorithm to find the key feature genes that affect the prognosis and survival of lung cancer patients. By combining the classification algorithm and the weighted co-expression network algorithm, the prognosis prediction model is constructed, and the AUC value is used to evaluate the classification effect of the model. Finally, the prognostic model constructed based on KNN regression has the best effect and can more accurately judge the survival of lung cancer patients whether the time is more than 5 years.
文章引用:刘晓东, 宋凯. 基于加权共表达网络的肺癌的预后预测模型构建[J]. 应用数学进展, 2022, 11(6): 4022-4031. https://doi.org/10.12677/AAM.2022.116430

参考文献

[1] Deshmukh, P.R. and Phalnikar, R. (2020) TNM Cncer Stage Detection from Unstructured Pathology Reports of Breast Cancer Patients: Proceeding of International Conference on Computational Science and Applications. Saint Petersburg, 1-4 July 2019, 411-418. [Google Scholar] [CrossRef
[2] Zhu, X., Yao, J. and Huang, J. (2016) Deep Convolutional Neural Network for Survival Analysis with Pathological Images. 2016 IEEE International Confer-ence on Bioinformatics and Biomedicine, Shenzhen, 15-18 December 2016, 544-547. [Google Scholar] [CrossRef
[3] Zhu, X., Yao, J., Luo, X., Xiao, G., Xie, Y., Gazdar, A. and Huang, J. (2016) Lung Cancer Survival Prediction from Pathological Images and Genetic Data—An Integration Study. 2016 IEEE 13th International Symposium on Biomedical Imaging, Prague, 13-16 April 2016, 1173-1176. [Google Scholar] [CrossRef
[4] Yao, J., Wang, S., Zhu, X. and Huang, J. (2016) Imaging Bi-omarker Discovery for Lung Cancer Survival Prediction. International Conference on Medical Image Computing and Computer-Assisted Intervention, Athens, 17-21 October 2016, 649-657. [Google Scholar] [CrossRef
[5] Wang, H., Xing, F., Su, H., Stromberg, A. and Yang, L. (2014) Novel Image Markers for Non-Small Cell Lung Cancer Classification and Survival Prediction. BMC Bioinfor-matics, 15, Article No. S1. [Google Scholar] [CrossRef
[6] Bøvelstad, H.M., Nygård, S., Størvold, H.L., Aldrin, M., Borgan, Ø., Frigessi, A. and Lingjærde, O.C. (2007) Predicting Survival from Microarray Data—A Comparative Study. Bioinformatics, 23, 2080-2087. [Google Scholar] [CrossRef] [PubMed]
[7] Dagogo-Jack, I. and Shaw, A.T. (2018) Tumour Heterogeneity and Resistance to Cancer Therapies. Nature Reviews Clinical Oncology, 15, 81-94. [Google Scholar] [CrossRef] [PubMed]
[8] 陈钰莹, 黄小华, 刘念, 唐玲玲, 胡云涛. 影像组学在胰腺癌中的临床研究进展[J]. 磁共振成像, 2021, 12(8): 108-110.
[9] 侯东辉, 吴宁. 影像组学在肺癌中的应用进展[J]. 癌症进展 2019, 17(2):128-130+144.
[10] Choi, W., Oh, J.H., Riyahi, S., Liu, C.J., Jiang, F., Chen, W., White, C., Rimner, A., Mechalakos, J.G. and Deasy, J.O. (2018) Radiomics Analysis of Pulmonary Nodules in Low-Dose CT for Early Detection of Lung Cancer. Medical Physics, 45, 1537-1549. [Google Scholar] [CrossRef] [PubMed]
[11] 杨娟, 母齐鸣, 谭琴. 影响肝内胆管癌手术预后的危险因素Logistic回归分析[J]. 江苏大学学报: 医学版, 2019, 29(2): 161-165.
[12] Zhu, X., Yao, J., Xin, L., Xiao, G. and Huang, J. (2016) Lung Cancer Survival Prediction from Patho-logical Images and Genetic Data—An Integration Study. IEEE International Symposium on Biomedical Imag-ing.
[13] Mobadersany, P., Yousefi, S., Amgad, M., Gutman, D.A., Barnholtz-Sloan, J.S., Vega, J.E.V., Brat, D.J. and Cooper, L.A. (2018) Predicting Cancer Outcomes from Histology and Genomics Using Convolutional Networks. Pro-ceedings of the National Academy of Sciences of the United States of America, 115, E2970-E2979. [Google Scholar] [CrossRef] [PubMed]
[14] Lai, Y.H., Chen, W.N., Hsu, T.C., Lin, C. and Wu, S. (2020) Overall Survival Prediction of Non-Small Cell Lung Cancer by Integrating Microarray and Clinical Data with Deep Learning. Scientific Reports, 10, Article No. 4679. [Google Scholar] [CrossRef] [PubMed]
[15] Zhang, Y., Ao, L., Chen, P. and Wang, M. (2016) Improve Glioblastoma Multiforme Prognosis Prediction by Using Feature Selection and Multiple Kernel Learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 13, 825-835. [Google Scholar] [CrossRef
[16] Zhang, B. and Horvath, S. (2005) A General Framework for Weighted Gene Co-Expression Network Analysis. Statistical Applications in Genetics and Molecular Biology, 4, Article No. 17. [Google Scholar] [CrossRef] [PubMed]
[17] Langfelder, P. and Horvath S (2008) WGCNA: An R pack-age for Weighted Correlation Network Analysis. BMC Bioinformatics, 9, Article No. 559. [Google Scholar] [CrossRef] [PubMed]