基于肿瘤组织反卷积和SVM算法的疾病预测分析
Disease Prediction Analysis Based on Tumor Tissue Deconvolution and SVM Algorithm
DOI: 10.12677/AAM.2022.117468, PDF,    国家自然科学基金支持
作者: 周素素, 张伟伟*:东华理工大学理学院,江西 南昌
关键词: 疾病预测肿瘤组织异质性反卷积SVMDisease Prediction Tumor Tissue Heterogeneity Deconvolution SVM
摘要: 预后和诊断在疾病的预防和治疗中起着至关重要的作用。本文基于临床肿瘤组织的DNA甲基化芯片数据,利用反卷积算法对肿瘤组织进行分解,将估计得到的肿瘤组织中各细胞类型所占比例和细胞类型特异性的甲基化位点作为生物标志物,利用SVM算法构建该疾病的预测模型。TCGA肺腺癌、肾透明细胞癌的数据分析表明,所提方法在预测精确度和鲁棒性上都优于常用算法。
Abstract: Prognosis and diagnosis play a crucial role in the prevention and treatment of diseases. Based on the DNA methylation microarray data of clinical tumor tissue, this paper decomposes the tumor tissue by using the deconvolution algorithm, takes the estimated cell type proportions and cell type-specific methylation sites as biomarkers, and constructs the disease prediction model by using SVM algorithm. The data analysis of TCGA lung adenocarcinoma and renal clear cell carcinoma shows that the proposed method is superior to the common algorithms in accuracy and robustness.
文章引用:周素素, 张伟伟. 基于肿瘤组织反卷积和SVM算法的疾病预测分析[J]. 应用数学进展, 2022, 11(7): 4419-4426. https://doi.org/10.12677/AAM.2022.117468

参考文献

[1] Hanahan, D. and Weinberg, R.A. (2011) Hallmarks of Cancer: The Next Generation. Cell, 144, 646-674. [Google Scholar] [CrossRef] [PubMed]
[2] Li, Z. and Wu, H. (2019) TOAST: Improving Reference-Free Cell Composition Estimation by Cross-Cell Type Differential Analysis. Genome Biology, 20, 190. [Google Scholar] [CrossRef] [PubMed]
[3] Joyce, J.A. and Pollard, J.W. (2009) Microenvironmental Regula-tion of Metastasis. Nature Reviews Cancer, 9, 239-252. [Google Scholar] [CrossRef] [PubMed]
[4] Anders, S. and Huber, W. (2010) Differential Expression Analysis for Sequence Count Data. Genome Biology, 11, 1-12. [Google Scholar] [CrossRef] [PubMed]
[5] Onuchic, V., et al. (2016) Epigenomic Deconvolution of Breast Tumors Reveals Metabolic Coupling between Constituent Cell Types. Cell Reports, 17, 2075-2086. [Google Scholar] [CrossRef] [PubMed]
[6] Zhang, W., Wu, H. and Li, Z. (2021) Complete Deconvolution of DNA Methylation Signals from Complex Tissues: A Geometric Approach. Bioinformatics, 37, 1052-1059. [Google Scholar] [CrossRef] [PubMed]
[7] Houseman, E.A., et al. (2012) DNA Methylation Arrays as Surrogate Measures of Cell Mixture Distribution. BMC Bioinformatics, 13, Article No. 86. [Google Scholar] [CrossRef] [PubMed]
[8] Liu, Y., et al. (2013) Epigenome-Wide Association Data Implicate DNA Methylation as an Intermediary of Genetic Risk in Rheumatoid Arthritis. Nature Biotechnology, 31, 142-147. [Google Scholar] [CrossRef] [PubMed]
[9] Houseman, E.A., et al. (2014) Reference-Free Cell Mixture Adjustments in Analysis of DNA Methylation Data. Bioinformatics, 30, 1431-1439. [Google Scholar] [CrossRef] [PubMed]
[10] Rahmani, E., et al. (2016) Sparse PCA Corrects for Cell Type Heterogeneity in Epigenome-Wide Association Studies. Nature Methods, 13, 443-445. [Google Scholar] [CrossRef] [PubMed]
[11] Reinius, L.E., Acevedo, N., Joerink, M., et al. (2012) Differential DNA Methylation in Purified Human Blood Cells: Implications for Cell Lineage and Studies on Disease Susceptibility. PLOS ONE, 7, e41361. [Google Scholar] [CrossRef] [PubMed]
[12] Johnson, W.E., et al. (2007) Adjusting Batch Effects in Micro-array Expression Data Using Empirical Bayes Methods. Biostatistics, 8, 118-127. [Google Scholar] [CrossRef] [PubMed]
[13] Lin, Y., Chen, D., Ding, Q., Zhu, X., Zhu, R. and Chen, Y. (2021) Progress in Single-Cell RNA Sequencing of Lung Adenocarcinoma. Chinese Journal of Lung Cancer, 24, 434-440.
[14] Houseman, E.A., Kile, M.L., Christiani, D.C., Ince, T.A., Kelsey, K.T. and Marsit, C.J. (2016) Refer-ence-Free Deconvolution of DNA Methylation Data and Mediation by Cell Composition Effects. BMC Bioinformatics, 17, Article No. 259. [Google Scholar] [CrossRef] [PubMed]
[15] Whiteside, T.L. (2008) The Tumor Microen-vironment and Its Role in Promoting Tumor Growth. Oncogene, 27, 5904-5912. [Google Scholar] [CrossRef] [PubMed]