基于转录组分析的冠状动脉疾病枢纽基因鉴定及诊断列线图构建
Identification of Hub Genes and Construction of a Diagnostic Nomogram for Coronary Artery Disease Based on Transcriptomic Analysis
DOI: 10.12677/acm.2026.1641706, PDF,   
作者: 李晨宇:山东大学齐鲁医院内分泌与代谢病科,山东 济南;孙雪纯*:山东大学公共卫生学院营养与食品卫生学系,山东 济南
关键词: 冠状动脉疾病诊断生物标志物免疫细胞组成转录组学枢纽基因列线图Coronary Artery Disease Diagnostic Biomarkers Immune Cell Composition Transcriptomics Hub Genes Nomogram
摘要: 背景:冠状动脉疾病(CAD)是一类以冠状动脉粥样硬化为主要病理基础的慢性心血管疾病,其分子机制尚未完全阐明。本研究旨在筛选CAD的诊断相关枢纽基因,并探讨其在免疫病理过程中的潜在作用。方法:从GEO数据库获取GSE20680数据集。采用limma筛选差异表达基因(DEGs),并进行GO和KEGG富集分析;采用WGCNA识别与CAD相关的模块;利用CIBERSORT推断免疫细胞相对组成;通过SVM-RFE和随机森林算法交集筛选枢纽基因;进一步构建诊断列线图,并采用校准曲线、DCA及ROC进行评价。结果:CAD相关差异基因主要涉及氧化磷酸化、核糖体、能量代谢及免疫炎症相关通路。最终筛选出3个枢纽基因:PSMA6、TOMM7和RPL9。免疫特征分析显示,CAD组与对照组在多种免疫细胞亚群相对丰度上存在差异。基于枢纽基因构建的列线图具有一定判别能力(AUC约为0.65)和校准性能。结论:PSMA6、TOMM7和RPL9可能作为CAD的潜在诊断生物标志物。基于枢纽基因构建的列线图可为CAD的早期识别及风险评估提供一定参考。
Abstract: Background: Coronary artery disease (CAD) is a chronic cardiovascular disorder primarily based on coronary atherosclerosis, and its molecular mechanisms remain incompletely understood. This study aimed to identify diagnostic hub genes for CAD and to explore their potential roles in its immunopathological processes. Methods: The GSE20680 dataset was obtained from the GEO database. Differentially expressed genes (DEGs) were identified using limma, followed by GO and KEGG enrichment analyses. WGCNA was performed to identify CAD-related modules. CIBERSORT was used to infer the relative composition of immune cells. Hub genes were screened by intersecting the results of SVM-RFE and random forest. A diagnostic nomogram was then constructed and evaluated using calibration curves, decision curve analysis (DCA), and receiver operating characteristic (ROC) curves. Results: CAD-related DEGs were mainly involved in oxidative phosphorylation, ribosome, energy metabolism, and immune-inflammatory pathways. Three hub genes were ultimately identified: PSMA6, TOMM7, and RPL9. Immune profiling analysis showed differences in the relative abundance of multiple immune cell subsets between the CAD and control groups. The nomogram constructed based on the hub genes showed a certain discriminative ability (AUC approximately 0.65) and calibration performance. Conclusion: PSMA6, TOMM7, and RPL9 may serve as potential diagnostic biomarkers for CAD. The nomogram constructed based on these hub genes may provide a useful reference for the early identification and risk assessment of CAD.
文章引用:李晨宇, 孙雪纯. 基于转录组分析的冠状动脉疾病枢纽基因鉴定及诊断列线图构建[J]. 临床医学进展, 2026, 16(4): 4369-4382. https://doi.org/10.12677/acm.2026.1641706

参考文献

[1] Roth, G.A., Mensah, G.A., Johnson, C.O., Addolorato, G., et al. (2020) Global Burden of Cardiovascular Diseases and Risk Factors, 1990-2019 Update from the GBD 2019 Study. The Journal of the American College of Cardiology, 76, 2982-3021.
[2] Kyu, H.H., Abate, D., Abate, K.H., Abay, S.M., Abbafati, C., et al. (2018) Global, Regional, and National Disability-Adjusted Life-Years (DALYs) for 359 Diseases and Injuries and Healthy Life Expectancy (HALE) for 195 Countries and Territories, 1990-2017: A Systematic Analysis for the Global Burden of Disease Study 2017. The Lancet, 392, 1859-1922.
[3] Barrett, T., Wilhite, S.E., Ledoux, P., Evangelista, C., Kim, I.F., Tomashevsky, M., et al. (2013) NCBI GEO: Archive for Functional Genomics Data Sets—Update. Nucleic Acids Research, 41, D991-D995. [Google Scholar] [CrossRef] [PubMed]
[4] Lai, H.M., Li, X.M., Yang, Y.N., Ma, Y.T., et al. (2015) Genetic Variation in NFKB1 and NFKBIA and Susceptibility to Coronary Artery Disease in a Chinese Uygur Population. PLOS ONE, 10, e0129144. [Google Scholar] [CrossRef] [PubMed]
[5] Libby, P. (2002) Inflammation in Atherosclerosis. Nature, 420, 868-874. [Google Scholar] [CrossRef] [PubMed]
[6] Elashoff, M.R., Wingrove, J.A., Beineke, P., Daniels, S.E., Tingley, W.G., Rosenberg, S., et al. (2011) Development of a Blood-Based Gene Expression Algorithm for Assessment of Obstructive Coronary Artery Disease in Non-Diabetic Patients. BMC Medical Genomics, 4, Article No. 26. [Google Scholar] [CrossRef] [PubMed]
[7] Ritchie, M.E., Phipson, B., Wu, D., Hu, Y., Law, C.W., Shi, W., et al. (2015) Limma Powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Research, 43, e47. [Google Scholar] [CrossRef] [PubMed]
[8] Glickman, M.E., Rao, S.R. and Schultz, M.R. (2014) False Discovery Rate Control Is a Recommended Alternative to Bonferroni-Type Adjustments in Health Studies. Journal of Clinical Epidemiology, 67, 850-857. [Google Scholar] [CrossRef] [PubMed]
[9] Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., et al. (2000) Gene Ontology: Tool for the Unification of Biology. Nature Genetics, 25, 25-29. [Google Scholar] [CrossRef] [PubMed]
[10] Kanehisa, M. and Goto, S. (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research, 28, 27-30. [Google Scholar] [CrossRef] [PubMed]
[11] Yu, G., Wang, L., Han, Y. and He, Q. (2012) Clusterprofiler: An R Package for Comparing Biological Themes among Gene Clusters. OMICS: A Journal of Integrative Biology, 16, 284-287. [Google Scholar] [CrossRef] [PubMed]
[12] Subramanian, A., Tamayo, P., Mootha, V.K., Mukherjee, S., Ebert, B.L., Gillette, M.A., et al. (2005) Gene Set Enrichment Analysis: A Knowledge-Based Approach for Interpreting Genome-Wide Expression Profiles. Proceedings of the National Academy of Sciences, 102, 15545-15550. [Google Scholar] [CrossRef] [PubMed]
[13] Liberzon, A., Subramanian, A., Pinchback, R., Thorvaldsdóttir, H., Tamayo, P. and Mesirov, J.P. (2011) Molecular signatures database (MSigDB) 3.0. Bioinformatics, 27, 1739-1740. [Google Scholar] [CrossRef] [PubMed]
[14] Langfelder, P. and Horvath, S. (2008) WGCNA: An R Package for Weighted Correlation Network Analysis. BMC Bioinformatics, 9, Article No. 559. [Google Scholar] [CrossRef] [PubMed]
[15] Huang, S., Cai, N., Pacheco, P.P., Narandes, S., et al. (2017) Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genomics Proteomics, 15, 41-51.
[16] Breiman, L. (2001) Random Forests. Machine Learning, 45, 5-32. [Google Scholar] [CrossRef
[17] Friedman, J., Hastie, T. and Tibshirani, R. (2010) Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software, 33, 1-22.
[18] Newman, A.M., Liu, C.L., Green, M.R., Gentles, A.J., Feng, W., Xu, Y., et al. (2015) Robust Enumeration of Cell Subsets from Tissue Expression Profiles. Nature Methods, 12, 453-457. [Google Scholar] [CrossRef] [PubMed]
[19] Harrell, F.E. (2015) Introduction. In: Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, Springer, 1-11.
[20] Vickers, A.J. and Elkin, E.B. (2006) Decision Curve Analysis: A Novel Method for Evaluating Prediction Models. Medical Decision Making, 26, 565-574. [Google Scholar] [CrossRef] [PubMed]
[21] Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J., et al. (2011) pROC: An Open-Source Package for R and S+ to Analyze and Compare ROC Curves. BMC Bioinformatics, 12, Article No. 77. [Google Scholar] [CrossRef] [PubMed]
[22] Bennett, D.A., Xu, P., Clarke, R., Zondervan, K., Parish, S., Palmer, A., et al. (2008) The Exon 1-8C/G SNP in the PSMA6 Gene Contributes Only a Small Amount to the Burden of Myocardial Infarction in 6946 Cases and 2720 Controls from a United Kingdom Population. European Journal of Human Genetics, 16, 480-486. [Google Scholar] [CrossRef] [PubMed]
[23] Shi, D., Qi, M., Zhou, L., Li, X., Ni, L., Li, C., et al. (2018) Endothelial Mitochondrial Preprotein Translocase Tomm7-Rac1 Signaling Axis Dominates Cerebrovascular Network Homeostasis. Arteriosclerosis, Thrombosis, and Vascular Biology, 38, 2665-2677. [Google Scholar] [CrossRef] [PubMed]
[24] Lezzerini, M., Penzo, M., O’Donohue, M., Marques dos Santos Vieira, C., Saby, M., Elfrink, H.L., et al. (2020) Ribosomal Protein Gene RPL9 Variants Can Differentially Impair Ribosome Function and Cellular Metabolism. Nucleic Acids Research, 48, 770-787. [Google Scholar] [CrossRef] [PubMed]