基于GraphSAGE模型预测circRNA与疾病关联关系
Prediction of circRNA and Disease Association Based on GraphSAGE Model
DOI: 10.12677/hjcb.2025.151001, PDF,   
作者: 李小露:大连交通大学基础部理学院,辽宁 大连
关键词: circRNA-disease预测GraphSAGE模型异质图circRNA-Disease Prediction GraphSAGE Model Heterogeneous Graph
摘要: 环状RNA (circRNA)是一类内源性的非编码RNA,许多研究表明circRNA在复杂疾病中发挥着重要作用。然而,由于circRNA的功能复杂性和实验验证的高成本,传统的实验方法难以高效挖掘circRNA与疾病的关联关系,因此迫切需要高效的计算方法来揭示circRNA与疾病的关联关系。在现有数据库的基础上,本文提出了一种基于GraphSAGE模型的circRNA与疾病关联预测方法,通过整合circRNA相似性、疾病相似性以及已知的circRNA-disease关联数据构建异质图,随后借助GraphSAGE模型获得异质图中节点对应特征的高阶聚合表示,从而有效预测circRNA-disease关联。实验结果表明,GraphSAGE模型的AUC值为0.921,F1-score为0.865,Precision为0.879,Recall为0.852,以上四个评估指标均优于现有的DWNN-RLS和RWR模型。总之,GraphSAGE是预测circRNA-disease关联的有效方法。
Abstract: Circular RNA (circRNA) is a class of endogenous non-coding RNAs. Many studies have shown that circRNA plays an important role in complex diseases. However, due to the functional complexity of circRNA and the high cost of experimental verification, it is difficult for traditional experimental methods to efficiently mine the association between circRNA and disease, so efficient computational methods are urgently needed to reveal the association between circRNA and disease. Based on the existing database, this paper proposed a method for predicting the association between circRNA and disease based on GraphSAGE model. By integrating circRNA similarity, disease similarity and known circRNA-disease association data, a heterogeneous graph network was constructed, and then a high-level aggregated representation of the corresponding features of nodes in the heterogeneous graph network was obtained by GraphSAGE model, so as to effectively predict the circRNA-disease association. The experimental results demonstrate that the GraphSAGE model achieves an AUC of 0.921, F1-score of 0.865, Precision of 0.879 and Recall of 0.852, all of which were better than the existing DWNN-RLS and RWR models. In conclusion, GraphSAGE is an effective method to predict the association of circRNA-disease.
文章引用:李小露. 基于GraphSAGE模型预测circRNA与疾病关联关系[J]. 计算生物学, 2025, 15(1): 1-11. https://doi.org/10.12677/hjcb.2025.151001

参考文献

[1] Ye, C., Chen, L., Liu, C., Zhu, Q. and Fan, L. (2015) Widespread Noncoding Circular RNAs in Plants. New Phytologist, 208, 88-95. [Google Scholar] [CrossRef] [PubMed]
[2] Kristensen, L.S., Hansen, T.B., Venø, M.T. and Kjems, J. (2017) Circular RNAs in Cancer: Opportunities and Challenges in the Field. Oncogene, 37, 555-565. [Google Scholar] [CrossRef] [PubMed]
[3] 周红, 张海博, 闫瑞娟, 魏海梁, 闫曙光, 李京涛, 常占杰. circRNA在肝细胞癌发生发展中的作用及机制[J]. 肿瘤防治研究, 2022, 49(5): 496-502.
[4] 朱慧静, 杨明明, 周丹, 刘宇光, 刘亚丽. 2型糖尿病患者血浆环状RNA差异性表达的研究[J]. 中国糖尿病杂志, 2024, 32(1): 16-22.
[5] 傅丽蓉, 张晨. 环状RNA在精神分裂症中作用的研究进展[J]. 上海交通大学学报(医学版), 2023, 43(11): 1445-1449.
[6] Seo, J., Jung, H. and Ko, Y. (2023) PRID: Prediction Model Using RWR for Interactions between Drugs. Pharmaceutics, 15, Article No. 2469. [Google Scholar] [CrossRef] [PubMed]
[7] Bian, C., Lei, X. and Wu, F. (2021) GATCDA: Predicting circRNA-Disease Associations Based on Graph Attention Network. Cancers, 13, Article No. 2595. [Google Scholar] [CrossRef] [PubMed]
[8] Li, G., Wang, D., Zhang, Y., Liang, C., Xiao, Q. and Luo, J. (2022) Using Graph Attention Network and Graph Convolutional Network to Explore Human CircRNA-Disease Associations Based on Multi-Source Data. Frontiers in Genetics, 13, Article ID: 829937. [Google Scholar] [CrossRef] [PubMed]
[9] Xiao, Q., Dai, J. and Luo, J. (2021) A Survey of Circular RNAs in Complex Diseases: Databases, Tools and Computational Methods. Briefings in Bioinformatics, 23, 7-8. [Google Scholar] [CrossRef] [PubMed]
[10] Yan, C., Wang, J. and Wu, F. (2018) DWNN-RLS: Regularized Least Squares Method for Predicting circRNA-Disease Associations. BMC Bioinformatics, 19, Article No. 520. [Google Scholar] [CrossRef] [PubMed]
[11] Hamilton, W., Ying, Z. and Leskovec, J. (2017) Inductive Representation Learning on Large Graphs. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 1025-1035.
[12] Zhou, B., Ran, B. and Chen, L. (2024) A GraphSAGE-Based Model with Fingerprints Only to Predict Drug-Drug Interactions. Mathematical Biosciences and Engineering, 21, 2922-2942. [Google Scholar] [CrossRef] [PubMed]
[13] Momanyi, B.M., Zhou, Y., Grace-Mercure, B.K., Temesgen, S.A., Basharat, A., Ning, L., et al. (2024) SAGESDA: Multi-GraphSAGE Networks for Predicting SnoRNA-Disease Associations. Current Research in Structural Biology, 7, Article ID: 100122. [Google Scholar] [CrossRef] [PubMed]
[14] Koca, M.B., Nourani, E., Abbasoğlu, F., Karadeniz, İ. and Sevilgen, F.E. (2022) Graph Convolutional Network Based Virus-Human Protein-Protein Interaction Prediction for Novel Viruses. Computational Biology and Chemistry, 101, Article ID: 107755. [Google Scholar] [CrossRef] [PubMed]
[15] Tang, X., Lei, X. and Zhang, Y. (2024) Prediction of Drug-Target Affinity Using Attention Neural Network. International Journal of Molecular Sciences, 25, Article No. 5126. [Google Scholar] [CrossRef] [PubMed]
[16] Fan, C., Lei, X., Fang, Z., Jiang, Q. and Wu, F. (2018) CircR2Disease: A Manually Curated Database for Experimentally Supported Circular RNAs Associated with Various Diseases. Database, 2018, Article No. 44. [Google Scholar] [CrossRef] [PubMed]
[17] Fan, C., Lei, X., Tie, J., Zhang, Y., Wu, F. and Pan, Y. (2021) CircR2Disease V2.0: An Updated Web Server for Experimentally Validated circRNA-Disease Associations and Its Application. Genomics, Proteomics & Bioinformatics, 20, 435-445. [Google Scholar] [CrossRef] [PubMed]
[18] Wang, J.Z., Du, Z., Payattakool, R., Yu, P.S. and Chen, C. (2007) A New Method to Measure the Semantic Similarity of GO Terms. Bioinformatics, 23, 1274-1281. [Google Scholar] [CrossRef] [PubMed]
[19] Wei, H. and Liu, B. (2019) iCircDA-MF: Identification of circRNA-Disease Associations Based on Matrix Factorization. Briefings in Bioinformatics, 21, 1356-1367. [Google Scholar] [CrossRef] [PubMed]
[20] Peng, L., Yang, C., Chen, Y. and Liu, W. (2023) Predicting circRNA-Disease Associations via Feature Convolution Learning with Heterogeneous Graph Attention Network. IEEE Journal of Biomedical and Health Informatics, 27, 3072-3082. [Google Scholar] [CrossRef] [PubMed]
[21] Fan, C., Lei, X. and Wu, F. (2018) Prediction of circRNA-Disease Associations Using KATZ Model Based on Heterogeneous Networks. International Journal of Biological Sciences, 14, 1950-1959. [Google Scholar] [CrossRef] [PubMed]