TCNLDA:基于自编码器和时序卷积网络预测lncRNA-疾病的关联
TCNLDA: Prediction of lncRNA-Disease Associations Based on Autoencoder and Temporal Convolutional Network
摘要: 研究表明长非编码RNA (long non-coding RNA, lncRNA)影响着许多疾病的生物学过程,例如疾病的发生、传播、治愈等。因此,预测潜在lncRNA-疾病关联(lncRNA-disease associations, LDAs)对疾病的诊疗和治疗有着重要意义。本文提出一种新的深度学习方法预测LDAs,称为TCNLDA。首先,分别构建了lncRNA的功能相似性矩阵、高斯相似性矩阵和序列相似性矩阵,以及疾病的语义相似性矩阵和高斯相似性矩阵,并将其进行矩阵融合处理。然后,构建lncRNA-疾病对,并对其使用自编码器(Autoencoder, AE)进行特征提取。最后,将提取好的特征输入到时序卷积网络(Temporal Convolutional Network, TCN)中进行训练输出预测得分。两个数据集中,TCNLDA与其他模型进行了比较,结果显示TCNLDA优于其他LDAs预测方法。消融实验验证了TCNLDA中各部分的不可缺少性。案例研究进一步表明,TCNLDA在预测新型LDAs方面有着很好的实用性。
Abstract: Studies have shown that long non-coding RNA (lncRNA) influences the biological processes of many diseases, such as disease onset, spread, and cure. Therefore, predicting potential lncRNA-disease associations (LDAs) is important for disease diagnosis and treatment. In this paper, we propose a new deep learning method to predict LDAs, called TCNLDA. Firstly, the functional similarity matrix, Gaussian similarity matrix and sequence similarity matrix of lncRNAs, and the semantic similarity matrix and Gaussian similarity matrix of diseases are constructed and processed for matrix fusion, respectively. Then, the lncRNA-disease pairs are constructed and feature extraction is performed on them using autoencoder (AE). Finally, the extracted features were input into Temporal Convolutional Network (TCN) for training to output the prediction scores. In both datasets, TCNLDA was compared with other models, and the results showed that TCNLDA outperformed other LDAs prediction methods. Ablation experiments verified the indispensability of the components in TCNLDA. The case study further shows that TCNLDA has good utility in predicting novel LDAs.
文章引用:孟令宇, 谭建军. TCNLDA:基于自编码器和时序卷积网络预测lncRNA-疾病的关联[J]. 生物医学, 2025, 15(3): 611-625. https://doi.org/10.12677/hjbm.2025.153070

参考文献

[1] Nagano, T. and Fraser, P. (2011) No-Nonsense Functions for Long Noncoding RNAs. Cell, 145, 178-181. [Google Scholar] [CrossRef] [PubMed]
[2] Zhao, T. (2019) Long Noncoding RNA and Its Role in Virus Infection and Pathogenesis. Frontiers in Bioscience, 24, 777-789. [Google Scholar] [CrossRef] [PubMed]
[3] Tüncel, Ö., Kara, M., Yaylak, B., Erdoğan, İ. and Akgül, B. (2022) Noncoding RNAs in Apoptosis: Identification and Turkish Journal of Biology, 46, 1-40.
[4] Chen, J., Ao, L. and Yang, J. (2019) Long Non-Coding RNAs in Diseases Related to Inflammation and Immunity. Annals of Translational Medicine, 7, 494-494. [Google Scholar] [CrossRef] [PubMed]
[5] Yan, J., Qu, W., Li, X., Wang, R. and Tan, J. (2024) GATLGEMF: A Graph Attention Model with Line Graph Embedding Multi-Complex Features for ncRNA-Protein Interactions Prediction. Computational Biology and Chemistry, 108, Article 108000. [Google Scholar] [CrossRef] [PubMed]
[6] Wang, F., Lin, H., Su, Q. and Li, C. (2022) Cuproptosis-Related lncRNA Predict Prognosis and Immune Response of Lung Adenocarcinoma. World Journal of Surgical Oncology, 20, Article No. 275. [Google Scholar] [CrossRef] [PubMed]
[7] Yan, J., Wang, R. and Tan, J. (2023) Recent Advances in Predicting lncRNA-Disease Associations Based on Computational Methods. Drug Discovery Today, 28, Article 103432. [Google Scholar] [CrossRef] [PubMed]
[8] Sun, J., Shi, H., Wang, Z., Zhang, C., Liu, L., Wang, L., et al. (2014) Inferring Novel lncRNA-Disease Associations Based on a Random Walk Model of a lncRNA Functional Similarity Network. Molecular BioSystems, 10, 2074-2081. [Google Scholar] [CrossRef] [PubMed]
[9] Zhang, J., Zhang, Z., Chen, Z. and Deng, L. (2019) Integrating Multiple Heterogeneous Networks for Novel lncRNA-Disease Association Inference. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 16, 396-406. [Google Scholar] [CrossRef] [PubMed]
[10] Xi, J., Wang, M. and Li, A. (2017) Discovering Potential Driver Genes through an Integrated Model of Somatic Mutation Profiles and Gene Functional Information. Molecular BioSystems, 13, 2135-2144. [Google Scholar] [CrossRef] [PubMed]
[11] Xi, W., Zhou, F., Gao, Y., Liu, J. and Zheng, C. (2023) LDCMFC: Predicting Long Non-Coding RNA and Disease Association Using Collaborative Matrix Factorization Based on Correntropy. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 20, 1774-1782. [Google Scholar] [CrossRef] [PubMed]
[12] Lu, C., Yang, M., Li, M., Li, Y., Wu, F. and Wang, J. (2020) Predicting Human lncRNA-Disease Associations Based on Geometric Matrix Completion. IEEE Journal of Biomedical and Health Informatics, 24, 2420-2429. [Google Scholar] [CrossRef] [PubMed]
[13] Tan, J., Li, X., Zhang, L. and Du, Z. (2022) Recent Advances in Machine Learning Methods for Predicting lncRNA and Disease Associations. Frontiers in Cellular and Infection Microbiology, 12, Article 1071972. [Google Scholar] [CrossRef] [PubMed]
[14] Sheng, N., Huang, L., Lu, Y., Wang, H., Yang, L., Gao, L., et al. (2023) Data Resources and Computational Methods for lncRNA-Disease Association Prediction. Computers in Biology and Medicine, 153, Article 106527. [Google Scholar] [CrossRef] [PubMed]
[15] Chen, X. and Yan, G. (2013) Novel Human lncRNA-Disease Association Inference Based on lncRNA Expression Profiles. Bioinformatics, 29, 2617-2624. [Google Scholar] [CrossRef] [PubMed]
[16] Yao, D., Zhan, X., Zhan, X., Kwoh, C.K., Li, P. and Wang, J. (2020) A Random Forest Based Computational Model for Predicting Novel lncRNA-Disease Associations. BMC Bioinformatics, 21, Article No. 126. [Google Scholar] [CrossRef] [PubMed]
[17] Xuan, P., Cao, Y., Zhang, T., Kong, R. and Zhang, Z. (2019) Dual Convolutional Neural Networks with Attention Mechanisms Based Method for Predicting Disease-Related lncRNA Genes. Frontiers in Genetics, 10, Article 416. [Google Scholar] [CrossRef] [PubMed]
[18] Fan, Y., Chen, M. and Pan, X. (2021) GCRFLDA: Scoring lncRNA-Disease Associations Using Graph Convolution Matrix Completion with Conditional Random Field. Briefings in Bioinformatics, 23, bbab361. [Google Scholar] [CrossRef] [PubMed]
[19] Fu, G., Wang, J., Domeniconi, C. and Yu, G. (2017) Matrix Factorization-Based Data Fusion for the Prediction of lncRNA-Disease Associations. Bioinformatics, 34, 1529-1537. [Google Scholar] [CrossRef] [PubMed]
[20] Ning, S., Zhang, J., Wang, P., Zhi, H., Wang, J., Liu, Y., et al. (2015) Lnc2Cancer: A Manually Curated Database of Experimentally Supported lncRNAs Associated with Various Human Cancers. Nucleic Acids Research, 44, D980-D985. [Google Scholar] [CrossRef] [PubMed]
[21] Chen, G., Wang, Z., Wang, D., Qiu, C., Liu, M., Chen, X., et al. (2012) LncRNADisease: A Database for Long-Non-Coding RNA-Associated Diseases. Nucleic Acids Research, 41, D983-D986. [Google Scholar] [CrossRef] [PubMed]
[22] Li, J., Liu, S., Zhou, H., Qu, L. and Yang, J. (2013) Starbase V2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and Protein-RNA Interaction Networks from Large-Scale CLIP-Seq Data. Nucleic Acids Research, 42, D92-D97. [Google Scholar] [CrossRef] [PubMed]
[23] Li, Y., Qiu, C., Tu, J., Geng, B., Yang, J., Jiang, T., et al. (2013) HMDD V2.0: A Database for Experimentally Supported Human MicroRNA and Disease Associations. Nucleic Acids Research, 42, D1070-D1074. [Google Scholar] [CrossRef] [PubMed]
[24] Gao, Y., Shang, S., Guo, S., Li, X., Zhou, H., Liu, H., et al. (2020) Lnc2Cancer 3.0: An Updated Resource for Experimentally Supported lncRNA/circRNA Cancer Associations and Web Tools Based on RNA-Seq and scRNA-Seq Data. Nucleic Acids Research, 49, D1251-D1258. [Google Scholar] [CrossRef] [PubMed]
[25] Lin, X., Lu, Y., Zhang, C., Cui, Q., Tang, Y., Ji, X., et al. (2023) LncRNADisease V3.0: An Updated Database of Long Non-Coding RNA-Associated Diseases. Nucleic Acids Research, 52, D1365-D1369. [Google Scholar] [CrossRef] [PubMed]
[26] Cui, C., Zhong, B., Fan, R. and Cui, Q. (2023) HMDD V4.0: A Database for Experimentally Supported Human MicroRNA-Disease Associations. Nucleic Acids Research, 52, D1327-D1332. [Google Scholar] [CrossRef] [PubMed]
[27] Wang, D., Wang, J., Lu, M., Song, F. and Cui, Q. (2010) Inferring the Human MicroRNA Functional Similarity and Functional Network Based on MicroRNA-Associated Diseases. Bioinformatics, 26, 1644-1650. [Google Scholar] [CrossRef] [PubMed]
[28] Liang, Y., Zhang, Z., Liu, N., Wu, Y., Gu, C. and Wang, Y. (2022) MAGCNSE: Predicting lncRNA-Disease Associations Using Multi-View Attention Graph Convolutional Network and Stacking Ensemble Model. BMC Bioinformatics, 23, Article No. 189. [Google Scholar] [CrossRef] [PubMed]
[29] Lu, C. and Xie, M. (2023) LDAEXC: LncRNA-Disease Associations Prediction with Deep Autoencoder and XGBoost Classifier. Interdisciplinary Sciences: Computational Life Sciences, 15, 439-451. [Google Scholar] [CrossRef] [PubMed]
[30] Xuan, P., Pan, S., Zhang, T., Liu, Y. and Sun, H. (2019) Graph Convolutional Network and Convolutional Neural Network Based Method for Predicting lncRNA-Disease Associations. Cells, 8, Article 1012. [Google Scholar] [CrossRef] [PubMed]
[31] Shi, Y., Lei, M., Ma, R. and Niu, L. (2019) Learning Robust Auto-Encoders with Regularizer for Linearity and Sparsity. IEEE Access, 7, 17195-17206. [Google Scholar] [CrossRef
[32] Lu, C., Yang, M., Luo, F., Wu, F., Li, M., Pan, Y., et al. (2018) Prediction of lncRNA-Disease Associations Based on Inductive Matrix Completion. Bioinformatics, 34, 3357-3364. [Google Scholar] [CrossRef] [PubMed]
[33] Zhu, R., Wang, Y., Liu, J. and Dai, L. (2021) IPCARF: Improving lncRNA-Disease Association Prediction Using Incremental Principal Component Analysis Feature Selection and a Random Forest Classifier. BMC Bioinformatics, 22, Article No. 175. [Google Scholar] [CrossRef] [PubMed]
[34] Shi, Z., Zhang, H., Jin, C., Quan, X. and Yin, Y. (2021) A Representation Learning Model Based on Variational Inference and Graph Autoencoder for Predicting lncRNA-Disease Associations. BMC Bioinformatics, 22, Article No. 136. [Google Scholar] [CrossRef] [PubMed]
[35] Wang, L. and Zhong, C. (2022) gGATLDA: LncRNA-Disease Association Prediction Based on Graph-Level Graph Attention Network. BMC Bioinformatics, 23, Article No. 11. [Google Scholar] [CrossRef] [PubMed]
[36] Smyth, E.C., Nilsson, M., Grabsch, H.I., van Grieken, N.C. and Lordick, F. (2020) Gastric Cancer. The Lancet, 396, 635-648. [Google Scholar] [CrossRef] [PubMed]
[37] Yu, Z., Wang, Z., Lee, K., Yuan, P. and Ding, J. (2017) Effect of Silencing Colon Cancer-Associated Transcript 2 on the Proliferation, Apoptosis and Autophagy of Gastric Cancer BGC-823 Cells. Oncology Letters, 15, 3127-3132. [Google Scholar] [CrossRef] [PubMed]
[38] Qi, P., Lin, W., Zhang, M., Huang, D., Ni, S., Zhu, X., et al. (2018) E2F1 Induces LSINCT5 Transcriptional Activity and Promotes Gastric Cancer Progression by Affecting the Epithelial-Mesenchymal Transition. Cancer Management and Research, 10, 2563-2571. [Google Scholar] [CrossRef] [PubMed]
[39] Feng, C., Zhao, Y., Li, Y., Zhang, T., Ma, Y. and Liu, Y. (2019) LncRNA MALAT1 Promotes Lung Cancer Proliferation and Gefitinib Resistance by Acting as a miR-200a Sponge. Archivos de Bronconeumología (English Edition), 55, 627-633. [Google Scholar] [CrossRef
[40] Nasim, F., Sabath, B.F. and Eapen, G.A. (2019) Lung Cancer. Medical Clinics of North America, 103, 463-473. [Google Scholar] [CrossRef] [PubMed]
[41] Dong, L., Li, G., Li, Y. and Zhu, Z. (2019) Upregulation of Long Noncoding RNA GAS5 Inhibits Lung Cancer Cell Proliferation and Metastasis via miR-205/PTEN Axis. Medical Science Monitor, 25, 2311-2319. [Google Scholar] [CrossRef] [PubMed]
[42] Zheng, F., Li, J., Ma, C., Tang, X., Tang, Q., Wu, J., et al. (2020) Novel Regulation of miR‐34a‐5p and HOTAIR by the Combination of Berberine and Gefitinib Leading to Inhibition of EMT in Human Lung Cancer. Journal of Cellular and Molecular Medicine, 24, 5578-5592. [Google Scholar] [CrossRef] [PubMed]
[43] Wang, M., Sun, X., Yang, Y. and Jiao, W. (2018) Long Non‐Coding RNA OIP5‐AS1 Promotes Proliferation of Lung Cancer Cells and Leads to Poor Prognosis by Targeting miR‐378a‐3p. Thoracic Cancer, 9, 939-949. [Google Scholar] [CrossRef] [PubMed]