一种基于多头注意力机制的关键蛋白质靶点识别方法
A Method for Essential Protein Target Recognition Based on a Multi-Head Attention Mechanism
DOI: 10.12677/MOS.2023.126517, PDF,   
作者: 杨文雅, 关 月:贵州大学大数据与信息工程学院,贵州 贵阳;周 予:信阳职业技术学院数学与信息工程学院,河南 信阳
关键词: 多头注意力机制图神经网络蛋白质相互作用网基因表达谱同源相似性Multi-Head Attention Mechanism Graph Neural Networks PPI Networks Gene Expression Profiles Homology Similarity
摘要: 蛋白质是机体功能的重要执行者。特定的一些蛋白质对生物的生存、繁殖和生理调节尤为关键,它们往往成为疾病发生、发展中的重要参与者,这类蛋白质被称为关键蛋白质。因此,在疾病预防和治疗过程中快速找到蛋白质关键靶点是尤为重要的。本文提出了一种借助多头注意力机制来解决这一难题的方法。该方法将蛋白质相互作用(PPI)网络的拓扑特征、基因表达谱特征以及同源特融合,从而构建一个融合PPI网络。进而,我们采用图注意力神经网络(GAT)模型,学习融合PPI网络中节点的特征表示,为了更好的地捕获蛋白质之间的关联关系,我们引入多头注意力机制增强模型的学习效果。最终,通过在DIP酵母蛋白数据集上的训练和测试,实验结果证明了我们的方法相较于传统的基于拓扑的策略具有更高的识别精度。
Abstract: Proteins are important performers of organismal functions. Certain proteins are especially critical for organisms’ survival, reproduction, and physiological regulation. They often play a significant role in the onset and progression of the disease, which are called essential proteins. Therefore, rapidly identifying the essential protein targets is of paramount importance in the prevention and treat-ment of diseases. In this paper, we propose an approach to address this challenge with the help of a multi-head attention mechanism. Our approach uses topological features of protein-protein inter-action (PPI) networks, gene expression profile features, and homology features to construct a fusion PPI network. Further, we employ a graph attention neural network (GAT) model to learn the feature representations of the nodes in the fusion PPI network. To better capture the association relation-ships among proteins, we introduce a multi-head attention mechanism to enhance the learning ef-fect of the model. Finally, through training and testing on DIP yeast protein datasets, the experi-mental results demonstrate the higher recognition accuracy of our method compared to the tradi-tional topology-based strategy.
文章引用:杨文雅, 关月, 周予. 一种基于多头注意力机制的关键蛋白质靶点识别方法[J]. 建模与仿真, 2023, 12(6): 5693-5702. https://doi.org/10.12677/MOS.2023.126517

参考文献

[1] Giaever, G., et al. (2002) Functional Profiling of the Saccharomyces Cerevisiae Genome. Nature, 418, 387-391.
[2] Roemer, T., et al. (2003) Large-Scale Essential Gene Identification in Candida albicans and Applications to Antifungal Drug Discovery. Molecular Microbiology, 50, 167-181. [Google Scholar] [CrossRef] [PubMed]
[3] Cullen, L.M. and Arndt, G.M. (2005) Genome-Wide Screening for Genefunction Using RNAi in Mammalian Cells. Immunology & Cell Biology, 83, 217-223. [Google Scholar] [CrossRef] [PubMed]
[4] Li, M., Zhang, H., Wang, J.X. and Pan, Y. (2012) A New Essential Protein Discovery Method Based on the Integration of Protein-Protein Interaction and Gene Expression Data. BMC Systems Biology, 6, Article No. 15. [Google Scholar] [CrossRef] [PubMed]
[5] Pál, C., Papp, B. and Hurst, L.D. (2003) Genomic Function: Rate of Evolution and Gene Dispensability, Nature, 421, 496-497. [Google Scholar] [CrossRef] [PubMed]
[6] Zhang, W., Chen, Y., Liu, F., Luo, F., Tian, G., and Li, X. (2017) Predicting Potential Drug-Drug Interactions by Integrating Chemical, Bi-ological, Phenotypic and Network Data. BMC Bioinformatics, 18, 1-12.
[7] Vallabhajosyula, R.R., Chakravarti, D., Lutfeali, S., Ray, A. and Raval, A. (2009) Identifying Hubs in Protein Interaction Networks. PLOS ONE, 4, e5344. [Google Scholar] [CrossRef] [PubMed]
[8] Li, M., Wang, J., Chen, X., Wang, H. and Pan, Y. (2011) A Local Averageconnectivity-Based Method for Identifying Essential Proteinsfrom the Network Level. Computational Bi-ology and Chemistry, 35, 143-150. [Google Scholar] [CrossRef] [PubMed]
[9] Bonacich, P. and Lloyd, P. (2001) Eigenvector-Like Measures of Centralityfor Asymmetric Relations. Social Networks, 23, 191-201. [Google Scholar] [CrossRef
[10] Stephenson, K. and Zelen, M. (1989) Rethinking Centrality: Methods and Examples. Social Networks, 11, 1-37. [Google Scholar] [CrossRef
[11] Li, M., Wang, J., Wang, H. and Pan, Y. (2012) Identification of Essentialproteins Based on Edge Clustering Coefficient. IEEE/ACM Transactions on Computational Biology and Bio-informatics, 9, 1070-1080. [Google Scholar] [CrossRef
[12] Estrada, E. and Rodríguez-Velíazquez, J.A. (2005) Subgraph Central-ity in Complex Networks. Physical Review E: Statistical, Nonlinear, and Soft Matter Physics, 71, 056103. [Google Scholar] [CrossRef
[13] Wuchty, S. and Stadler, P.F. (2003) Centers of Complex Net-works. Journal of Theoretical Biology, 223, 45-53. [Google Scholar] [CrossRef
[14] Barabási, A.-L., Gulbahce, N. and Loscalzo, J. (2011) Net-work Medicine: A Network-Based Approach to Human Disease. Nature Reviews Genetics, 12, 56-68. [Google Scholar] [CrossRef] [PubMed]
[15] Winzeler, E.A., Shoemaker, D.D., Astromoff, A., Liang, H., Anderson, K., Andre, B. and Bussey, H. (1999) Functional Characterization of the S. Cerevisiae Genome by Gene Deletionand Parallel Analysis. Science, 285, 901-906.
[16] Zhang, X., Xu, J. and Xiao, W. (2013) A New Method for the Discovery of Es-sential Proteins. PLOS ONE, 8, e58763. [Google Scholar] [CrossRef] [PubMed]
[17] Tang, X., Wang, J., Zhong, J. and Pan, Y. (2014) Predicting Essential Proteins Based on Weighted Degree Centrality. IEEE/ACM Transactions on Computational Biology and Bioin-formatics, 11, 407-418. [Google Scholar] [CrossRef
[18] Peng, W., Wang, J., Cheng, Y. and Lu, Y. (2015) UDoNC: An Algorithmfor Identifying Essential Proteins Based on Protein Domains Andprotein-Protein Interaction Networks. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 12, 276-288. [Google Scholar] [CrossRef
[19] Peng, W., Wang, J.X., Wang, W., et al. (2012) Iteration Method for Predicting Essential Proteins Based on Orthology and Protein-Protein Interaction Networks. BMC Systems Biology, 6, Article No. 87. [Google Scholar] [CrossRef] [PubMed]
[20] Pereira-Leal, J.B., Audit, B., Peregrin-Alvarez, J.M. and Ouzounis, C.A. (2005) An Exponential Core in the Heart of the Yeast Protein Interaction Network. Molecular Biology and Evolu-tion, 22, 421-425. [Google Scholar] [CrossRef] [PubMed]
[21] Jancura, P., Mavridou, E., Pontes, B. and Marchiori, E (2011) De-scribing the Orthology Signal in a PPI Network at a Functional, Complex Level. International Symposium on Bioinfor-matics Research and Applications, 6674, 209-226. [Google Scholar] [CrossRef
[22] Wuchty, S., Barabasi, A.L., Ferdig, M.T. (2006) Stable Evolu-tionary Signal in a Yeast Protein Interaction Network. BMC Evolutionary Biology, 6, 8. [Google Scholar] [CrossRef] [PubMed]