基于实体不可知表示学习的知识图谱嵌入
Knowledge Graph Embedding Based on Entity-Agnostic Representation Learning
DOI: 10.12677/sea.2024.133033, PDF,   
作者: 杨科林, 杨 斌:陇南通途公路养护工程有限公司,甘肃 陇南;秦崇良, 雷荣军, 张永超, 屈睿涛*:中国电信股份有限公司兰州分公司,甘肃 兰州
关键词: 知识图谱嵌入实体不可知表示学习连接关系K近邻保留实体Knowledge Graph Embedding Entity-Agnostic Representation Learning Connection Relation K-Nearest Neighbor Preserved Entities
摘要: 针对知识图谱嵌入(KGE)引发的参数存储效率低下的问题,本研究提出了一种创新的实体不可知表示学习方法。传统的知识图谱嵌入技术通过为知识图谱中的各个元素(涵盖实体和关系)分配特定的嵌入(即向量化表达),将其映射至连续的向量空间。然而,这种方法导致了嵌入参数随着知识图谱规模的扩大而呈现线性增长的趋势。在此基础上,我们提出了名为实体不可知表示学习(EARL)的模型。该模型仅学习一小部分实体的嵌入,这些实体被称为保留实体。为了获取完整实体集的嵌入,我们巧妙地结合关系、K近邻保留实体以及多跳邻居中的信息,以编码这些保留实体的独特特征。通过学习通用且实体不可知的编码器,我们将这些特征高效地转换为实体的嵌入。相较于传统的知识图谱嵌入技术,这种创新方法使得我们提出的EARL模型在保持高效的同时,具有更少的参数量,从而展现出更高的静态性。实验结果充分验证了EARL在链路预测任务上的卓越性能,并且在参数效率方面展现出显著优势,进一步凸显了其在减少参数使用方面的有效性。
Abstract: In order to solve the problem of inefficient parameter storage caused by Knowledge Graph Embedding (KGE), this study proposes an innovative entity-agnostic representation learning method. Traditional knowledge graph embedding techniques map each element (covering entities and relations) in the knowledge graph to a continuous vector space by assigning specific embeddings (i.e. vectorized representations). However, this approach leads to a linear growth trend of embedding parameters as the size of the knowledge graph increases. On this basis, we propose a model called Entity-Agnostic Representation Learning (EARL). The model learns the embedding of only a small subset of entities, which are known as reserved entities. To obtain embeddings for the full set of entities, we skillfully combine information from relationships, K-nearest neighbor preserved entities, and multi-hop neighbors to encode the unique characteristics of these retained entities. By learning a generic and entity-agnostic encoder, we efficiently translate these features into entity embeddings. Compared with the traditional knowledge graph embedding technology, this innovative method enables the proposed EARL model to have fewer parameters while maintaining high efficiency, so as to show higher staticity. The experimental results fully verify the excellent performance of EARL in the link prediction task, and show significant advantages in parameter efficiency, which further highlights its effectiveness in reducing the use of parameters.
文章引用:杨科林, 杨斌, 秦崇良, 雷荣军, 张永超, 屈睿涛. 基于实体不可知表示学习的知识图谱嵌入[J]. 软件工程与应用, 2024, 13(3): 330-335. https://doi.org/10.12677/sea.2024.133033

参考文献

[1] Galkin, M., Denis, E., Wu, J., et al. (2021) NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs. arXiv: 2106.12144.
[2] Sun, Z., Deng, Z.H., Nie, J.Y., et al. (2019) Rotate: Knowledge Graph Embedding by Relational Rotation in Complex Space. arXiv: 1902.10197.
[3] Bordes, A., Usunier, N., Garcia-Duran, A., et al. (2013) Translating Embeddings for Modeling Multi-Relational Data. Proceedings of the 26th International Conference on Neural Information Processing Systems, Lake Tahoe, December 2013, 2787-2795.
[4] Peng, C., Xia, F., Naseriparsa, M. and Osborne, F. (2023) Knowledge Graphs: Opportunities and Challenges. Artificial Intelligence Review, 56, 13071-13102. [Google Scholar] [CrossRef] [PubMed]
[5] Tamašauskaitė, G. and Groth, P. (2023) Defining a Knowledge Graph Development Process through a Systematic Review. ACM Transactions on Software Engineering and Methodology, 32, 1-40. [Google Scholar] [CrossRef
[6] Pan, J.Z., Razniewski, S., Kalo, J.C., et al. (2023) Large Language Models and Knowledge Graphs: Opportunities and Challenges. arXiv: 2308.06374.
[7] Zhong, L., Wu, J., Li, Q., et al. (2023) A Comprehensive Survey on Automatic Knowledge Graph Construction. arXiv: 2302.05019.