一种基于图折叠的网络嵌入方法
A Network Embedding Method Based on Graph Folding
DOI: 10.12677/CSA.2019.99197, PDF,    科研立项经费支持
作者: 冯晓硕:海军研究院,北京;王冬琦*:东北大学软件学院,辽宁 沈阳
关键词: 网络嵌入图折叠k完全子图DeepwalkNetwork Embedding Graph Folding k-Clique Deepwalk
摘要: 随着信息技术的广泛应用,信息网络正在变得无处不在,社交网络、引文网络、电信网络乃至生物网络等各类网络让信息网络研究受到了众多学科研究人员的关注。网络嵌入是一种保留网络拓扑信息和节点内容等其他附带信息的网络节点低维向量表示学习方法,在新的低维空间中网络分析挖掘任务可能更容易被解决,任务的运算复杂性也有可能降低。本文设计实现了一种基于完全子图折叠的网络嵌入方法,该方法把目标网络的k完全子图视为超节点,在以超节点为单位的新网络上使用任意网络嵌入算法学习超节点的向量表示,之后把超节点的向量表示作为对应k-完全子图中所有节点输入到任意网络嵌入学习算法的初始值,重新学习获得节点最终的向量表示。本文使用Deepwalk算法进行了实验,实验结果表明,本方法不但大幅提升了网络嵌入的速度,而且本方法学到的节点向量在一些下游应用中的表现也优于纯粹的Deepwalk算法。
Abstract: With the full application of information technology, information networks are becoming ubiquitous. Social networks, citation networks, telecommunication networks, and even biological networks have made information network research attract the attention of researchers in many disciplines. Network embedding is a low-dimensional vector representation learning method of nodes that preserves information such as network topology and node content. In the low-dimensional space, network analysis and mining tasks may be easier to solve, and the computational complexity of tasks may also be lowered. This paper designs and implements a network embedding method based on complete subgraph folding. This method regards the k-complete subgraphs of the target network as supernodes and uses an arbitrary network embedding algorithm to learn the vector representation of supernodes in the new network of supernodes. Then we use the learned vector representation to initialize the members’ vector representation of the supernode, after that, the target network will be fed into an arbitrary network embedding algorithm and learn to get the final vector representation of nodes. In this paper, we select to use Deepwalk algorithm in the experiments. Experiment results show that the proposed method significantly improved the speed of network embedding. At the same time, the node vectors learned by the proposed method also out-performed the original Deepwalk algorithm in selected downstream applications.
文章引用:冯晓硕, 王冬琦. 一种基于图折叠的网络嵌入方法[J]. 计算机科学与应用, 2019, 9(9): 1753-1760. https://doi.org/10.12677/CSA.2019.99197

参考文献

[1] Belkin, M. and Niyogi, P. (2002) Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering. Pro-ceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic, Van-couver, 3-8 December 2001, 585-591.
[2] Perozzi, B., Al-Rfou, R. and Skiena, S. (2014) Deepwalk: Online Learning of Social Representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, New York, 701-710. [Google Scholar] [CrossRef
[3] Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J. and Mei, Q. (2015) Line: Large-Scale Information Network Embedding. In: Proceedings of the 24th International Conference on World Wide Web, International World Wide Web Conferences Steering Commit-tee, Geneva, 1067-1077. [Google Scholar] [CrossRef
[4] Grover, A. and Leskovec, J. (2016) node2vec: Scalable Feature Learning for Networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, New York, 855-864. [Google Scholar] [CrossRef] [PubMed]
[5] Newman, M. (2018) Networks. Oxford University Press, Oxford. [Google Scholar] [CrossRef
[6] Hu, Y. (2005) Efficient, High-Quality Force-Directed Graph Drawing. Mathematica Journal, 10, 37-71.
[7] Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S. and Dean, J. (2013) Distributed Representations of Words and Phrases and Their Compositionality. Proceedings of the 26th Interna-tional Conference on Neural Information Processing Systems, Volume 2, Lake Tahoe, 5-10 December 2013, 3111-3119.
[8] Newman, M. (2013) Books about US Politics. http://www-personal.umich.edu/~mejn/netdata
[9] Girvan, M. and Newman, M.E. (2002) Community Structure in So-cial and Biological Networks. Proceedings of the National Academy of Sciences, 99, 7821-7826. [Google Scholar] [CrossRef] [PubMed]
[10] Estévez, P.A., Tesmer, M., Perez, C.A. and Zurada, J.M. (2009) Normalized Mutual Information Feature Selection. IEEE Transactions on Neural Networks, 20, 189-201. [Google Scholar] [CrossRef