基于节点拓扑属性的社团结构划分方法
Community Structure Division Method Based on Node Topology Attributes
摘要: 社团结构,作为复杂网络分析中的核心概念之一,指的是那些拥有相似特性和功能的节点所构成的子图结构。社团结构的划分不仅有助于深入挖掘网络内在的信息和特征,而且还有助于揭示复杂网络系统的演化机制。本研究提出了一种创新的社团结构划分算法,其核心围绕网络结构中节点的拓扑属性。在该算法中,我们首先通过节点的接近中心性和介质中心性构建二维决策矩阵。具体来说,接近中心性揭露了节点在网络中与其他节点的亲密程度,而介质中心性则考虑了节点在网络中的桥接作用。二维决策矩阵的构建旨在从网络的广度和深度双视角考察节点的重要性和功能。然后利用基于K-means的迭代策略找到最优的聚类中心。最后通过加权的Dijkstra算法将剩余节点依据就近的聚类中心进行分配完成社团划分。为验证所提出的算法的有效性,在多个实际存在的社会网络上进行了实验,并将其与已有的算法进行了综合比较。实验结果验证了所提算法的性能,特别是在识别具有强社交关系的Karate数据集的社团关系中,本文所提算法在NMI,ARI,Purity三个评价指标中分别达到了81.56%、83.34%、100.00%,提高了社团划分结果的准确性。
Abstract: Community structure, as a fundamental concept in the analysis of complex networks, refers to sub-graph structures composed of nodes with similar characteristics and functions. The division of community structure not only aids in the in-depth exploration of intrinsic information and features within networks but also helps uncover the evolutionary mechanisms of complex network systems. This study introduces an innovative algorithm for community structure detection, primarily focus-ing on the topological attributes of nodes within the network. In this algorithm, we initially con-struct a two-dimensional decision matrix using node closeness centrality and betweenness central-ity. Specifically, closeness centrality reveals the intimacy of nodes with respect to other nodes in the network, while betweenness centrality considers the bridging role of nodes within the network. The construction of the two-dimensional decision matrix aims to examine the importance and function-ality of nodes from both breadth and depth perspectives of the network. Subsequently, an iterative strategy based on K-means is employed to identify the optimal clustering centers. Finally, the re-maining nodes are allocated to clusters based on their proximity to the designated clustering cen-ters using a weighted Dijkstra algorithm, thus completing the community partition. To validate the effectiveness of the proposed algorithm, experiments were conducted on multiple real-world social networks, and comprehensive comparisons were made with existing algorithms. The experimental results have validated the performance of the proposed algorithm, particularly in identifying the community relationships within the Karate dataset, which is characterized by strong social ties. The proposed algorithm achieved 81.56%, 83.34% and 100.00% respectively in the three evaluation indexes of NMI, ARI and Purity. These metrics indicate an enhancement in the accuracy of the community detection results
文章引用:骆勇. 基于节点拓扑属性的社团结构划分方法[J]. 应用数学进展, 2023, 12(12): 4946-4954. https://doi.org/10.12677/AAM.2023.1212487

参考文献

[1] Newman, M.E.J. (2006) Finding Community Structure in Networks Using the Eigenvectors of Matrices. Physical Review E, 74, Article ID: 036104. [Google Scholar] [CrossRef
[2] 李欢, 莫欣岳. 复杂网络重叠社团检测算法研究综述[J]. 传感器与微系统, 2017, 36(1): 1-4. [Google Scholar] [CrossRef
[3] Ferligoj, A. and Batagelj, V. (1992) Direct Mul-ticriteria Clustering Algorithms. Journal of Classification, 9, 43-61. [Google Scholar] [CrossRef
[4] 蒋忠元, 陈贤宇, 马建峰. 社交网络中的社团隐私研究综述[J]. 网络与信息安全学报, 2021, 7(2): 10-21.
[5] Souravlas, S., Sifaleras, A., Tsintogianni, M. and Katsavounis, S. (2021) A Classification of Community Detection Methods in Social Networks: A Survey. International Journal of General Sys-tems, 50, 63-91. [Google Scholar] [CrossRef
[6] 方倩, 窦永香, 王帮金. 基于Web of Science的社会化媒体环境下社区发现研究综述[J]. 中文信息学报, 2017, 31(3): 1-8.
[7] 黄琳凯. 基于知识图谱的Web信息关联网络分析与主题社区发现[J]. 无线互联科技, 2018, 15(10): 23-24.
[8] 向卓元, 利朝香. 知识图谱视域下国内外社区发现研究动态与热点分析[J]. 图书馆研究与工作, 2019(1): 52-57.
[9] Raghavan, U.N., Albert, R. and Kumara, S. (2007) Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks. Physical Review E, 76, Article ID: 036106. [Google Scholar] [CrossRef
[10] Zhang, Y., Zhang, Y., Chen, Q., Ai, Z. and Gong, Z. (2017) True-Link Clustering through Signaling Process and Subcommunity Merge in Overlapping Community Detection. Neural Computing & Applications, 30, 3613-3621. [Google Scholar] [CrossRef
[11] Blondel, V.D., Guillaume, J., Lambiotte, R. and Lefebvre, E. (2008) Fast Unfolding of Communities in Large Networks. Journal of Statistical Mechanics: Theory and Experiment, No. 10, P10008. [Google Scholar] [CrossRef
[12] Zhe, C., Sun, A. and Xiao, X. (2019) Community Detec-tion on Large Complex Attribute Network. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, 4-8 August 2019, 2041-2049. [Google Scholar] [CrossRef
[13] Zhu, J., Chen, B. and Zeng, Y. (2020) Community Detection Based on Modularity and K-Plexes. Information Sciences, 513, 127-142. [Google Scholar] [CrossRef
[14] Perozzi, B., Al-Rfou, R. and Skiena, S. (2014) DeepWalk: Online Learning of Social Representations. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, 24-27 August 2014, 701-710. [Google Scholar] [CrossRef
[15] Grover, A. and Leskovec, J. (2016) Node2vec: Scalable Feature Learning for Networks. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 855-864. [Google Scholar] [CrossRef] [PubMed]
[16] Sabidussi, G. (1966) The Centrality Index of a Graph. Psy-chometrika, 31, 581-603. [Google Scholar] [CrossRef
[17] Freeman, L.C. (1977) A Set of Measures of Centrality Based on Be-tweenness. Sociometry, 40, 35-41. [Google Scholar] [CrossRef
[18] Dijkstra, E.W. (1959) A Note on Two Problems in Connexion with Graphs. Numerische Mathematik, 1, 269-271. [Google Scholar] [CrossRef
[19] Zachary, W.W. (1977) An Information Flow Model for Conflict and Fission in Small Groups. Journal of Anthropological Research, 33, 452-473. [Google Scholar] [CrossRef
[20] Girvan, M. and Newman, M.E.J. (2002) Community Structure in Social and Biological Networks. Proceedings of the National Academy of Sciences of the United States of America, 99, 7821-7826. [Google Scholar] [CrossRef] [PubMed]
[21] Church, K.W. and Hanks, P. (1989) Word Association Norms, Mutual Information, and Lexicography. Computational linguistics, 16, 22-29. [Google Scholar] [CrossRef
[22] Rand, W.M. (1971) Objective Criteria for the Evaluation of Clustering Methods. Journal of the American Statistical Association, 66, 846-850. [Google Scholar] [CrossRef
[23] Hennig, C. (2007) Cluster-Wise Assessment of Cluster Stability. Computational Statistics & Data Analysis, 52, 258-271. [Google Scholar] [CrossRef
[24] Newman, M.E. and Girvan, M. (2004) Finding and Evaluating Community Structure in Networks. Physical Review E, 69, Article ID: 026113. [Google Scholar] [CrossRef