基于投票理论的三支聚类分析
Three-Way Clustering Analysis Based on Voting Theory
DOI: 10.12677/CSA.2019.912261, PDF,    国家自然科学基金支持
作者: 谷留全, 柴瑞林, 王平心*:江苏科技大学理学院,江苏 镇江
关键词: K-Means三支聚类标签匹配投票规则K-Means Three-Way Clustering Label Matching Voting Rules
摘要: 目前,大多数聚类方法是二支聚类,即每个对象要么属于一个类,要么不属于一个类,聚类结果具有清晰的边界。然而,将某些不确定的对象强制分配到某个类簇中会降低聚类结果精度。而三支聚类是一种重叠聚类方法,它采用核心域和边界域来表示每个类簇,较好地处理了含有不确定性信息对象的类别归属问题,同时有效地降低了决策风险。本文主要介绍一种三支决策聚类模型,并给出基于k-means的三支决策聚类算法作为实例进行分析。首先,通过已有的聚类算法得到相同数据集的不同二支决策聚类结果,然后对聚类成员的类簇标签进行匹配,最后制定投票规则确定样本的类簇归属。通过实验结果分析,本文所提出的聚类方法的聚类结果在各聚类性能度量指标上有了明显的提升。
Abstract: At present, most of existed clustering methods are two-way clustering which are based on the as-sumption that a cluster must be represented by a set with crisp boundary. However, assigning uncertain objects into a certain cluster will reduce the accuracy of clustering results. Three-way clustering is an overlapping clustering which describes each cluster by core region and fringe region. It handles the category problem of uncertain objects and reduces the decision risk effectively. This paper mainly introduces a model of three-way clustering, and gives three-way clustering algorithm based on k-means as an example for analysis. Firstly, different clustering results of the same data set are obtained by ensemble clustering. Then, the label matching method is used. Finally, the cluster of objects is determined according to the voting rules. Through the analysis of experimental results, it is verified that the effect of the clustering method has been significantly improved.
文章引用:谷留全, 柴瑞林, 王平心. 基于投票理论的三支聚类分析[J]. 计算机科学与应用, 2019, 9(12): 2349-2356. https://doi.org/10.12677/CSA.2019.912261

参考文献

[1] Li, J.H., Huang, C.C., Qi, J.J., Qia, Y.H. and Liu, W.Q. (2017) Three-Way Cognitive Concept Learning via Mul-ti-Granularity. Information Sciences, 378, 244-263. [Google Scholar] [CrossRef
[2] Lingras, P. and Yan, R. (2004) Interval Clustering Using Fuzzy and Rough Set Theory. IEEE Annual Meeting of the Fuzzy Information, Banff, 27-30 June 2004, 780-784. [Google Scholar] [CrossRef
[3] Liu, D., Yao, Y.Y. and Li, T.R. (2011) Three-Way Investment Decisions with Decision-Theoretic Rough Sets. International Journal of Computa-tional Intelligence Systems, 4, 66-74. [Google Scholar] [CrossRef
[4] Li, Y., Zhang, C. and Swanb, J.R. (2000) An Information Filtering on the Web and Its Application in Job Agent. Knowledge-Based Systems, 13, 285-296. [Google Scholar] [CrossRef
[5] Yao, Y.Y. (2010) Three-Way Decisions with Probabilistic Rough Sets. Information Sciences, 180, 341-353. [Google Scholar] [CrossRef
[6] Yao, Y.Y. (2011) The Superiority of Three-Way Decisions in Probabilistic Rough Set Models. Information Sciences, 181, 1080-1096. [Google Scholar] [CrossRef
[7] Yao, Y.Y. (2012) An Outline of a Theory of Three-Way Decisions. In: Yao, J., Yang, Y., et al., Eds., Rough Sets and Current Trends in Computing, Springer, Berlin, Heidelberg, Vol. 7413, 1-17. [Google Scholar] [CrossRef
[8] Yu, H., Chu, S.S. and Yang, D.C. (2010) Autonomous Knowledge-Oriented Clustering Using Decision-Theoretic Rough Set Theory. Rough Set and Knowledge Technology, In: Yu, J., Greco, S., Lingras, P., Wang, G. and Skowron, A., Eds., Springer, Berlin, Heidelberg, Vol. 6401, 687-694. [Google Scholar] [CrossRef
[9] Yu, H., Liu, Z.G. and Wang, G.Y. (2014) An Automatic Method to Determine the Number of Clusters Using Decision-Theoretic Rough Set. International Journal of Approxi-mate Reasoning, 55, 101-115. [Google Scholar] [CrossRef
[10] Yu, H., Zhang, C. and Wang, G.Y. (2016) A Tree-Based Incremen-tal Overlapping Clustering Method Using the Three-Way Decision Theory. Knowledge Based Systems, 91, 189-203. [Google Scholar] [CrossRef
[11] Yu, H., Jiao, P., Yao, Y.Y., et al. (2016) Detecting and Refining Overlapping Regions in Complex Networks with Three-Way Decisions. Information Sciences, 373, 21-41. [Google Scholar] [CrossRef
[12] 于洪. 三支聚类分析[J]. 数码设计, 2016, 5(1): 31-35.
[13] Macqueen, J. (1967) Some Methods for Classification and Analysis of Multivariate Observations. Proceed-ings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, 1967, 281-297.
[14] Schölkopf, B., Platt, J. and Hofmann, T. (2006) A Local Learning Approach for Clustering. International Conference on Neural Information Processing Systems, MIT Press, 2007, 1529-1536.
[15] Davies, D.L. and Bouldin, D.W. (1979) A Cluster Separation Measure. IEEE Transactions on Pattern Analysis & Machine Intelligence, 1, 224-227. [Google Scholar] [CrossRef