基于属性集成的三支聚类算法
Three-Way Clustering Based on Attribute In-tegration
DOI: 10.12677/AAM.2022.1110777, PDF,    科研立项经费支持
作者: 薛文龙, 王平心:江苏科技大学理学院,江苏 镇江
关键词: 三支聚类属性集成KmeansThere-Way Clusters Attribute Integration Kmeans
摘要: 在三支聚类中,每一个类簇由核心域、边界域、琐碎域三个集合来表示,体现出聚类中的不确定性,降低了决策风险。针对属性数较多的数据集,本文提出了一种基于属性集成的三支聚类算法。该算法通过对样本随机选择部分属性使用kmeans算法进行聚类,并根据每次聚类的结果采用集成策略将元素划分至核心域、边界域或琐碎域中。在UCI等数据集上测试的结果表明,该算法相较于传统的kmeans算法,可以较好地提升聚类结果的ACC、DBI和AS指标。
Abstract: In the three-way clustering, each cluster is represented by three sets of core domains, boundary domains, and trivial domains, reflecting the uncertainty in the cluster, and reducing the risk of de-cision-making. In this paper, a three-branch clustering method based on attribute integration is proposed for datasets with a large number of attributes. The algorithm uses k-means algorithm to cluster by randomly selecting some attributes, and classifies elements into core, boundary, or trivial domains based on the results of each clustering. Tested on the UCI and other dataset, the results obtained are better than the conventional clustering method in ACC, DBI and AS indicators, which can improve the clustering effect.
文章引用:薛文龙, 王平心. 基于属性集成的三支聚类算法[J]. 应用数学进展, 2022, 11(10): 7317-7324. https://doi.org/10.12677/AAM.2022.1110777

参考文献

[1] Xu, D. and Tian, Y. (2015) A Comprehensive Survey of Clustering Algorithms. Annals of Data Science, 2, 165-193. [Google Scholar] [CrossRef
[2] Hoppner, F., Klawonn, F., Kruse, R., et al. (1999) Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. Wiley, New York.
[3] Lingras, P. and West, C. (2004) Interval Set Clustering of Web Users with Rough k-Means. Journal of Intelligent Information Systems, 23, 5-16. [Google Scholar] [CrossRef
[4] Yao, Y.Y., Lingras, P., Wang, R.Z., et al. (2009) Interval Set Cluster Analysis: A Re-Formulation. In: Sakai, H., Chakraborty, M.K., Hassanien, A.E., et al., Eds., Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, Springer, Berlin, 398-405. [Google Scholar] [CrossRef
[5] Yu, H., Chu, S.S. and Yang, D.C. (2012) Autonomous Knowledge-Oriented Clustering Using Decision-Theoretic Rough Set Theory. Fundamenta Informaticae, 115, 141-156. [Google Scholar] [CrossRef
[6] Yu, H., Liu, Z.G. and Wang, G.Y. (2014) An Automatic Method to De-termine the Number of Clusters Using Decision Theoretic Rough Set. International Journal of Approximate Reasoning, 55, 101-115. [Google Scholar] [CrossRef
[7] Yu, H., Chu, S.S. and Yang, D.C. (2010) Autonomous Knowledge-Oriented Clustering Using Decision-Theoretic Rough Set Theory. In: Yu, J., Greco, S., Lingras, P., et al., Eds., Rough Set and Knowledge Technology, Springer, Berlin, 687-694. [Google Scholar] [CrossRef
[8] Yao, Y.Y. (2010) Three-Way Decisions with Probabilistic Rough Sets. Information Sciences, 180, 341-353. [Google Scholar] [CrossRef
[9] Yao, Y.Y. (2011) The Superiority of Three-Way Decisions in Probabilistic Rough Set Models. Information Sciences, 181, 1086-1096. [Google Scholar] [CrossRef
[10] Yao, Y.Y. (2012) An Outline of a Theory of Three-Way Decisions. In: Yao, J., Ed., Rough Sets and Current Trends in Computing, Springer, Berlin, 1-17. [Google Scholar] [CrossRef
[11] Little, M.A., McSharry, P.E., Hunter, E.J. and Ramig, L.O. (2009) Suitability of Dysphonia Measurements for Telemonitoring of Parkinson’s Disease. IEEE Transactions on Bio-medical Engineering, 56, 1015-1022. [Google Scholar] [CrossRef
[12] Tsanas, A., Little, M.A., Fox, C. and Ramig, L.O. (2014) Ob-jective Automatic Assessment of Rehabilitative Speech Treatment in Parkinson Disease. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 22, 181-190. [Google Scholar] [CrossRef
[13] Fänti, P. and Sieranoja, S. (2018) K-Means Properties on Six Clustering Benchmark Datasets. Applied Intelligence, 48, 4743-4759. [Google Scholar] [CrossRef
[14] Veenman, C.J., Reinders, M.J.T. and Backer, E. (2002) A Maxi-mum Variance Cluster Algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 1273-1280. [Google Scholar] [CrossRef
[15] Schölkopf, B., Platt, J. and Hofmann, T. (2007) A Local Learning Approach for Clustering. In: International Conference on Neural Information Processing Systems, MIT Press, Vancouver, 1529-1536. [Google Scholar] [CrossRef
[16] Fahad, A., Alshatri, N., Tari, Z., et al. (2014) A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis. IEEE Transactions on Emerging Topics in Computing, 2, 267-279. [Google Scholar] [CrossRef