基于粗糙集的主成分聚类方法
Principal Component Clustering Method Based on Rough Set
摘要: 针对当前大规模高维数据难处理以及传统的二支聚类造成误分类等问题。本文首先根据主成分分析方法对数据进行降维处理,并将粗糙集理论与聚类相结合,提出了基于粗糙集的主成分聚类方法。最后,对陕西省内11个城市的25个经济指标进行实验分析,根据经济指标的聚类结果将城市进行三划分,提高了聚类的准确率。
Abstract: In view of the difficulties in processing large-scale high-dimensional data and the misclassification caused by traditional two-way clustering, firstly, this paper reduces the dimension of the data according to the principal component analysis method, and combines the rough set theory with clustering, and the principal component clustering method based on rough set are proposed. Finally, 25 economic indicators of 11 cities in Shaanxi Province are experimentally analyzed. According to the clustering results of economic indicators, the cities are divided into three groups, which improves the accuracy of clustering.
文章引用:花遇春, 杨璇, 熊文丹. 基于粗糙集的主成分聚类方法[J]. 计算机科学与应用, 2022, 12(5): 1378-1388. https://doi.org/10.12677/CSA.2022.125137

参考文献

[1] Jain, A.K. (2009) Data Clustering: 50 Years beyond K-Means. Pattern Recognition Letters, 31, 651-666. [Google Scholar] [CrossRef
[2] Kaufamann, L. and Rousseeuw, P.J. (1987) Clustering by Means of Medoids. In: Dodge, Y., Ed., Statistical Data Analysis Based on the L1-Norm & Related Methods, North-Holland, Amsterdam, 405-416.
[3] Qian, W.N. and Zhou, A.Y. (2002) Analyzing Popular Clustering Al-gorithms from Different Viewpoints. Journal of Software, 32, 432-445.
[4] Grabmeier, J. and Rudolph, A. (2002) Techniques of Cluster Algorithms in Data Mining. Data Mining and Knowledge Discovery, 6, 303-360. [Google Scholar] [CrossRef
[5] Hoppner, F., Klawonn, F., Kruse, R., et al. (1999) Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. Wiley Press, Chichester, 1-48.
[6] Yao, Y.Y., Lingras, P., Wang, R.Z., et al. (2009) Interval Set Cluster Analysis: A Re-Formulation. Pro-ceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, Delhi, 16-18 December 2009, 398-405. [Google Scholar] [CrossRef
[7] Nie, B., Du, J., Liu, H., et al. (2009) Crowds’ Classification Using Hierarchical Cluster, Rough Sets, Principal Component Analysis and Its Combination. 2009 International Forum on Computer Science-Technology and Applications, Chongqing, 25-27 December 2009, 287-290. [Google Scholar] [CrossRef
[8] Zhao, L. and Guo, Z. (2011) Face Recognition Method Based on Adaptively Weighted Block-Two Dimensional Principal Component Analysis. Third International Conference on Computational Intelligence, Communication Systems and Networks, CICSyN 2011, Bali, 26-28 July, 2011, 22-25.
[9] Zhang, X. and Ren, X. (2011) Two Dimensional Principal Component Analysis Based Independent Component Analysis for Face Recognition. 2011 International Conference on Mul-timedia Technology, Hangzhou, 26-28 July 2011, 934-936. [Google Scholar] [CrossRef
[10] Liu, G., Song, X. and Zhao, X. (2007) Rough Sets and Zadeh’s Extension Principles. Proceedings-2007 IEEE International Conference on Granular Computing, Fremont, 2-4 November 2007, 180-185. [Google Scholar] [CrossRef
[11] 苗夺谦, 李道国. 粗糙集理论、算法与应用[M]. 北京: 清华大学出版社, 2008.
[12] Pawlak, Z. and Skowron, A. (2007) Rudiments of Rough Sets. Information Sciences, 177, 3-27. [Google Scholar] [CrossRef
[13] Pawlak, Z. (1984) Rough Classification. International Journal of Man-Machine Studies, 20, 469-483. [Google Scholar] [CrossRef
[14] Yao, Y.Y. (2015) Rough Sets and Three-Way Deci-sions. International Conference on Rough Sets & Knowledge Technology, Tianjin, 20-23 November 2015, 62-73. [Google Scholar] [CrossRef
[15] 王平心, 刘强, 杨习贝, 等. 基于动态邻域的三支聚类分析[J]. 计算机科学, 2018, 45(1): 62-66+89.