Semi-Supervised Fuzzy C-Means Algorithm Based on Laplace Constraint

doi:10.12677/AAM.2021.102049


Funding: supported by the National Natural Science Foundation of China
Authors: Ning Zhang (张宁), Yingcang Ma (马盈仓), Hengdong Zhu (朱恒东): School of Science, Xi'an Polytechnic University, Xi'an, Shaanxi
Keywords: Laplacian Constraint; Prior Information; Membership; Clustering

Abstract: Fuzzy clustering, one of the classical unsupervised methods, easily falls into a local optimum when no prior information is provided. To combine supervised and unsupervised learning and to train on labeled and unlabeled data together, this paper imposes a Laplace constraint on the objective function. The resulting memberships are verified to remain greater than or equal to zero, which shows that the algorithm is valid. On this basis, prior information is added so that the algorithm can reasonably and effectively use the class information of the partially labeled samples to influence the unlabeled samples, thereby improving the performance of the semi-supervised clustering algorithm. Finally, the two improved algorithms proposed in the paper are compared with the original fuzzy c-means (FCM) algorithm on clustering indices, and the results show that they cluster well.
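For orientation, the following is a minimal sketch of the standard FCM baseline that the abstract compares against. It is not the Laplace-constrained or semi-supervised variant proposed in the paper, whose objective functions are not stated on this page. The function name `fcm`, the parameters `m`, `iters`, `tol`, and `seed`, and the toy data are assumptions made for illustration only.

```python
import random


def fcm(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means (illustrative sketch): returns (centers, memberships)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships; each row is normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        s = sum(row)
        u.append([v / s for v in row])
    centers = []
    for _ in range(iters):
        # Centre update: mean of the points weighted by u_ik ** m.
        centers = []
        for k in range(c):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append([sum(w[i] * points[i][d] for i in range(n)) / total
                            for d in range(dim)])
        # Membership update: u_ik proportional to d_ik ** (-2 / (m - 1)).
        shift = 0.0
        for i in range(n):
            d2 = [sum((points[i][d] - centers[k][d]) ** 2 for d in range(dim))
                  for k in range(c)]
            if min(d2) == 0.0:
                # Point coincides with a centre: put all mass on that centre.
                zeros = [k for k in range(c) if d2[k] == 0.0]
                new = [1.0 / len(zeros) if k in zeros else 0.0 for k in range(c)]
            else:
                inv = [x ** (-1.0 / (m - 1.0)) for x in d2]
                s = sum(inv)
                new = [v / s for v in inv]
            shift = max(shift, max(abs(a - b) for a, b in zip(new, u[i])))
            u[i] = new
        if shift < tol:
            break
    return centers, u


# Toy data (hypothetical): two small, well-separated groups in the plane.
data = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.0), (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]
centers, u = fcm(data, c=2)
labels = [row.index(max(row)) for row in u]
```

In this baseline the memberships are nonnegative and sum to one for every sample by construction. The abstract indicates that the paper verifies the nonnegativity property separately for its constrained objective.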

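Cite this article: Zhang, N., Ma, Y.C. and Zhu, H.D. (2021) Semi-Supervised Fuzzy C-Means Algorithm Based on Laplace Constraint. Advances in Applied Mathematics, 10(2), 433-443. https://doi.org/10.12677/AAM.2021.102049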

