基于最大熵的深度模糊聚类方法研究
Research on Deep Fuzzy Clustering Based on Maximum Entropy
DOI: 10.12677/csa.2024.144097, PDF,   
作者: 黄皓宇, 李少勇, 陈傲天:五邑大学智能制造学部,广东 江门
关键词: 模糊聚类深度聚类可解释性Fuzzy Clustering Deep Clustering Interpretability
摘要: 高维数据聚类是数据挖掘和模式识别研究领域的一项关键且具有挑战性的任务。深度聚类方法借助神经网络高效地特征提取能力,往往比传统聚类方法具有更好的性能。因此,本文提出了一种基于最大熵的深度模糊聚类算法(DFMEC)。该算法通过构建神经网络来表示模糊聚类,具有算法模型的可解释性。联合深度自动编码器模型,DFMEC通过梯度下降实现了深度特征学习和聚类中心的同步更新,解决了硬聚类由于其离散性而不能更新梯度的问题。此外,在所提出方法的目标函数的优化中,添加了基于模糊分配的最大熵正则项来提高聚类模型的鲁棒性。在各个高维数据集上的综合实验表明,与其他先进的深度聚类方法相比,该方法在重构表示和聚类质量方面都有更好的性能,大量实验的深入分析证明了这一点。
Abstract: Clustering of high-dimensional data is a crucial but challenging task in data mining and pattern recognition. The deep clustering usually have better performance than traditional methods due to more efficient feature extraction capability of neural networks. Therefore, this paper proposes a deep fuzzy clustering algorithm based on maximum entropy (DFMEC). The method represents fuzzy clustering by constructing neural networks and has the interpretability of the model. Jointly the deep autoencoder model, DFMEC through the network gradient realizes deep feature extraction and updating the clustering centers simultaneously, solving the problem that hard clustering cannot update gradients due to the discrete. Moreover, in the optimization of the objective function of the proposed method, a maximum entropy regularity term based on fuzzy assignment is added to improve the robustness of the clustering model. Comprehensive experiments on various high-dimensional datasets show that, in contrast to other advanced deep clustering methods, the method has better performance in terms of reconstruction representation and clustering quality, it is demonstrated by extensive experimental analyses.
文章引用:黄皓宇, 李少勇, 陈傲天. 基于最大熵的深度模糊聚类方法研究[J]. 计算机科学与应用, 2024, 14(4): 276-289. https://doi.org/10.12677/csa.2024.144097

参考文献

[1] Cios, K.J., Pedrycz, W. and Swiniarski, R.W. (2012) Data Mining Methods for Knowledge Discovery. Springer Science & Business Media, New York.
[2] Boopathi, S., Pandey, B.K. and Pandey, D. (2023) Advances in Artificial Intelligence for Image Processing: Techniques, Applications, and Optimization. In: Pandey, B.K., Pandey, D., Anand, R., Mane, D.S. and Nassa, V.K., Eds., Handbook of Research on Thrust Technologies Effect on Image Processing, IGI Global, Hershey, 73-95. [Google Scholar] [CrossRef
[3] Paolanti, M. and Frontoni, E. (2020) Multidisciplinary Pattern Recognition Applications: A Review. Computer Science Review, 37, Article ID: 100276. [Google Scholar] [CrossRef
[4] Zadeh, L.A. (1965) Fuzzy Sets. Information and Control, 8, 338-353. [Google Scholar] [CrossRef
[5] Bezdek, J.C., Ehrlich, R. and Full, W. (1984) FCM: The Fuzzy C-Means Clustering Algorithm. Computers & Geosciences, 10, 191-203. [Google Scholar] [CrossRef
[6] Pimentel, B.A., Silva, R. and Costa, J. (2022) Fuzzy C-Means Clustering Algorithms with Weighted Membership and Distance. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 30, 567-594. [Google Scholar] [CrossRef
[7] Hashemi, S.E., Gholian-Jouybari, F. and Hajiaghaei-Keshteli, M. (2023) A Fuzzy C-Means Algorithm for Optimizing Data Clustering. Expert Systems with Applications, 227, Article ID: 120377. [Google Scholar] [CrossRef
[8] Zadeh, L.A. (1971) Similarity Relations and Fuzzy Orderings. Information Sciences, 3, 177-200. [Google Scholar] [CrossRef
[9] Kriegel, H.P., Kröger, P. and Zimek, A. (2009) Clustering High-Dimensional Data: A Survey on Subspace Clustering, Pattern-Based Clustering, and Correlation Clustering. ACM Transactions on Knowledge Discovery from Data, 3, 1-58. [Google Scholar] [CrossRef
[10] Zeng, S., Wang, X., Duan, X., et al. (2020) Kernelized Mahalanobis Distance for Fuzzy Clustering. IEEE Transactions on Fuzzy Systems, 29, 3103-3117. [Google Scholar] [CrossRef
[11] Tan, D., Zhong, W., Jiang, C., et al. (2020) High-Order Fuzzy Clustering Algorithm Based on Multikernel Mean Shift. Neurocomputing, 385, 63-79. [Google Scholar] [CrossRef
[12] Zhu, X., Pedrycz, W. and Li, Z. (2017) Fuzzy Clustering with Nonlinearly Transformed Data. Applied Soft Computing, 61, 364-376. [Google Scholar] [CrossRef
[13] Rathore, P., Bezdek, J.C., Erfani, S.M., et al. (2018) Ensemble Fuzzy Clustering Using Cumulative Aggregation on Random Projections. IEEE Transactions on Fuzzy Systems, 26, 1510-1524. [Google Scholar] [CrossRef
[14] Long, C. and Kong, L. (2017) Fuzzy Clustering in High-Dimensional Approximated Feature Space. International Conference on Fuzzy Theory & Its Applications, Taichung, 9-11 November 2016, 1-6.
[15] Rathore, P., et al. (2018) A Rapid Hybrid Clustering Algorithm for Large Volumes of High Dimensional Data. IEEE Transactions on Knowledge and Data Engineering, 31, 641-654. [Google Scholar] [CrossRef
[16] Wen, Z., Hou, B., Wu, Q., et al. (2018) Discriminative Transformation Learning for Fuzzy Sparse Subspace Clustering. IEEE Transactions on Cybernetics, 48, 2218-2231.
[17] Ji, P., Zhang, T., Li, H., et al. (2017) Deep Subspace Clustering Networks. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 23-32.
[18] Peng, X., Feng, J., Zhou, J.T., et al. (2020) Deep Subspace Clustering. IEEE Transactions on Neural Networks and Learning Systems, 31, 5509-5521. [Google Scholar] [CrossRef
[19] Elhamifar, E. and Vidal, R. (2013) Sparse Subspace Clustering: Algorithm, Theory, and Applications. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 2765-2781. [Google Scholar] [CrossRef
[20] Xie, J., Girshick, R. and Farhadi, A. (2016) Unsupervised Deep Embedding for Clustering Analysis. International Conference on Machine Learning, 48, 478-487.
[21] Yang, B., Fu, X., Sidiropoulos, N.D., et al. (2017) Towards K-Means-Friendly Spaces: Simultaneous Deep Learning and Clustering. International Conference on Machine Learning, 70, 3861-3870.
[22] Guo, X., Gao, L., Liu, X., et al. (2017) Improved Deep Embedded Clustering with Local Structure Preservation. International Joint Conference on Artificial Intelligence, 17, 1753-1759. [Google Scholar] [CrossRef
[23] Li, F., Qiao, H. and Zhang, B. (2018) Discriminatively Boosted Image Clustering with Fully Convolutional Auto-Encoders. Pattern Recognition, 83, 161-173. [Google Scholar] [CrossRef
[24] Wang, Q., Cheng, J., Gao, Q., et al. (2020) Deep Multi-View Subspace Clustering with Unified and Discriminative Learning. IEEE Transactions on Multimedia, 23, 3483-3493. [Google Scholar] [CrossRef
[25] Duan, L., Ma, S., Aggarwal, C. and Sathe, S. (2021) Improving Spectral Clustering with Deep Embedding, Cluster Estimation and Metric Learning. Knowledge and Information Systems, 63, 675-694. [Google Scholar] [CrossRef
[26] Li, W., Wang, S., Guo, X., et al. (2023) Deep Graph Clustering with Multi-Level Subspace Fusion. Pattern Recognition, 134, Article ID: 109077. [Google Scholar] [CrossRef
[27] 曾春艳, 严康, 王志锋, 等. 深度学习模型可解释性研究综述[J]. 计算机工程与应用, 2021, 57(8): 1-9.
[28] Hinton, G.E. and Salakhutdinov, R.R. (2006) Reducing the Dimensionality of Data with Neural Networks. Science, 313, 504-507. [Google Scholar] [CrossRef] [PubMed]
[29] Sagheer, A. and Kotb, M. (2019) Unsupervised Pre-Training of a Deep LSTM-Based Stacked Autoencoder for Multivariate Time Series Forecasting Problems. Scientific Reports, 9, Article No. 19038. [Google Scholar] [CrossRef] [PubMed]
[30] Dang, H.V. and Ngo, D. (2019) Attentional Autoencoder for Weighted Implicit Collaborative Filtering. Proceedings of the 2019 2nd International Conference on Computational Intelligence and Intelligent Systems, Bangkok, 23-25 November 2019, 168-172. [Google Scholar] [CrossRef
[31] Karayiannis, N.B. (1994) MECA: Maximum Entropy Clustering Algorithm. Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference, Orlando, 26-29 June 1994, 630-635.
[32] Li, X., Xiong, H., Li, X., et al. (2022) Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond. Knowledge and Information Systems, 64, 3197-3234. [Google Scholar] [CrossRef
[33] MacQueen, J. (1967) Some Methods for Classification and Analysis of Multivariate Observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, 281-297.
[34] Lopez, R., Regier, J., Jordan, M.I., et al. (2018) Information Constraints on Auto-Encoding Variational Bayes. 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, 3-8 December 2018.
[35] Radford, A., Metz, L. and Chintala, S. (2015) Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. arXiv: 1511.06434.
[36] Yu, S., Liu, J., Han, Z., et al. (2021) Representation Learning Based on Autoencoder and Deep Adaptive Clustering for Image Clustering. Mathematical Problems in Engineering, 2021, Article ID: 3742536. [Google Scholar] [CrossRef