Research on Accelerated Methods for Deep Clustering Model Training
DOI: 10.12677/csa.2024.1410205
Authors: Chen Aotian (陈傲天), Li Shaoyong (李少勇): School of Electronics and Information Engineering, Wuyi University, Jiangmen, Guangdong
Keywords: Deep K-Means, Random Sampling Strategy, Orthogonal Transform Features
Abstract: Deep clustering combines deep learning with traditional clustering methods to handle high-dimensional data effectively, and it has attracted wide attention in data processing. However, the heavy computational cost of deep clustering models often holds back both research and practical application. To address the long training time of these models, this paper explores two ways of improving training efficiency: reducing the time of a single iteration, and reducing the number of iterations needed to reach a desired accuracy. Two methods are proposed accordingly: Deep K-means based on Random Sampling Strategy (RSDK) and Two Stage Deep K-means based on Orthogonal Transform Features (OTDK). RSDK applies a random sampling strategy so that less data is processed in each epoch; each epoch therefore takes less time, and total training time falls for a fixed number of epochs. OTDK modifies the training strategy, the loss function, and the network architecture, with the aim of reaching the expected clustering result after fewer parameter updates. Both algorithms were evaluated on three datasets: MNIST, F-MNIST, and CIFAR-10. The training time of RSDK decreases as the sampling rate decreases. On MNIST, OTDK reaches high clustering accuracy with comparatively few parameter updates. On the other two datasets its clustering accuracy is not yet at a strong level, but it does not differ noticeably from that of RSDK, and the model converges faster.
Cite this article: Chen, A. and Li, S. (2024) Research on Accelerated Methods for Deep Clustering Model Training. Computer Science and Application, 14(10), 85-101. https://doi.org/10.12677/csa.2024.1410205
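
The per-epoch random sampling idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' RSDK implementation: the function names, the `sampling_rate` parameter, and the choice to draw a fresh subset without replacement at every epoch are assumptions made for illustration, and the model update itself is left as a placeholder comment. The sketch only shows why a lower sampling rate means fewer mini-batches, and so less work, per epoch.

```python
import numpy as np


def sample_epoch_indices(n_samples, sampling_rate, rng):
    """Draw a random subset of sample indices (no repeats) for one epoch."""
    if not 0.0 < sampling_rate <= 1.0:
        raise ValueError("sampling_rate must be in (0, 1]")
    n_keep = max(1, int(round(n_samples * sampling_rate)))
    return rng.choice(n_samples, size=n_keep, replace=False)


def iter_minibatches(indices, batch_size):
    """Yield consecutive slices of the sampled indices."""
    for start in range(0, len(indices), batch_size):
        yield indices[start:start + batch_size]


def count_updates(n_samples, sampling_rate, batch_size, n_epochs, seed=0):
    """Count mini-batch steps over all epochs (hypothetical training loop)."""
    rng = np.random.default_rng(seed)
    n_batches = 0
    for _ in range(n_epochs):
        # Assumption: a new random subset is drawn at the start of each epoch.
        epoch_indices = sample_epoch_indices(n_samples, sampling_rate, rng)
        for batch in iter_minibatches(epoch_indices, batch_size):
            # Placeholder: the network forward/backward pass and the
            # K-means centroid update for `batch` would go here.
            n_batches += 1
    return n_batches


if __name__ == "__main__":
    for rate in (1.0, 0.5, 0.25):
        steps = count_updates(n_samples=1000, sampling_rate=rate,
                              batch_size=100, n_epochs=3)
        print(f"sampling_rate={rate}: {steps} mini-batch steps")
```

With the number of epochs fixed, the number of mini-batch steps scales roughly with the sampling rate, which is the mechanism the abstract credits for the shorter training time. How much accuracy is kept at a given rate is an empirical question that this sketch does not address.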
