基于强化学习的聚类算法
Clustering Algorithm Based on Reinforcement Learning
摘要: 创新性地将强化学习技术引入聚类算法中,旨在解决传统聚类方法面临的两大难题:初始聚类中心选择的不确定性以及计算过程中欧氏距离划分样本导致的高时间复杂度。通过引入强化学习的奖惩机制,设计了一种基于“代理”Agent的行为选择策略,有效替代了传统的欧氏距离计算过程,从而消除了初始聚类中心对算法稳定性的潜在影响,并大幅提升了算法的收敛速度。提出了一种全新的基于强化学习的聚类算法,不仅在数学上严谨证明了其收敛性,而且在实际应用中展现了显著优势。通过数值实验验证,该算法在聚类准确率上较传统方法有明显提升,同时在算法性能上也表现出更加优越的特点,这一研究对于提升数据处理效率和准确性具有重要意义。
Abstract: Innovatively introducing reinforcement learning techniques into clustering algorithms, this research aims to address two major challenges faced by traditional clustering methods: the uncertainty in selecting initial cluster centers and the high time complexity caused by the Euclidean distance metric in sample classification. By introducing the reward-punishment mechanism of reinforcement learning, this paper designs a behavior selection strategy based on an “agent,” effectively replacing the traditional Euclidean distance calculation process. This approach eliminates the potential impact of initial cluster centers on the stability of the algorithm and significantly improves the convergence speed. A novel clustering algorithm based on reinforcement learning is proposed, which not only rigorously proves its convergence mathematically but also demonstrates significant advantages in practical applications. Through numerical experiments, it is verified that the algorithm achieves significantly higher clustering accuracy compared to traditional methods, while also exhibiting superior algorithm performance. This research is of great significance for improving the efficiency and accuracy of data processing.
文章引用:李佳辉, 徐应涛, 张莹. 基于强化学习的聚类算法[J]. 计算机科学与应用, 2024, 14(7): 114-120. https://doi.org/10.12677/csa.2024.147169

参考文献

[1] Hashem, I.A.T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A. and Ullah Khan, S. (2015) The Rise of “Big Data” on Cloud Computing: Review and Open Research Issues. Information Systems, 47, 98-115. [Google Scholar] [CrossRef
[2] Shah, G., Shah, A. and Shah, M. (2019) Panacea of Challenges in Real-World Application of Big Data Analytics in Healthcare Sector. Journal of Data, Information and Management, 1, 107-116. [Google Scholar] [CrossRef
[3] 尹刚, 王涛, 刘冰珣, 等. 面向开源生态的软件数据挖掘技术研究综述[J]. 软件学报, 2018, 29(8): 2258-2271.
[4] 李战怀, 于戈, 杨晓春. 人工智能赋能的数据管理、分析与系统专刊前言[J]. 软件学报, 2020, 31(3): 597-599.
[5] Cireşan, D., Meier, U., Masci, J. and Schmidhuber, J. (2012) Multi-Column Deep Neural Network for Traffic Sign Classification. Neural Networks, 32, 333-338. [Google Scholar] [CrossRef] [PubMed]
[6] Xu, D. and Tian, Y. (2015) A Comprehensive Survey of Clustering Algorithms. Annals of Data Science, 2, 165-193. [Google Scholar] [CrossRef
[7] Cao, H., Jia, L., Si, G. and Zhang, Y. (2013) A Clustering-Analysis-Based Membership Functions Formation Method for Fuzzy Controller of Ball Mill Pulverizing System. Journal of Process Control, 23, 34-43. [Google Scholar] [CrossRef
[8] 朱光宇, 张德颂. 基于强化学习的遗传算法求解一种新的钻削路径优化问题[J]. 控制与决策, 2024, 39(2): 697-704.
[9] 隋丽蓉, 高曙, 何伟. 基于多智能体深度强化学习的船舶协同避碰策略[J]. 控制与决策, 2023, 38(5): 1395-1402.
[10] 王玉荣, 万秋兰, 陈昊. 基于模糊聚类和学习自动机的多目标无功优化[J]. 电网技术, 2012, 36(7): 224-230.
[11] Yang, Y., Cui, Z., Jian, W., et al. (2012) Fuzzy C-Means Clustering and Opposition-Based Reinforcement Learning for Traffic Congestion Identification. Journal of Information & Computational Science, 9, 2441-2450.