基于双Q学习算法的可重入混合流水晶圆车间调度与维护联合优化研究
Research on Joint Optimization of Scheduling and Maintenance of Crystal Circle Shop for Re-Entrant Hybrid Flow Based on Double Q Learning Algorithm
DOI: 10.12677/mos.2024.136587, PDF,    国家自然科学基金支持
作者: 王艺蜚, 刘勤明*, 叶春明, 汪宇杰:上海理工大学管理学院,上海;倪静然:北部湾大学东密歇根联合工程学院,广西 钦州
关键词: 可重入调度模型双Q学习多目标优化模型设备役龄可重入设备维护模型Reentrant Scheduling Constraint Model Double-Q Learning Multi-Objective Model Service Age Reentrant Equipment Maintenance Model
摘要: 针对可重入晶圆车间调度与预维护问题的复杂性,以及近年来人工智能算法的飞速发展和启发式算法在优化复杂生产系统上的不足,本文提出了一种基于双Q学习的可重入晶圆车间调度与维护联合优化模型。首先,考虑到可重入工序的影响,在调度阶段建立可重入调度约束模型。考虑可重入工序以及设备维护后的役龄更新对设备维护频次的影响,并通过设立维护阈值来对设备进行预维护以及机会维护,建立考虑役龄更新的可重入晶圆车间设备维护模型。其次,结合实际生产系统中多因素的影响,以最小完工时间、最低能源消耗以及最低总维护成本为目标函数进行多目标优化。最后,以生产数据为基础,通过双Q学习算法来定义状态和动作,并设置建立奖励函数,采用贪婪策略随机选择动作来跳出局部最优,并通过调整役龄更新因子来进行灵敏度分析验证算法的鲁棒性。经过结果分析以及对比分析,基于双Q学习算法所建立模型的结果均取得了较好的优化结果,并且具有较强的鲁棒性,证明了所提出的基于双Q学习算法的可重入晶圆车间调度与维护联合优化模型的有效性。
Abstract: Due to the complexity of reentrant wafer workshop scheduling and preventive maintenance issues, as well as the rapid development of artificial intelligence algorithms in recent years and the limitations of heuristic algorithms in optimizing complex production systems, this paper proposes a reentrant wafer shop scheduling and maintenance joint optimization model based on double-Q learning. First, considering the influence of reentrant processes, a reentrant scheduling constraint model is established in the scheduling stage. Considering the influence of reentrant process and service age update after equipment maintenance on equipment maintenance frequency, and setting up maintenance thresholds for pre-maintenance and opportunity maintenance of equipment, a reentrant wafer shop equipment maintenance model considering service age update is established. Secondly, the minimum completion time, minimum energy consumption and minimum total maintenance cost are taken as the objective functions for multi-objective optimization, taking into account the influence of multiple factors in the actual production system. Finally, based on the production data, the double-Q learning algorithm is used to define the states and actions, and the reward function is set up to establish a greedy strategy to randomly select the actions to jump out of the local optimum, and the sensitivity analysis is carried out by adjusting the service age update factor to verify the robustness of the algorithm. After the result analysis and comparative analysis, the results have achieved better optimization results and have strong robustness, which proves the effectiveness of the proposed re-entry wafer shop scheduling and maintenance joint model.
文章引用:王艺蜚, 刘勤明, 倪静然, 叶春明, 汪宇杰. 基于双Q学习算法的可重入混合流水晶圆车间调度与维护联合优化研究[J]. 建模与仿真, 2024, 13(6): 6416-6431. https://doi.org/10.12677/mos.2024.136587

参考文献

[1] 吴秀丽, 李雨馨, 匡源, 等. 改进迭代贪婪算法求解可重入流水车间调度问题[J]. 计算机集成制造系统, 2024, 30(7): 2364-2380.
[2] 董君, 叶春明. 具有学习效应的半导体晶圆制造绿色车间调度问题研究[J]. 运筹与管理, 2021, 30(4): 217-223.
[3] 雷德明, 刘敬裕. 基于协作蛙跳算法的可重入混合流水车间调度[J]. 华中科技大学学报(自然科学版), 2023, 51(5): 125-130.
[4] Siraskar, R., Kumar, S., Patil, S., Bongale, A. and Kotecha, K. (2023) Reinforcement Learning for Predictive Maintenance: A Systematic Technical Review. Artificial Intelligence Review, 56, 12885-12947. [Google Scholar] [CrossRef
[5] Chen, Y., Liu, Y. and Xiahou, T. (2022) A Deep Reinforcement Learning Approach to Dynamic Loading Strategy of Repairable Multistate Systems. IEEE Transactions on Reliability, 71, 484-499. [Google Scholar] [CrossRef
[6] 赵子夜, 陈晓慧, 安友军, 等. 刚柔混合约束下柔性流水车间生产调度与设备维护集成优化[J]. 计算机集成制造系统, 2024: 1-20.
[7] 江雨燕, 马宁, 李艳, 等. 考虑动态预维护与绿色调度的协同优化问题[J]. 系统仿真学报, 2024: 1-16.
[8] Li, X., Ran, Y., Zhang, G., et al. (2023) Sequential Preventive Maintenance Strategy Considering Difference of Maintenance Effect. Journal of Shanghai Jiaotong University, 57, 1522.
[9] Yu, M., Li, T. and Ma, J. (2022) Joint Optimization Method of Production Scheduling for Prefabricated Components Based on Preventive Maintenance. 2022 41st Chinese Control Conference (CCC), Hefei, 25-27 July 2022, 1940-1944. [Google Scholar] [CrossRef
[10] Kung, L. and Liao, Z. (2022) Optimization for a Joint Predictive Maintenance and Job Scheduling Problem with Endogenous Yield Rates. IEEE Transactions on Automation Science and Engineering, 19, 1555-1566. [Google Scholar] [CrossRef