面向延迟容忍的时序约束多无人机任务分配方法
A Delay-Tolerant and Temporal-Ordering-Constrained Task Allocation Method for Multi-UAV Systems
DOI: 10.12677/mos.2025.145408, PDF,    国家自然科学基金支持
作者: 陈九澳:上海理工大学光电信息与计算机工程学院,上海;何杏宇:上海理工大学光电信息与计算机工程学院,上海;上海理工大学出版学院,上海
关键词: 多无人机系统任务分配深度强化学习时序约束动态时间窗Multi-UAV Systems Task Allocation Deep Reinforcement Learning Temporal-Ordering Constraint Dynamic Time Window
摘要: 现有的面向时序约束的多无人机任务分配方法大多假定任务执行是非延迟容忍的情况,这不仅降低了任务被执行的机会,而且极大地限制了多无人机的竞争与协作灵活度,从而影响任务完成率。为此,文章将提出一种面向延迟容忍的时序约束多无人机任务分配方法,并解决延迟容忍带来的任务执行效益时变性问题。具体而言,首先设计了一种动态时间窗机制,为无人机任务执行提供弹性空间,允许无人机在任务可执行初始时间之前到达进行悬停等待或在预计截止时间后超时执行任务,并基于此定义了随动态时间窗变化的任务执行收益。接着,构建强化学习模型以引导无人机进行任务选择,然后定义无人机与任务的匹配因子,设计根据匹配因子、悬停和超时情况解决任务选择冲突的共识机制。最后采用多智能体深度确定性策略梯度算法(MADDPG)对强化学习模型进行求解。实验结果表明,与现有方法相比,本文所提的方法在系统效益和任务完成率方面均表现出显著优势。
Abstract: The majority of existing multi-UAV task allocation approaches under temporal-ordering-constrained are based on the assumption of non-delay-tolerant task execution, which not only reduces the task execution chances but also greatly limits the competition and collaboration flexibility among multiple UAVs, thereby affecting the task completion rate. To address this problem, this paper will propose a delay-tolerant and temporal-ordering-constrained task allocation method, then solve the time-varying problem of task execution benefit related to delay tolerance. In this method, firstly, a dynamic time window mechanism is designed to provide an elastic space for UAV task execution, which allows UAVs to hover and wait for a task before its initial executable time and execute the task after its expected deadline and the time-varying task execution benefit related to the dynamic execution window is also defined; Secondly, a reinforcement learning model is constructed to guide the task selection of UAVs, a matching factor between task and UAVs is defined, and a consensus mechanism is provided to solve task selection conflicts between UAVs based on the matching factor, the waiting and delay time of UAVs. Finally, the multi-agent deep deterministic policy gradient algorithm is used to solve the reinforcement learning model. Experimental results show that, compared with existing methods, the method proposed in this paper exhibits significant advantages in terms of system benefit and task completion rate.
文章引用:陈九澳, 何杏宇. 面向延迟容忍的时序约束多无人机任务分配方法[J]. 建模与仿真, 2025, 14(5): 472-487. https://doi.org/10.12677/mos.2025.145408

参考文献

[1] Wang, Z., Liu, L., Long, T. and Wen, Y. (2018) Multi-UAV Reconnaissance Task Allocation for Heterogeneous Targets Using an Opposition-Based Genetic Algorithm with Double-Chromosome Encoding. Chinese Journal of Aeronautics, 31, 339-350. [Google Scholar] [CrossRef
[2] Wei, Z. and Zhao, X. (2022) Multi-UAVs Cooperative Reconnaissance Task Allocation under Heterogeneous Target Values. IEEE Access, 10, 70955-70963. [Google Scholar] [CrossRef
[3] Yue, L., Yang, R., Zuo, J., Zhang, Y., Li, Q. and Zhang, Y. (2022) Unmanned Aerial Vehicle Swarm Cooperative Decision-Making for SEAD Mission: A Hierarchical Multiagent Reinforcement Learning Approach. IEEE Access, 10, 92177-92191. [Google Scholar] [CrossRef
[4] Chakraa, H., Guérin, F., Leclercq, E. and Lefebvre, D. (2023) Optimization Techniques for Multi-Robot Task Allocation Problems: Review on the State-of-the-Art. Robotics and Autonomous Systems, 168, Article ID: 104492. [Google Scholar] [CrossRef
[5] Jia, Z., Yu, J., Ai, X., Xu, X. and Yang, D. (2018) Cooperative Multiple Task Assignment Problem with Stochastic Velocities and Time Windows for Heterogeneous Unmanned Aerial Vehicles Using a Genetic Algorithm. Aerospace Science and Technology, 76, 112-125. [Google Scholar] [CrossRef
[6] Chen, Y., Yang, D. and Yu, J. (2018) Multi-UAV Task Assignment with Parameter and Time-Sensitive Uncertainties Using Modified Two-Part Wolf Pack Search Algorithm. IEEE Transactions on Aerospace and Electronic Systems, 54, 2853-2872. [Google Scholar] [CrossRef
[7] Ye, F., Chen, J., Tian, Y. and Jiang, T. (2020) Cooperative Task Assignment of a Heterogeneous Multi-UAV System Using an Adaptive Genetic Algorithm. Electronics, 9, Article 687. [Google Scholar] [CrossRef
[8] Chen, L., Liu, W. and Zhong, J. (2022) An Efficient Multi-Objective Ant Colony Optimization for Task Allocation of Heterogeneous Unmanned Aerial Vehicles. Journal of Computational Science, 58, Article ID: 101545. [Google Scholar] [CrossRef
[9] Yu, X., Gao, X., Wang, L., Wang, X., Ding, Y., Lu, C., et al. (2022) Cooperative Multi-UAV Task Assignment in Cross-Regional Joint Operations Considering Ammunition Inventory. Drones, 6, Article 77. [Google Scholar] [CrossRef
[10] Cui, Y., Dong, W., Hu, D. and Liu, H. (2022) The Application of Improved Harmony Search Algorithm to Multi-UAV Task Assignment. Electronics, 11, Article 1171. [Google Scholar] [CrossRef
[11] Zhang, R., Ren, H., Li, X. and Ding, Y. (2023) UAV Cluster Task Assignment Algorithm Based on Improved Artificial Gorilla Troops Optimizer. IEEE Access, 11, 135133-135146. [Google Scholar] [CrossRef
[12] Cui, Y. (2024) An Improved Chaotic Self-Adapting Monkey Algorithm for Multi-UAV Task Assignment. IEEE Journal on Miniaturization for Air and Space Systems, 5, 9-15. [Google Scholar] [CrossRef
[13] Xiao, Y., Li, Y., Liu, H., Chen, Y., Wang, Y. and Wu, G. (2024) Adaptive Large Neighborhood Search Algorithm with Reinforcement Search Strategy for Solving Extended Cooperative Multi Task Assignment Problem of UAVs. Information Sciences, 679, Article ID: 121068. [Google Scholar] [CrossRef
[14] Lei, K., Guo, P., Zhao, W., Wang, Y., Qian, L., Meng, X., et al. (2022) A Multi-Action Deep Reinforcement Learning Framework for Flexible Job-Shop Scheduling Problem. Expert Systems with Applications, 205, Article ID: 117796. [Google Scholar] [CrossRef
[15] Song, W., Chen, X., Li, Q. and Cao, Z. (2023) Flexible Job-Shop Scheduling via Graph Neural Network and Deep Reinforcement Learning. IEEE Transactions on Industrial Informatics, 19, 1600-1610. [Google Scholar] [CrossRef
[16] Su, C., Zhang, C., Xia, D., Han, B., Wang, C., Chen, G., et al. (2023) Evolution Strategies-Based Optimized Graph Reinforcement Learning for Solving Dynamic Job Shop Scheduling Problem. Applied Soft Computing, 145, Article ID: 110596. [Google Scholar] [CrossRef
[17] Jinqiang, H., Husheng, W., Renjun, Z., Rafik, M. and Xuanwu, Z. (2021) Self-organized Search-Attack Mission Planning for UAV Swarm Based on Wolf Pack Hunting Behavior. Journal of Systems Engineering and Electronics, 32, 1463-1476. [Google Scholar] [CrossRef
[18] Zhang, J., Cui, Y. and Ren, J. (2023) Dynamic Mission Planning Algorithm for UAV Formation in Battlefield Environment. IEEE Transactions on Aerospace and Electronic Systems, 59, 3750-3765. [Google Scholar] [CrossRef
[19] Deng, H., Huang, J., Liu, Q., Zhao, T., Zhou, C. and Gao, J. (2023) A Distributed Collaborative Allocation Method of Reconnaissance and Strike Tasks for Heterogeneous UAVs. Drones, 7, Article 138. [Google Scholar] [CrossRef
[20] Ma, Y., Li, X., Wang, H., Kang, Y., Cao, T., Zhang, Y., et al. (2024) Multi-UAVs Collaborative Task Allocation Based on Improved Consensus-Based Grouping Algorithm. 2024 IEEE 18th International Conference on Control & Automation (ICCA), Reykjavík, 18-21 June 2024, 850-855. [Google Scholar] [CrossRef
[21] Liang, Z., Li, Q. and Fu, G. (2023) Multi-UAV Collaborative Search and Attack Mission Decision-Making in Unknown Environments. Sensors, 23, Article 7398. [Google Scholar] [CrossRef] [PubMed]