A Review of the Application of Hierarchical Reinforcement Learning in the Field of Drones

doi:10.12677/AIRR.2024.131008
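Authors: Yongxiang Yang (杨永祥): School of Mathematical Sciences, Guizhou Normal University, Guiyang, Guizhou; Nianjie Wang (王念杰): Administrative Examination and Approval Service Bureau of Lanshan District, Rizhao, Shandong; Hanchuan Hu (胡涵川): School of Big Data and Science, Guizhou Normal University, Guiyang, Guizhou
Keywords: Hierarchical Reinforcement Learning; Drone; Artificial Intelligence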

Abstract: Hierarchical reinforcement learning is an important branch of reinforcement learning. Following a divide-and-conquer approach, it decomposes a complex problem into multiple subproblems and solves them in order to solve the problem as a whole. In recent years, driven by improvements in sensor capabilities and advances in artificial intelligence algorithms, autonomous navigation of unmanned aerial vehicles (UAVs) based on hierarchical reinforcement learning has become a research hotspot. This article surveys representative publications from China and abroad. It first explains what UAVs and hierarchical reinforcement learning are, and then focuses on applications of hierarchical reinforcement learning to optimization problems in UAV trajectory planning and resource allocation.

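Cite this article: Yongxiang Yang, Nianjie Wang, Hanchuan Hu. A Review of the Application of Hierarchical Reinforcement Learning in the Field of Drones [J]. Artificial Intelligence and Robotics Research, 2024, 13(1): 66-71. https://doi.org/10.12677/AIRR.2024.131008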

