|
[1]
|
Bellman, R. (1954) The Theory of Dynamic Programming. Bulletin of the American Mathematical Society, 60, 503-515. [Google Scholar] [CrossRef]
|
|
[2]
|
Sutton, R.S., Precup, D. and Singh, S. (1999) Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Artificial Intelligence, 112, 181-211. [Google Scholar] [CrossRef]
|
|
[3]
|
Dietterich, T.G. (2000) Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition. Journal of Artificial Intelligence Research, 13, 227-303. [Google Scholar] [CrossRef]
|
|
[4]
|
Dayan, P. and Hinton, G.E. (1992) Feudal Reinforcement Learning. Advances in Neural Information Processing Systems, 5, 272-278.
|
|
[5]
|
Vezhnevets, A.S., Osindero, S., Schaul, T., et al. (2017) Feudal Networks for Hierarchical Reinforcement Learning. International Conference on Machine Learning, Sydney, 6 August 2017, 3540-3549.
|
|
[6]
|
Kou, K., Yang, G., Zhang, W., et al. (2022) Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. 2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE), Wuhan, 16-17 December 2022, 1-5. [Google Scholar] [CrossRef]
|
|
[7]
|
Alpdemir, M.N. (2023) A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. Turkish Journal of Science and Technology, 18, 243-259. [Google Scholar] [CrossRef]
|
|
[8]
|
程先峰, 严勇杰. 基于MAXQ分层强化学习的有人机/无人机协同路径规划研究[J]. 信息化研究, 2020, 46(1): 13-19. [Google Scholar] [CrossRef]
|
|
[9]
|
Cheng, Y., Li, D., Wong, W.E., et al. (2022) Multi-UAV Collaborative Path Planning Using Hierarchical Reinforcement Learning and Simulated Annealing. International Journal of Performability Engineering, 18, 463-474. [Google Scholar] [CrossRef]
|
|
[10]
|
Ren, T., Niu, J., Dai, B., et al. (2021) Enabling Efficient Scheduling in Large-Scale UAV-Assisted Mobile-Edge Computing via Hierarchical Reinforcement Learning. IEEE Internet of Things Journal, 9, 7095-7109. [Google Scholar] [CrossRef]
|
|
[11]
|
Zhang, Y., Mou, Z, Gao, F., et al. (2020) Hierarchical Deep Reinforcement Learning for Backscattering Data Collection with Multiple UAVs. IEEE Internet of Things Journal, 8, 3786-3800. [Google Scholar] [CrossRef]
|