[1] Lee, J., Hwangbo, J., Wellhausen, L., Koltun, V. and Hutter, M. (2020) Learning Quadrupedal Locomotion over Challenging Terrain. Science Robotics, 5, eabc5986.
[2] Miki, T., Lee, J., Hwangbo, J., Wellhausen, L., Koltun, V. and Hutter, M. (2022) Learning Robust Perceptive Locomotion for Quadrupedal Robots in the Wild. Science Robotics, 7, eabk2822.
[3] Kumar, A., Fu, Z., Pathak, D. and Malik, J. (2021) RMA: Rapid Motor Adaptation for Legged Robots. Robotics: Science and Systems XVII, 12-16 July 2021, 1-12.
[4] Nahrendra, I.M.A., Yu, B. and Myung, H. (2023) DreamWaQ: Learning Robust Quadrupedal Locomotion with Implicit Terrain Imagination via Deep Reinforcement Learning. 2023 IEEE International Conference on Robotics and Automation (ICRA), London, 29 May-2 June 2023, 5078-5084.
[5] Long, J., Wang, Z., Li, Q., et al. (2023) Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response. arXiv: 2312.11460.
[6] Long, J., Yu, W., Li, Q., et al. (2024) Learning H-Infinity Locomotion Control. arXiv: 2404.14405.
[7] Lee, J., Hwangbo, J. and Hutter, M. (2019) Robust Recovery Controller for a Quadrupedal Robot Using Deep Reinforcement Learning. arXiv: 1901.07517.
[8] Smith, L., Kew, J.C., Peng, X.B., Ha, S., Tan, J. and Levine, S. (2022) Legged Robots That Keep on Learning: Fine-Tuning Locomotion Policies in the Real World. 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, 23-27 May 2022, 1593-1599.
[9] Nahrendra, I.M.A., Oh, M., Yu, B., et al. (2023) Robust Recovery Motion Control for Quadrupedal Robots via Learned Terrain Imagination. arXiv: 2306.12712.
[10] Bowman, S.R., Vilnis, L., Vinyals, O., Dai, A., Jozefowicz, R. and Bengio, S. (2016) Generating Sentences from a Continuous Space. Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, Berlin, August 2016, 10-21.
[11] Zhang, L., Shen, L., Yang, L., Chen, S., Yuan, B., Wang, X. and Tao, D. (2022) Penalized Proximal Policy Optimization for Safe Reinforcement Learning. arXiv: 2205.11814.
[12] Rudin, N., Hoeller, D., Reist, P. and Hutter, M. (2022) Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning. arXiv: 2109.11978.
[13] Pinto, L., Andrychowicz, M., Welinder, P., Zaremba, W. and Abbeel, P. (2018) Asymmetric Actor Critic for Image-Based Robot Learning. Robotics: Science and Systems XIV, Pittsburgh, 26-30 June 2018, 1-10.
[14] Kingma, D.P. and Welling, M. (2013) Auto-Encoding Variational Bayes. arXiv: 1312.6114.
[15] Higgins, I., Matthey, L., Pal, A., et al. (2017) β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. Proceedings of the International Conference on Learning Representations (ICLR) 2017, Toulon, 24-26 April 2017, 1-13.
[16] Burgess, C.P., Higgins, I., Pal, A., et al. (2017) Understanding Disentangling in β-VAE. arXiv: 1804.03599.
[17] Kullback, S. and Leibler, R.A. (1951) On Information and Sufficiency. The Annals of Mathematical Statistics, 22, 79-86.
[18] Schulman, J., Wolski, F., Dhariwal, P., et al. (2017) Proximal Policy Optimization Algorithms. arXiv: 1707.06347.
[19] Lee, J., Schroth, L., Klemm, V., et al. (2023) Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion. arXiv: 2309.15430.
[20] Makoviychuk, V., Wawrzyniak, L., Guo, Y., et al. (2021) Isaac Gym: High Performance GPU-Based Physics Simulation for Robot Learning. arXiv: 2108.10470.
[21] Kingma, D.P. and Ba, J. (2015) Adam: A Method for Stochastic Optimization. arXiv: 1412.6980.