|
[1]
|
Aydin, M.E., Durgut, R. and Rakib, A. (2024) Why Reinforcement Learning? Algorithms, 17, Article 269. [Google Scholar] [CrossRef]
|
|
[2]
|
Gronauer, S. and Diepold, K. (2021) Multi-agent Deep Reinforcement Learning: A Survey. Artificial Intelligence Review, 55, 895-943. [Google Scholar] [CrossRef]
|
|
[3]
|
Sutton, R.S. and Barto, A.G. (2018) Reinforcement Learning: An Introduction. MIT Press.
|
|
[4]
|
Kaelbling, L.P., Littman, M.L. and Moore, A.W. (1996) Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 4, 237-285. [Google Scholar] [CrossRef]
|
|
[5]
|
Shakya, A.K., Pillai, G. and Chakrabarty, S. (2023) Reinforcement Learning Algorithms: A Brief Survey. Expert Systems with Applications, 231, Article ID: 120495. [Google Scholar] [CrossRef]
|
|
[6]
|
Milani, S., Topin, N., Veloso, M. and Fang, F. (2024) Explainable Reinforcement Learning: A Survey and Comparative Review. ACM Computing Surveys, 56, 1-36. [Google Scholar] [CrossRef]
|
|
[7]
|
Song, Y., Suganthan, P.N., Pedrycz, W., Ou, J., He, Y., Chen, Y., et al. (2023) Ensemble Reinforcement Learning: A Survey. Applied Soft Computing, 149, Article ID: 110975. [Google Scholar] [CrossRef]
|
|
[8]
|
Benbrahim, H. and Franklin, J.A. (1997) Biped Dynamic Walking Using Reinforcement Learning. Robotics and Autonomous Systems, 22, 283-302. [Google Scholar] [CrossRef]
|
|
[9]
|
Singh, A., Yang, L., Finn, C. and Levine, S. (2019). End-to-End Robotic Reinforcement Learning without Reward Engineering. Robotics: Science and Systems XV, Breisgau, 22-26 June 2019.[CrossRef]
|
|
[10]
|
Barbič, J., et al. (2024) Segmenting Motion Capture Data into Distinct Behaviors. Proceedings of Graphics Interface 2004, London, 17-19 May 2004, 185-194.
|
|
[11]
|
Schäfer, P., Ermshaus, A. and Leser, U. (2021) ClaSP—Time Series Segmentation. Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 1-5 November 2021, 1578-1587. [Google Scholar] [CrossRef]
|
|
[12]
|
张奕, 王科琪. 基于MVU降维的捕捉数据自动分割[J]. 电子技术与软件工程, 2017(19): 182-184.
|
|
[13]
|
Usmani, M., Memon, Z.A., Zulfiqar, A. and Qureshi, R. (2024) Preptimize: Automation of Time Series Data Preprocessing and Forecasting. Algorithms, 17, Article 332. [Google Scholar] [CrossRef]
|
|
[14]
|
陈思喜, 李建微, 陈纾. 基于KPCA降维的运动捕捉数据自动分割[J]. 福州大学学报(自然科学版), 2015, 43(4): 456-459.
|
|
[15]
|
胡晓雁, 孙波, 朱小明, 等. 基于谱聚类的运动捕获数据分割[J]. 计算机辅助设计与图形学学报, 2016, 28(8): 1306-1315.
|
|
[16]
|
肖惠, 刘静民, 滑东红, 郑秀瑷, 侯曼. GB/T 17245-2004成年人人体惯性参数[S]. 北京: 中国标准出版社, 2004.
|