|
[1]
|
Fedus, W., Gelada, C., Bengio, Y., et al. (2019) Hyperbolic Discounting and Learning over Multiple Horizons. arXiv: 1902.06865.
|
|
[2]
|
Sozou, P.D. (1998) On Hyperbolic Discounting and Uncertain Hazard Rates. Proceedings of the Royal Society of London. Series B: Biological Sciences, 265, 2015-2020.
|
|
[3]
|
Alexander, W.H. and Brown, J.W. (2010) Hyperbolically Discounted Temporal Difference Learning. Neural Computation, 22, 1511-1527.
|
|
[4]
|
Alia, I. (2019) A Non-Exponential Discounting Time-Inconsistent Stochastic Optimal Control Problem for Jump-Diffusion. Mathematical Control and Related Fields, 9, 541-570.
|
|
[5]
|
Schultheis, M., Rothkopf, C.A. and Koeppl, H. (2022) Reinforcement Learning with Non-Exponential Discounting. Advances in Neural Information Processing Systems, 35, 3649-3662.
|
|
[6]
|
Nafi, N.M., Ali, R.F. and Hsu, W. (2022) Hyperbolically Discounted Advantage Estimation for Generalization in Reinforcement Learning. Decision Awareness in Reinforcement Learning Workshop at ICML 2022.
|
|
[7]
|
Ali, R.F. (2023) Non-Exponential Reward Discounting in Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 37, 16111-16112.
|
|
[8]
|
Kwiatkowski, A., Kalogeiton, V., Pettré, J., et al. (2023) UGAE: A Novel Approach to Non-exponential Discounting. arXiv: 2302.05740.
|
|
[9]
|
Hanson, F.B. (2007) Applied Stochastic Processes and Control for Jump-Diffusions: Modeling, Analysis and Computation. Society for Industrial and Applied Mathematics, Philadelphia.
|
|
[10]
|
Aalen, O., Borgan, O. and Gjessing, H. (2008) Survival and Event History Analysis: A Process Point of View. Springer, New York.
|
|
[11]
|
Särkkä, S. and Solin, A. (2019) Applied Stochastic Differential Equations. Cambridge University Press, Cambridge.
|
|
[12]
|
Fleming, W.H. and Soner, H.M. (2006) Controlled Markov Processes and Viscosity Solutions. Springer Science & Business Media, New York.
|
|
[13]
|
Simpkins, A. and Todorov, E. (2009) Practical Numerical Methods for Stochastic Optimal Control of Biological Systems in Continuous Time and Space. 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, Nashville, 30 March-2 April 2009, 212-218.
|
|
[14]
|
Tassa, Y. and Erez, T. (2007) Least Squares Solutions of the HJB Equation with Neural Network Value-Function Approximators. IEEE Transactions on Neural Networks, 18, 1031-1041.
|
|
[15]
|
Lutter, M., Belousov, B., Listmann, K., et al. (2020) HJB Optimal Feedback Control with Deep Differential Value Functions and Action Constraints. 3rd Conference on Robot Learning (CoRL 2019), Osaka, 640-650.
|
|
[16]
|
Sirignano, J. and Spiliopoulos, K. (2018) DGM: A Deep Learning Algorithm for Solving Partial Differential Equations. Journal of Computational Physics, 375, 1339-1364.
|