|
[1]
|
Merton, R.C. (1969) Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case. The Review of Economics and Statistics, 51, 247-257. [Google Scholar] [CrossRef]
|
|
[2]
|
Ferreira, M., Pinheiro, D. and Pinheiro, S. (2023) Optimal Consumption, Investment and Life Insurance Selection under Robust Utilities. International Journal of Financial Engineering, 10, Article ID: 2350016. [Google Scholar] [CrossRef]
|
|
[3]
|
Tao, C., Rong, X. and Zhao, H. (2023) Stochastic Control with Inhomogeneous Regime Switching: Application to Consumption and Investment with Unemployment and Reemployment. Journal of Mathematical Economics, 107, Article ID: 102849. [Google Scholar] [CrossRef]
|
|
[4]
|
Wang, H., Wang, N., Xu, L., Hu, S. and Yan, X. (2022) Household Investment-Consumption-Insurance Policies under the Age-Dependent Risk Preferences. International Journal of Control, 96, 2542-2554. [Google Scholar] [CrossRef]
|
|
[5]
|
Pollak, R.A. (1970) Habit Formation and Dynamic Demand Functions. Journal of Political Economy, 78, 745-763. [Google Scholar] [CrossRef]
|
|
[6]
|
Ryder, H.E. and Heal, G.M. (1973) Optimal Growth with Intertemporally Dependent Preferences. The Review of Economic Studies, 40, 1-31. [Google Scholar] [CrossRef]
|
|
[7]
|
Curatola, G. (2017) Optimal Portfolio Choice with Loss Aversion over Consumption. The Quarterly Review of Economics and Finance, 66, 345-358. [Google Scholar] [CrossRef]
|
|
[8]
|
van Bilsen, S., Laeven, R.J.A. and Nijman, T.E. (2020) Consumption and Portfolio Choice under Loss Aversion and Endogenous Updating of the Reference Level. Management Science, 66, 3927-3955. [Google Scholar] [CrossRef]
|
|
[9]
|
He, L., Liang, Z., Song, Y. and Ye, Q. (2022) Optimal Asset Allocation, Consumption and Retirement Time with the Variation in Habitual Persistence. Insurance: Mathematics and Economics, 102, 188-202. [Google Scholar] [CrossRef]
|
|
[10]
|
Wang, H., Zariphopoulou, T. and Zhou, X.Y. (2020) Reinforcement Learning in Continuous Time and Space: A Stochastic Control Approach. Journal of Machine Learning Research, 21, 1-34.
|
|
[11]
|
Jia, Y. and Zhou, X. (2021) Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms. Journal of Machine Learning Research, 23, 1-50.
|
|
[12]
|
Zhou, M., Han, J. and Lu, J. (2021) Actor-Critic Method for High Dimensional Static Hamilton-Jacobi-Bellman Partial Differential Equations Based on Neural Networks. SIAM Journal on Scientific Computing, 43, A4043-A4066. [Google Scholar] [CrossRef]
|
|
[13]
|
Wang, Z., Bapst, V., Heess, N., et al. (2016) Sample Efficient Actor-Critic with Experience Replay. arXiv: 1611.01224.
|