|
[1]
|
Pronzato, L., Kulcsár, C. and Walter, E. (1996) An Actively Adaptive Control Policy for Linear Models. IEEE Trans-actions on Automatic Control, 41, 855-858. [Google Scholar] [CrossRef]
|
|
[2]
|
Chen, S., Li, X. and Zhou, X.Y. (1998) Stochastic Linear Quadratic Regulators with Indefinite Control Weight Costs. SIAM Journal on Control and Optimization, 36, 1685-1702. [Google Scholar] [CrossRef]
|
|
[3]
|
Chen, S. and Zhou, X.Y. (2000) Stochastic Linear Quadratic Regulators with Indefinite Control Weight Costs. II. SIAM Journal on Control and Opti-mization, 39, 1065-1081. [Google Scholar] [CrossRef]
|
|
[4]
|
Rami, M.A., Moore, J.B. and Zhou, X.Y. (2002) Indefinite Stochastic Linear Quadratic Control and Generalized Differential Riccati Equation. SIAM Journal on Control and Optimization, 40, 1296-1311. [Google Scholar] [CrossRef]
|
|
[5]
|
Wang, T., Zhang, H. and Luo, Y. (2016) Infinite-Time Sto-chastic Linear Quadratic Optimal Control for Unknown Discrete-Time Systems Using Adaptive Dynamic Programming Approach. Neurocomputing, 171, 379-386. [Google Scholar] [CrossRef]
|
|
[6]
|
Du, K., Meng, Q. and Zhang, F. (2022) A Q-Learning Algo-rithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization. SIAM Journal on Control and Optimization, 60, 1991-2015. [Google Scholar] [CrossRef]
|
|
[7]
|
舒心. 带熵的随机线性二次最优控制问题[J]. 应用数学进展, 2022, 11(12): 8836-8845. [Google Scholar] [CrossRef]
|
|
[8]
|
Metropolis, N. and Ulam, S. (1949) The Monte Carlo Method. Journal of the American Statistical Association, 44, 335-341. [Google Scholar] [CrossRef] [PubMed]
|
|
[9]
|
Harrison, R.L. (2010) Introduction to Monte Carlo Simu-lation. AIP Conference Proceedings, 1204, 17-21. [Google Scholar] [CrossRef] [PubMed]
|
|
[10]
|
James, F. (1980) Monte Carlo Theory and Practice. Reports on Progress in Physics, 43, Article No. 1145. [Google Scholar] [CrossRef]
|
|
[11]
|
Glasserman, P. (2004) Monte Carlo Methods in Financial En-gineering. Springer, New York. [Google Scholar] [CrossRef]
|
|
[12]
|
Ferrenberg, A.M. and Swendsen, R.H. (1988) New Monte Carlo Technique for Studying Phase Transitions. Physical Review Letters, 61, 2635-2638. [Google Scholar] [CrossRef]
|
|
[13]
|
Robbins, H. and Monro, S. (1951) A Stochastic Approximation Method. The Annals of Mathematical Statistics, 22, 400-407. [Google Scholar] [CrossRef]
|
|
[14]
|
Lai, T.L. (2003) Stochastic Approximation. The Annals of Statistics, 31, 391-406. [Google Scholar] [CrossRef]
|
|
[15]
|
Nemirovski, A., Juditsky, A., Lan, G. and Shapiro, A. (2009) Robust Stochastic Approximation Approach to Stochastic Programming. SIAM Journal on Optimization, 19, 1574-1609. [Google Scholar] [CrossRef]
|
|
[16]
|
Tsitsiklis, J.N. (1994) Asynchronous Stochastic Approximation and Q-Learning. Machine Learning, 16, 185-202. [Google Scholar] [CrossRef]
|