关于具有不可分离哈密顿量的平均场控制系统的策略迭代算法的研究
A Study on the Policy Iteration Method for Time-Dependent Mean Field Controls Systems with Non-Separable Hamiltonians
DOI: 10.12677/AAM.2023.125239, PDF,   
作者: 唐毅雄:中国地质大学(武汉)数学与物理学院,湖北 武汉
关键词: 平均场控制微分博弈最优控制数值计算方法Mean Field Type Control Differential Games Optimal Control Numerical Methods
摘要: 平均场控制研究的是在大量智能体参与的系统中,由一个中心控制者计算系统的帕累托最优,且所有个体先验地使用完全相同的反馈控制。本文研究的对象是一类来自于平均场控制问题的倒向–正向偏微分方程组有限差分数值计算方法,使用的算法基于最优控制问题中的策略迭代方法。本文讨论了较短时间区域内算法的收敛性和收敛速度。本文通过一些例子验证了算法的有效性。本文通过数值试验讨论了平均场博弈纳什均衡和平均场控制帕累托最优之间的总体效益比较。
Abstract: Mean field type control studies the Pareto optimum computed by a central planner in a system with large number of agents, with each agent using a priori the same feedback control. This paper stud-ies the finite difference numerical approximation of a type of Mean field type control problem, using algorithms based on the policy iteration method from optimal control problems. Under the as-sumption of a short time horizon, convergence of the algorithm and the rate of convergence are discussed. Some numerical examples are supplemented to show the efficacy of the algorithms. The comparisons between Nash equilibrium and Pareto optimum, in terms of aggregate utility, are il-lustrated by numerical examples.
文章引用:唐毅雄. 关于具有不可分离哈密顿量的平均场控制系统的策略迭代算法的研究[J]. 应用数学进展, 2023, 12(5): 2364-2375. https://doi.org/10.12677/AAM.2023.125239

参考文献

[1] Lasry, J.-M. and Lions, P.-L. (2007) Mean Field Games. Japanese Journal of Mathematics, 2, 229-260. [Google Scholar] [CrossRef
[2] Huang, M., Malhamé, R.P. and Caines, P.E. (2006) Large Popu-lation Stochastic Dynamic Games: Closed-Loop McKean- Vlasov Systems and the Nash Certainty Equivalence Principle. Communications in Information and Systems, 6, 221- 252. [Google Scholar] [CrossRef
[3] Achdou, Y. and Lauriere, M. (2015) On the System of Partial Differential Equations Arising in Mean Field Type Control. arXiv:1503.05044. [Google Scholar] [CrossRef
[4] Carmona, R. and Delarue, F. (2013) Probabil-istic Analysis of Mean-Field Games. SIAM Journal on Control and Optimization, 51, 2705-2734. [Google Scholar] [CrossRef
[5] Bensoussan, A., Frehse, J., Yam, P., et al. (2013) Mean Field Games and Mean Field Type Control Theory. Vol. 101, Springer. [Google Scholar] [CrossRef
[6] Lauriere, M. (2021) Numerical Methods for Mean Field Games and Mean Field Type Control. Mean Field Games, 78, 221. [Google Scholar] [CrossRef
[7] Achdou, Y., Cardaliaguet, P., Delarue, F., et al. (2019) Mean Field Games and Applications: Numerical Aspects. 249-307. [Google Scholar] [CrossRef
[8] Cacace, S., Camilli, F. and Goffi, A. (2021) A Policy Iteration Method for Mean Field Games. ESAIM: Control, Optimisation and Calculus of Variations, 27, 85. [Google Scholar] [CrossRef
[9] Camilli, F. and Tang, Q. (2022) Rates of Convergence for the Policy It-eration Method for Mean Field Games Systems. Journal of Mathematical Analysis and Applications, 512, 126138. [Google Scholar] [CrossRef
[10] Cardaliaguet, P. and Rainer, C. (2019) On the (in)Efficiency of MFG Equilibria. SIAM Journal on Control and Optimization, 57, 2292-2314. [Google Scholar] [CrossRef
[11] Carmona, R., Graves, C.V. and Tan, Z. (2019) Price of Anarchy for Mean Field Games. ESAIM: Proceedings and Surveys, 65, 349-383. [Google Scholar] [CrossRef
[12] Graber, P.J. (2015) Weak Solutions for Mean Field Games with Congestion. Analysis of PDEs.
[13] Cirant, M., Gianni, R. and Mannucci, P. (2020) Short-Time Existence for a General Backward-Forward Parabolic System Arising from Mean-Field Games. Dynamic Games and Applications, 10, 100-119. [Google Scholar] [CrossRef
[14] Laurière, M., Song, J. and Tang, Q. (2023) Policy Iteration Method for Time-Dependent Mean Field Games Systems with Non-Separable Hamiltonians. Applied Mathematics & Optimization, 87, 1-34. [Google Scholar] [CrossRef