|
[1]
|
Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., et al. (2018) A General Reinforcement Learning Algorithm That Masters Chess, Shogi, and Go through Self-Play. Science, 362, 1140-1144. [Google Scholar] [CrossRef] [PubMed]
|
|
[2]
|
Young, T., Hazarika, D., Poria, S. and Cambria, E. (2018) Recent Trends in Deep Learning Based Natural Language Processing. IEEE Computational Intelligence Magazine, 13, 55-75. [Google Scholar] [CrossRef]
|
|
[3]
|
Letcher, A., Balduzzi, D., Racaniere, S., Martens, J., Foerster, J., Tuyls, K. and Graepel, T. (2019) Differentiable Game Mechanics. Journal of Machine Learning Research, 20, 3032-3071.
|
|
[4]
|
Lu, S., Tsaknakis, I., Hong, M. and Chen, Y. (2020) Hybrid Block Successive Approximation for One-Sided Nonconvex Min-Max Problems: Algorithms and Applications. IEEE Transactions on Signal Processing, 68, 3676-3691. [Google Scholar] [CrossRef]
|
|
[5]
|
Sanjabi, M., Ba, J., Razaviyayn, M. and Jason, D.L. (2018) On the Convergence and Robustness of Training Gans with Regularized Optimal Transport. Proceedings of the 32nd In-ternational Conference on Neural Information Processing Systems, Montréal, 3-8 December 2018, 7091-7101.
|
|
[6]
|
Nouiehed, M., Sanjabi, M., Huang, T., Lee, J.D. and Razaviyayn, M. (2019) Solving a Class of Nonconvex Min-Max Games Using Iterative First Order Methods. Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, 8-14 December 2019, 14934-14942.
|
|
[7]
|
Kong, W. and Monteiro, R.D.C. (2021) An Accelerated Inexact Proximal Point Method for Solving Nonconvex-Concave Min-Max Problems. SIAM Journal on Optimization, 31, 2558-2585. [Google Scholar] [CrossRef]
|
|
[8]
|
Lin, T., Jin, C. and Jordan, M.I. (2020) Near-Optimal Algorithms for Minimax Optimization. Conference on Learning Theory PMLR, 9-12 July 2020, 2738-2779.
|
|
[9]
|
Nesterov, Y. and Polyak, B.T. (2006) Cubic Regularization of Newton Method and Its Global Performance. Mathematical Programming, 108, 177-205. [Google Scholar] [CrossRef]
|
|
[10]
|
Cartis, C., Gould, N.I.M. and Toint, P.L. (2011) Adaptive Cubic Regularisation Methods for Unconstrained Optimization. Part I: Motivation, Convergence and Numerical Results. Mathematical Programming, 127, 245-295. [Google Scholar] [CrossRef]
|
|
[11]
|
Cartis, C., Gould, N.I.M. and Toint, P.L. (2011) Adaptive Cubic Regularisation Methods for Unconstrained Optimization. Part II: Worst-Case Function- and Derivative-Evaluation Complexity. Mathematical Programming, 130, 295-319. [Google Scholar] [CrossRef]
|
|
[12]
|
Luo, L., Li, Y. and Chen, C. (2022) Finding Second-Order Sta-tionary Point in Nonconvex-Strongly-Concave Minimax Optimization. 36th Conference on Neural Information Pro-cessing Systems (NeurIPS 2022), New Orleans, 28 November-9 December 2022, 36667-36679.
|
|
[13]
|
Lin, T., Jin, C. and Jordan, M.I. (2020) On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems. International Conference on Machine Learning. PMLR, 13-18 July 2020, 6083-6093.
|
|
[14]
|
Ganin, Y., Ustinova, E., Ajakan, H., et al. (2016) Domain-Adversarial Training of Neural Networks. Journal of Machine Learning Research, 17, 2096-2030.
|
|
[15]
|
LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998) Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE, 86, 2278-2324. [Google Scholar] [CrossRef]
|
|
[16]
|
Geerts, D., Nouwen, M., Beek, E.V., Slegers, K., Miranda, F.C. and Bleumers, L. (2019) Using the SGDA Framework to Design and Evaluate Research Games. Simulation and Gaming, 50, 272-301. [Google Scholar] [CrossRef]
|
|
[17]
|
Wang, Z., Zhou, Y., Liang, Y. and Lan, G. (2020) Cubic Regu-larization with Momentum for Nonconvex Optimization. Uncertainty in Artificial Intelligence. PMLR, 3-6 August 2020, 313-322.
|
|
[18]
|
徐姿, 张慧灵. 非凸极小极大优化问题的优化算法与复杂度分析[J]. 运筹学学报, 2021, 25(3): 74-86.
|