极小极大优化问题的一类自适应三次正则化牛顿法
One Class of Cubic Regularized Newton Methods for Min-Max Optimization Problems
摘要: 许多机器学习问题,如对抗学习、强化学习和图像处理的一些典型问题大都可归结为极小极大优化问题。因此,极小极大优化问题最近已经成为最优化领域与机器学习交叉领域中的一个重要研究前沿和热点,吸引了众多学者的关注和研究。如何有效求解极小极大优化问题,是优化领域及应用中的关键科学问题之一。考虑到极小极大三次正则化牛顿法中,三次正则化项的系数是相对固定的,会影响算法的收敛性。本文首先针对非凸–强凹极小极大优化问题,采用信赖域半径的选取策略,提出了一类自适应极小极大三次正则化牛顿法。然后证得了算法的迭代复杂度为O(T-1/3)。最后,通过一类对抗攻击神经网络问题,验证了该算法的有效性。
Abstract: Many machine learning problems, such as adversarial learning, reinforcement learning and image processing, can be reduced to min-max optimization problem. Therefore, min-max optimization has recently become an important research frontier and hot spot in the intersection of optimization and machine learning, attracting the attention and research of many scholars. How to effectively solve min-max problem is one of the key scientific problems in optimization field and application. Considering that the coefficient of cubic regularization term in min-max cubic regularization Newton method is relatively fixed, it will affect the convergence of the algorithm. In this paper, a class of adaptive min-max cubic regularization Newton method is proposed based on the selection strategy of the radius of the trust region for the nonconvex-strongly concave minimax optimization problem. Then, the iterative complexity of the algorithm is bounded by O(T-1/3). Finally, the effectiveness of the proposed algorithm is verified by a class of adversarial attack neural network problems.
文章引用:高瑞成. 极小极大优化问题的一类自适应三次正则化牛顿法[J]. 理论数学, 2023, 13(5): 1255-1266. https://doi.org/10.12677/PM.2023.135129

参考文献

[1] Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., et al. (2018) A General Reinforcement Learning Algorithm That Masters Chess, Shogi, and Go through Self-Play. Science, 362, 1140-1144. [Google Scholar] [CrossRef] [PubMed]
[2] Young, T., Hazarika, D., Poria, S. and Cambria, E. (2018) Recent Trends in Deep Learning Based Natural Language Processing. IEEE Computational Intelligence Magazine, 13, 55-75. [Google Scholar] [CrossRef
[3] Letcher, A., Balduzzi, D., Racaniere, S., Martens, J., Foerster, J., Tuyls, K. and Graepel, T. (2019) Differentiable Game Mechanics. Journal of Machine Learning Research, 20, 3032-3071.
[4] Lu, S., Tsaknakis, I., Hong, M. and Chen, Y. (2020) Hybrid Block Successive Approximation for One-Sided Nonconvex Min-Max Problems: Algorithms and Applications. IEEE Transactions on Signal Processing, 68, 3676-3691. [Google Scholar] [CrossRef
[5] Sanjabi, M., Ba, J., Razaviyayn, M. and Jason, D.L. (2018) On the Convergence and Robustness of Training Gans with Regularized Optimal Transport. Proceedings of the 32nd In-ternational Conference on Neural Information Processing Systems, Montréal, 3-8 December 2018, 7091-7101.
[6] Nouiehed, M., Sanjabi, M., Huang, T., Lee, J.D. and Razaviyayn, M. (2019) Solving a Class of Nonconvex Min-Max Games Using Iterative First Order Methods. Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, 8-14 December 2019, 14934-14942.
[7] Kong, W. and Monteiro, R.D.C. (2021) An Accelerated Inexact Proximal Point Method for Solving Nonconvex-Concave Min-Max Problems. SIAM Journal on Optimization, 31, 2558-2585. [Google Scholar] [CrossRef
[8] Lin, T., Jin, C. and Jordan, M.I. (2020) Near-Optimal Algorithms for Minimax Optimization. Conference on Learning Theory PMLR, 9-12 July 2020, 2738-2779.
[9] Nesterov, Y. and Polyak, B.T. (2006) Cubic Regularization of Newton Method and Its Global Performance. Mathematical Programming, 108, 177-205. [Google Scholar] [CrossRef
[10] Cartis, C., Gould, N.I.M. and Toint, P.L. (2011) Adaptive Cubic Regularisation Methods for Unconstrained Optimization. Part I: Motivation, Convergence and Numerical Results. Mathematical Programming, 127, 245-295. [Google Scholar] [CrossRef
[11] Cartis, C., Gould, N.I.M. and Toint, P.L. (2011) Adaptive Cubic Regularisation Methods for Unconstrained Optimization. Part II: Worst-Case Function- and Derivative-Evaluation Complexity. Mathematical Programming, 130, 295-319. [Google Scholar] [CrossRef
[12] Luo, L., Li, Y. and Chen, C. (2022) Finding Second-Order Sta-tionary Point in Nonconvex-Strongly-Concave Minimax Optimization. 36th Conference on Neural Information Pro-cessing Systems (NeurIPS 2022), New Orleans, 28 November-9 December 2022, 36667-36679.
[13] Lin, T., Jin, C. and Jordan, M.I. (2020) On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems. International Conference on Machine Learning. PMLR, 13-18 July 2020, 6083-6093.
[14] Ganin, Y., Ustinova, E., Ajakan, H., et al. (2016) Domain-Adversarial Training of Neural Networks. Journal of Machine Learning Research, 17, 2096-2030.
[15] LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998) Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE, 86, 2278-2324. [Google Scholar] [CrossRef
[16] Geerts, D., Nouwen, M., Beek, E.V., Slegers, K., Miranda, F.C. and Bleumers, L. (2019) Using the SGDA Framework to Design and Evaluate Research Games. Simulation and Gaming, 50, 272-301. [Google Scholar] [CrossRef
[17] Wang, Z., Zhou, Y., Liang, Y. and Lan, G. (2020) Cubic Regu-larization with Momentum for Nonconvex Optimization. Uncertainty in Artificial Intelligence. PMLR, 3-6 August 2020, 313-322.
[18] 徐姿, 张慧灵. 非凸极小极大优化问题的优化算法与复杂度分析[J]. 运筹学学报, 2021, 25(3): 74-86.