Accelerated Proximal Gradient Algorithm Based on Huber Loss and Group Lasso Penalty Problem
DOI: 10.12677/ORF.2023.136721. Supported by the National Natural Science Foundation of China.
Authors: Huiling Shen, Dingtao Peng*, Xian Zhang (School of Mathematics and Statistics, Guizhou University, Guiyang, Guizhou)
Keywords: Huber Loss, Group Lasso, Nesterov Acceleration, Accelerated Proximal Gradient Algorithm
Abstract: In high-dimensional linear regression, group sparse recovery adds a group Lasso penalty term to the underlying linear model so as to induce as few nonzero groups as possible, which turns the recovery problem into a convex optimization model. This paper studies the composite problem of a Huber loss with a group Lasso penalty, where the penalty term enforces the group sparsity structure of the coefficients. First, since the penalty is a convex but nonsmooth function, we give its classical subdifferential in order to characterize the optimality conditions of the group Lasso model. Second, we propose an accelerated proximal gradient algorithm based on Nesterov's acceleration technique to solve the model. Finally, we prove the convergence of the proposed algorithm.
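To make the model concrete, the problem described above can be written in the following standard composite form; the symbols $A$, $b$, $\delta$, $\lambda$, and the group partition $\mathcal{G}$ are illustrative notation rather than the paper's own:

```latex
\min_{x\in\mathbb{R}^n}\ \sum_{i=1}^{m} h_\delta\big((Ax-b)_i\big)
  + \lambda \sum_{g\in\mathcal{G}} \|x_g\|_2,
\qquad
h_\delta(t)=
\begin{cases}
  \tfrac12\, t^2, & |t|\le \delta,\\[2pt]
  \delta\,|t| - \tfrac12\,\delta^2, & |t|>\delta.
\end{cases}
```

The two objects the abstract refers to, the classical subdifferential of the nonsmooth group norm and its proximal operator (group soft-thresholding), then take the standard forms

```latex
\partial\|x_g\|_2 =
\begin{cases}
  \{\, x_g/\|x_g\|_2 \,\}, & x_g\neq 0,\\[2pt]
  \{\, v : \|v\|_2\le 1 \,\}, & x_g=0,
\end{cases}
\qquad
\operatorname{prox}_{\tau\|\cdot\|_2}(z_g)
  = \Big(1-\frac{\tau}{\|z_g\|_2}\Big)_{+}\, z_g .
```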
Citation: Shen, H.L., Peng, D.T. and Zhang, X. (2023) Accelerated Proximal Gradient Algorithm Based on Huber Loss and Group Lasso Penalty Problem. Operations Research and Fuzziology, 13(6), 7333-7345. https://doi.org/10.12677/ORF.2023.136721
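As a minimal sketch of the accelerated proximal gradient (FISTA-type) iteration the abstract describes, the Python code below combines the Huber-loss gradient, group soft-thresholding, and Nesterov momentum; every name and parameter here (apg_huber_group_lasso, the fixed step size 1/L, the stopping rule) is an illustrative assumption, not the paper's actual implementation.

```python
import numpy as np

def huber_grad(r, delta):
    # Componentwise gradient of the Huber function h_delta at the residual r.
    return np.where(np.abs(r) <= delta, r, delta * np.sign(r))

def group_soft_threshold(z, groups, tau):
    # Proximal operator of tau * sum_g ||x_g||_2 (block soft-thresholding);
    # `groups` is a list of index arrays partitioning {0, ..., n-1}.
    x = np.zeros_like(z)
    for g in groups:
        ng = np.linalg.norm(z[g])
        if ng > tau:
            x[g] = (1.0 - tau / ng) * z[g]
    return x

def apg_huber_group_lasso(A, b, groups, lam, delta=1.0, max_iter=500, tol=1e-8):
    """FISTA-type sketch for min_x sum_i h_delta((Ax - b)_i) + lam * sum_g ||x_g||_2."""
    x = np.zeros(A.shape[1])
    y, t = x.copy(), 1.0
    # The smooth part has an L-Lipschitz gradient with L <= ||A||_2^2, since |h''| <= 1.
    L = np.linalg.norm(A, 2) ** 2
    for _ in range(max_iter):
        grad = A.T @ huber_grad(A @ y - b, delta)
        x_new = group_soft_threshold(y - grad / L, groups, lam / L)
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0   # Nesterov momentum sequence
        y = x_new + ((t - 1.0) / t_new) * (x_new - x)      # extrapolation step
        if np.linalg.norm(x_new - x) <= tol * max(1.0, np.linalg.norm(x)):
            return x_new
        x, t = x_new, t_new
    return x
```

For instance, with contiguous groups of size 3 one would pass groups = [np.arange(3*j, 3*j + 3) for j in range(A.shape[1] // 3)]. Because the Huber term is convex with an L-Lipschitz gradient, this Nesterov scheme enjoys the standard O(1/k^2) decay of the objective gap typical of FISTA-type methods.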
