高维数据变量选择中MCP正则化参数选择研究
MCP Regularization Parameter Selection in High Dimensional Data Variable Selection
摘要:
大数据时代,高维数据的变量选择是现代统计的研究热点问题之一。MCP正则化方法是常用的变量选取方法,但MCP正则化方法的优劣取决于能否选取出最优的正则化参数。本文在BIC准则的基础上,提出适用于MCP正则化参数选择的MBIC准则。通过数据模拟及实际应用表明,MCP方法在MBIC准则下能够以更高的概率选择正确的模型,MBIC准则明显优于其它参数选择方法。
Abstract:
In the era of big data, variable selection of high-dimensional data is one of the hot topics in modern statistics. The MCP regularization method is a commonly used variable selection method, but the merits of the MCP regularization method depend on whether the optimal regularization parameter can be selected. Based on the BIC criterion of regularization parameter selection, an MBIC criterion is proposed for MCP regularization parameter selection. Through data simulation and practical application, the MCP method with MBIC criterion can select the correct model with higher probability, which is obviously superior to other regularization parameter selection methods.
参考文献
|
[1]
|
Akaike, H. (1973) Information Theory and an Extension of the Maximum Likelihood Principle. 2nd Interernationl Symp. on Information Theory, 1973.
|
|
[2]
|
Schwarz, G. and Schwartz, G.J. (1978) Estimating Dimension of a Model. Annals of Statistics, 6, 461-464. [Google Scholar] [CrossRef]
|
|
[3]
|
曾津, 周建军. 高维数据变量选择方法综述[J]. 数理统计与管理, 2017, 36(4): 678-692.
|
|
[4]
|
Tibshirani, R.J. (1996) Regression Shrinkage and Selection via the LASSO. Journal of the Royal Statistical Society, Series B: Methodological, 73, 267-288. [Google Scholar] [CrossRef]
|
|
[5]
|
Fan, J. and Li, R. (2001) Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association, 96, 1348-1360. [Google Scholar] [CrossRef]
|
|
[6]
|
Zhang, C.-H. (2010) Nearly Unbiased Variable Selection under Minimax Concave Penalty. The Annals of Statistics, 38, 894-942. [Google Scholar] [CrossRef]
|
|
[7]
|
Zou, H., Hastie, T. and Tibshirani, R. (2007) On the “degrees of freedom” of the Lasso. The Annals of Statistics, 35, 2173-2192. [Google Scholar] [CrossRef]
|
|
[8]
|
Wang, H., Li, R. and Tsai, C.L. (2007) Tuning Parameter Selectors for the Smoothly Clipped Absolute Deviation Method. Biometrika, 94, 553-568. [Google Scholar] [CrossRef] [PubMed]
|
|
[9]
|
周荣旺. SCAD方法的调整参数选择[D]: [硕士学位论文]. 大连: 大连理工大学, 2010.
|
|
[10]
|
Breheny, P. and Huang, J. (2011) Coordinate Descent Algorithms for Nonconvex Penalized Regression with Applications to Biological Feature Selection. Annals of Applied Statistics, 5, 232-253. [Google Scholar] [CrossRef]
|