广义线性模型的平方根Lasso选择性推断
Selective Inference of Generalized Linear Models by the Square Root Lasso
摘要: 已经有很多人对线性模型的相关问题做出了大量的选择性推断工作。但是,其适用范围并不全面。实际上,我们会遇到很多非正态非连续的数据,并且随机误差项是异方差等方面的问题,故本文将一些选择性推断的工作推广到广义线性模型上。我们建议使用数据雕琢方法,对数据样本先拆分再进行推断。另外,很多的模型变量选择的工作,都是基于已知噪声水平的。但是,在很多实际情况下,误差方差是未知的,而且在高维数据中,对它的估计存在一定难度。本文中,我们使用平方根lasso,进行变量选择,证明出对于广义线性模型,调优参数的选择不受数据中噪声波动的影响。因此,平方根lasso比lasso调优参数选择更方便,应用范围更广泛。模拟结果表明,使用数据雕琢得到的参数的置信区间更小。
Abstract: In view of the fact that many people have done a lot of work on the selective inference of linear models, however, its scope of application is not comprehensive. In fact, we will encounter many problems such as non normal and non continuous data, and the random error term is heteroscedasticity. Therefore, this paper extends some selective inference work to generalized linear model. We suggest that we use the data carving to split the sample first and then infer. In addition, many of the work of model variable selection are based on the known noise level. However, in practical cases, the error variance is unknown, and in high-dimensional data, it is difficult to estimate. In this paper, we use the square root lasso to select variables, and prove that for the generalized linear model, the selection of tuning parameters is not affected by the noise fluctuations in the data. So the square root lasso is more convenient and widely used than the lasso, and the application is more extensive. Simulation shows that we get the shorter interval length by data carving.
文章引用:梁博, 石翔宇, 张齐. 广义线性模型的平方根Lasso选择性推断[J]. 应用数学进展, 2021, 10(5): 1668-1680. https://doi.org/10.12677/AAM.2021.105178

参考文献

[1] Cox, D.R. (1975) A Note on Data-Splitting for the Evaluation of Significance Levels. Narnia, 62, 441-444. [Google Scholar] [CrossRef
[2] Wasserman, L. and Roeder, K. (2009) High-Dimensional Variable Selection. The Annals of Statistics, 37, 2178-2201. [Google Scholar] [CrossRef
[3] Meinshausen, N., Meier, L. and Bühlmann, P. (2009) p-Values for High-Dimensional Regression. Journal of the American Association Statistical, 104, 1671-1681. [Google Scholar] [CrossRef
[4] Nelder, J.A. and Wedderburn, R. (1972) Generalized Linear Models. Journal of the Royal Statistical Society, 135, 370-384. [Google Scholar] [CrossRef
[5] Wedderburn, R. (1974) Quasi-Likelihood Functions, Generalized Linear Models, and the Gauss—Newton Method. Biometrika, 61, 439-447. [Google Scholar] [CrossRef
[6] Hocking, R.R. (1976) A Biometrics Invited Paper. The Analysis and Selection of Variables in Linear Regression. Biometrics, 32, 1-49. [Google Scholar] [CrossRef
[7] Guyon, I. and Elisseeff, A. (2003) An Introduction to Variable and Feature Selection. Journal of Machine Learning Research, 3, 1157-1182.
[8] Akaike, H. (1998) Information Theory and an Extension of the Maximum Likelihood Principle. In: Parzen, E., Tanabe, K. and Kitagawa, G., Eds., Selected Papers of Hirotugu Akaike, Springer, New York, 199-213. [Google Scholar] [CrossRef
[9] Schwarz, G.E. (1978) Estimating the Dimension of a Model. The Annals of Statistics, 6, 461-464. [Google Scholar] [CrossRef
[10] Tibshirani, R. (1996) Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society, Series B, 58, 267-288. [Google Scholar] [CrossRef
[11] Zou, H. (2006) The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association, 101, 1418-1429. [Google Scholar] [CrossRef
[12] Meinshausen, N. (2007) Relaxed Lasso. Computational Statistics & Data Analysis, 52, 374-393. [Google Scholar] [CrossRef
[13] Belloni, A., Chernozhukov, V. and Wang, L. (2011) Square-Root Lasso: Pivotal Recovery of Sparse Signals via Conic Programming. Biometrika, 98, 791-806.
[14] Lockhart, R., Taylor, J, Tibshirani, R.J. and Tibshirani, R. (2013) Rejoinder: “A Significance Test for the Lasso”. The Annals of Statistics, 42, 518-531. [Google Scholar] [CrossRef
[15] Lee, J.D and Taylor, J.E. (2014) Exact Post Model Selection Inference for Marginal Screening. In: Advances in Neural Information Processing Systems, 136-144.
https://proceedings.neurips.cc/paper/2014/file/a0a080f42e6f13b3a2df133f073095dd-Paper.pdf
[16] Lee, J.D., Sun, D.L., Sun, Y. and Taylor, J.E. (2016) Exact Post-Selection Inference, with Application to the Lasso. Annals of Statistics, 44, 907-927. [Google Scholar] [CrossRef
[17] Belloni, A., Chernozhukov, V. and Hansen, C. (2011) Inference for High-Dimensional Sparse Econometric Models. Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
https://arxiv.org/pdf/1201.0220v1.pdf
[18] Belloni, A., Chernozhukov, V., Fernández-Val, I. and Hansen, C. (2013) Program Evaluation and Causal Inference with High-Dimensional Data. Econometrica, 85, 233-298. [Google Scholar] [CrossRef
[19] Zhang, C.H. and Zhang, S.S. (2014) Confidence Intervals for Low Dimensional Parameters in High Dimensional Linear Models. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 76, 217-242. [Google Scholar] [CrossRef
[20] Tian, X.Y., Loftus, J.R. and Taylor, J.E. (2018) Selective Inference with Unknown Variance via the Square-Root Lasso. Biometrika, 105, 755-768. [Google Scholar] [CrossRef
[21] Neath, A.A. (2006) Testing Statistical Hypotheses. Journal of the American Statistical Association, 101, 847-848. [Google Scholar] [CrossRef
[22] Scheffé, L.H. (1955) Completeness, Similar Regions, and Unbiased Estimation: Part II. Sankhyā: The Indian Journal of Statistics, 15, 219-236.
[23] Benjamini, Y. and Hochberg, Y. (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B: Methodological, 57, 289-300. [Google Scholar] [CrossRef
[24] Fithian, W., Sun, D. and Taylor, J. (2014) Optimal Inference after Model Selection. arXiv: 1410.2597.
[25] Shi, X.Y., Liang, B. and Zhang, Q. (2020) Post-Selection Inference of Generalized Linear Models Based on the Lasso and the Elastic Net. Communication in Statistics-Theory and Methods, No. 725, 1-18. [Google Scholar] [CrossRef