高维线性模型中参数组的显著性检验
Significance Tests for Parameter Group in High-Dimensional Linear Models
摘要: 高维线性模型中参数的显著性检验是统计学研究的热点。现有的统计方法往往局限于参数的稀疏性假设,但在实际中,该假设是很难检验且极容易被违反的。本文通过重构回归对参数组进行显著性检验,首先利用原假设的结构得到重构回归模型,并建立设计矩阵间的线性相关模型,进而计算两模型误差的相关性,以此将待测原假设转化为可检验的矩条件,并建立检验统计量,得到检验统计量的渐近分布。最后,通过模拟试验,观察该检验方法在不同的误差和设计矩阵设置下的第I类错误概率和势函数曲线。试验结果表明:对于参数组检验,矩方法能保证第I类错误概率控制在显著性水平附近,并且可以证明检验的势特征。
Abstract: The significance test of parameters in high-dimensional linear models is a hot topic in statistical re-search. Existing statistical methods are often limited to the assumption of parameter sparsity, but in practice, this assumption is difficult to test and easily violated. In this paper, the significance test of the parameter group is carried out through reconstruction regression. First, the reconstruction regression model is obtained by using the structure of the null hypothesis, and the linear correla-tion model between the design matrices is established, and then the correlation between the errors of the two models is calculated. The hypothesis is transformed into a testable moment condition, establish the test statistic, and obtain the asymptotic distribution of the test statistic. Finally, through simulation experiments, the probability of Type I error of the test method and power func-tion curve under different error and design matrix settings are observed. The experimental results show that for the parameter group test, the moment method can ensure that the probability of Type I error is close to the significance level, and can prove the power property of the test.
文章引用:史延美. 高维线性模型中参数组的显著性检验[J]. 应用数学进展, 2022, 11(10): 6924-6933. https://doi.org/10.12677/AAM.2022.1110734

参考文献

[1] Bühlmann, P. and Geer, S. (2015) High-Dimensional Inference in Misspecified Linear Models. Electronic Journal of Statistics, 9, 1449-1473. [Google Scholar] [CrossRef
[2] Tibshirani, R. (1996) Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society. Series B (Methodological), 58, 267-288. [Google Scholar] [CrossRef
[3] Candès, E. and Tao, T. (2007) Rejoinder: The Dantzig Selector: Statistical Estimation When p Is Much Larger than n. The Annals of Statistics, 35, 2392-2404. [Google Scholar] [CrossRef
[4] Belloni, A. (2011) Square-Root Lasso: Pivotal Recovery of Sparse Signals via Conic Programming. Biometrika, 98, 791-806. [Google Scholar] [CrossRef
[5] Nickl, R. and Geer, S. (2012) Confidence Sets in Sparse Regression. The Annals of Statistics, 41, 2852-2876. [Google Scholar] [CrossRef
[6] Cai, T.T. and Guo, Z. (2015) Confidence Intervals for High-Dimensional Linear Regression: Minimax Rates and Adaptivity. Statistics, 45, 615-646. [Google Scholar] [CrossRef
[7] Zhang, C.-H. and Zhang, S.S. (2014) Confidence Intervals for Low Di-mensional Parameters in High Dimensional Linear Models. Journal of the Royal Statistical Society, 76, 217-242.
[8] Javanmard, A. and Montanari, A. (2015) De-Biasing the Lasso: Optimal Sample Size for Gaussian De-signs. The Annals of Statistics, 46, 1997-2017. [Google Scholar] [CrossRef
[9] Sara, V., Bühlmann, P., Ritov, Y. and Dezeure, R. (2014) On Asymptotically Optimal Confidence Regions and Tests for High-Dimensional Models. Annals of Statistics, 42, 1166-1202. [Google Scholar] [CrossRef
[10] Stadler, N. and Mukherjee, S. (2017) Two-Sample Testing in High Dimensions. Journal of the Royal Statistical Society, Series B. Statistical Method-ology, 79, 225-246. [Google Scholar] [CrossRef
[11] Cai, T., Cai, T. and Guo, Z. (2019) Optimal Statistical Inference for Individualized Treatment Effects in High-Dimensional Models.
[12] Guo, Z., Rakshit, P., Herman, D.S. and Chen, J. (2020) Inference for the Case Probability in High-Dimensional Logistic Regression.
[13] Li, H.Z., et al. (2021) Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models. Journal of the American Statistical Association, 116, 984-998. [Google Scholar] [CrossRef] [PubMed]
[14] Zhang, Y. and Politis, D.N. (2020) Ridge Regression Revisited: Debiasing, Thresholding and Bootstrap.
[15] Zhang, J.T., Zhou, B. and Guo, J. (2022) Testing High-Dimensional Mean Vector with Applications. Statistical Papers, 63, 1105-1137. [Google Scholar] [CrossRef
[16] Zhu, Y. and Bradic, J. (2017) Significance Testing in Non-Sparse High-Dimensional Linear Models. Electronic Journal of Statistics, 12, 3312-3364. [Google Scholar] [CrossRef
[17] Zhu, Y. and Bradic, J. (2016) Linear Hypothesis Testing in Dense High-Dimensional Linear Models. Journal of the American Statal Association, 113, 1583-1600. [Google Scholar] [CrossRef