高维数据在Cox回归模型中的自变量选择
Independent Variable Selection of High-Dimensional Data in Cox Regression Model
摘要: 在高维数据分析中,LASSO维数约简方法占有很重要的位置。针对数据愈发繁杂,LASSO回归不再适应一些具有较高相关性的高维数据分析,由此产生了Elastic Net和其他相关的一些高维数据拓展分析方法。Elastic Net是在LASSO的思想方法基础上结合非凸罚函数和岭回归方法得到的,Adaptive Elastic Net等是在Elastic Net的思想方法上,通过数据特征的不同改进惩罚函数,缓和稀疏性和过拟合的问题。文章对Elastic Net、Adaptive Elastic Net、Weight Elastic Net进行了介绍并且通过实际例子做了简单的比较,最终得到了较好的维数约简方法。
Abstract: In high-dimensional data analysis, LASSO occupies a very important position. For data becoming more and more complicated, LASSO regression is not very suitable for some relevant high-dimen- sional data analysis. Some of the extended analysis methods are produced, such as adaptive Elastic Net and other high-dimensional data analysis methods. Elastic Net is obtained by combining non- convex penalty and ridge regression methods on the basis of LASSO’s method of thinking. Adaptive Elastic Net, etc., based on Elastic Net’s method of thinking, uses different data characteristics to improve the penalty function, and continuously corrects the sparsity and overfitting problem. The article mainly introduces Elastic Net, Adaptive Elastic Net, Weight Elastic Net and makes a simple comparison of several methods through practical examples.
文章引用:刘锋, 胡天英, 陈俊霖, 但晨. 高维数据在Cox回归模型中的自变量选择[J]. 统计学与应用, 2021, 10(2): 183-192. https://doi.org/10.12677/SA.2021.102018

参考文献

[1] Noah, S., Jerome, F., Trevor, H. and Rob, T. (2011) Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. Journal of Statistical Software, 39, 1-13.
[Google Scholar] [CrossRef] [PubMed]
[2] Zou, H. and Hastie, T. (2005) Regularization and Variable Selection via the Elastic Net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67, 301-320.
[Google Scholar] [CrossRef
[3] 宋瑞琪, 朱永忠, 王新军. 高维数据中变量选择研究[J]. 统计与决策, 2019, 35(2): 13-16.
[4] Zhang, H.H. and Lu, W.B. (2007) Adaptive Lasso for Cox’s Proportional Hazards Model. Biometrika, 94, 691-703.
[Google Scholar] [CrossRef
[5] Zou, H. (2006) The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association, 101, 1418-1429.
[Google Scholar] [CrossRef
[6] Huang, J., Ma, S.G. and Zhang, C.H. (2006) Adaptive LASSO for Sparse High-Dimensional Regression. Statistica Sinica, 18, 1603-1618.
[7] Simon, N., Friedman, J.H., Hastie, T. and Tibshirani, R. (2011) Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. Journal of Statistical Software, 39, 1-13.
[Google Scholar] [CrossRef] [PubMed]
[8] 董英, 黄品贤. Cox模型及预测列线图在R软件中的实现[J]. 数理医药学杂志, 2012, 25(6): 711-713.
[9] 韦新星, 李春红, 戴洪帅. Adaptive Elastic Net方法在Cox模型变量选择中的研究[J]. 西南大学学报(自然科学版), 2017, 39(9): 88-94.
[10] 闫丽娜, 覃婷, 王彤. LASSO方法在Cox回归模型中的应用[J]. 中国卫生统计, 2012, 29(1): 58-60+64.
[11] Fan, J.Q. and Li, R.Z. (2001) Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association, 96, 1348-1360.
[Google Scholar] [CrossRef
[12] The R Project for Statistical Computing.
https://www.r-project.org/