基于投影重采样的多元非线性降维
On a Projective Resampling Method for Nonlinear Dimension Reduction with Multivariate Responses
摘要: 本文研究响应变量和预测因子均为向量的充分降维问题。投影重采样方法的核心思想是将多元响应投影到随机采样的方向上,以获取标量响应的样本,并反复应用单变量响应的降维方法来解决问题。该方法已被证明对多元线性降维有效。本文将投影重采样方法推广到非线性情境,并通过核映射提出了四种新的估计方法。研究结果表明,新方法具有优良的性质,并能在温和条件下完整恢复降维空间。最后,通过数值模拟和真实数据集分析验证了所提方法的有效性和可行性。
Abstract: This paper addresses the problem of sufficient dimension reduction where both the response and predictor are vectors. The core idea of projective resampling method is to project the multivariate responses along randomly sampled directions to obtain samples of scalar-valued responses. A univariate-response dimension reduction method is then applied repeatedly for solving the problem. This has proven effective for multivariate linear dimension reduction. In this paper, we extend the projective resampling method to nonlinear scenarios and use the mapping induced by kernels to develop four novel estimation methods. The research results suggest that the new methods exhibit excellent properties and ensure full recovery of the dimension reduction space under mild conditions. Finally, we validate the effectiveness and feasibility of the proposed methods through numerical simulations and real data analysis.
文章引用:申亮亮, 张俊英. 基于投影重采样的多元非线性降维[J]. 统计学与应用, 2024, 13(5): 1887-1898. https://doi.org/10.12677/sa.2024.135183

参考文献

[1] Li, K. (1991) Sliced Inverse Regression for Dimension Reduction. Journal of the American Statistical Association, 86, 316-327. [Google Scholar] [CrossRef
[2] Cook, R.D. and Weisberg, S. (1991) Sliced Inverse Regression for Dimension Reduction: Comment. Journal of the American Statistical Association, 86, 328-332. [Google Scholar] [CrossRef
[3] Li, B., Zha, H. and Chiaromonte, F. (2005) Contour Regression: A General Approach to Dimension Reduction. The Annals of Statistics, 33, 1580-1616. [Google Scholar] [CrossRef
[4] Li, B. and Wang, S. (2007) On Directional Regression for Dimension Reduction. Journal of the American Statistical Association, 102, 997-1008. [Google Scholar] [CrossRef
[5] Setodji, C.M. and Cook, R.D. (2004) K-Means Inverse Regression. Technometrics, 46, 421-429. [Google Scholar] [CrossRef
[6] Saracco, J. (2005) Asymptotics for Pooled Marginal Slicing Estimator Based on SIRα Approach. Journal of Multivariate Analysis, 96, 117-135. [Google Scholar] [CrossRef
[7] Li, B., Wen, S. and Zhu, L. (2008) On a Projective Resampling Method for Dimension Reduction with Multivariate Responses. Journal of the American Statistical Association, 103, 1177-1186. [Google Scholar] [CrossRef
[8] Scholkopf, B. and Smola, A.J. (2018) Learning with Kernels: Support Vector Machines, Regularization, Optimization, and beyond. MIT Press.
[9] Wu, H. (2008) Kernel Sliced Inverse Regression with Applications to Classification. Journal of Computational and Graphical Statistics, 17, 590-610. [Google Scholar] [CrossRef
[10] Akaho, S. (2001) A kernel method for canonical correlation analysis. arXiv: cs/0609071.
[11] Bach, F.R. and Jordan, M.I. (2002) Kernel Independent Component Analysis. Journal of Machine Learning Research, 3, 1-48.
[12] Fukumizu, K., Bach, F.R. and Gretton, A. (2007) Statistical Consistency of Kernel Canonical Correlation Analysis. Journal of Machine Learning Research, 8, 361-383.
[13] Lee, K., Li, B. and Chiaromonte, F. (2013) A General Theory for Nonlinear Sufficient Dimension Reduction: Formulation and Estimation. The Annals of Statistics, 41, 221-249. [Google Scholar] [CrossRef
[14] Mercer, J. (1909) Functions of Positive and Negative Type and Their Connection with Theory of Integral Equations. Philosophical Transaction of the Royal Society A, 209, 415-446.
[15] König, H. (2013) Eigenvalue Distribution of Compact Operators. Vol. 16, Birkhäuser.
[16] Lee, K., Li, B. and Zhao, H. (2016) Variable Selection via Additive Conditional Independence. Journal of the Royal Statistical Society Series B: Statistical Methodology, 78, 1037-1055. [Google Scholar] [CrossRef
[17] Li, B. (2018) Sufficient Dimension Reduction: Methods and Applications with R. Chapman and Hall/CRC.
[18] Zhu, L., Miao, B. and Peng, H. (2006) On Sliced Inverse Regression with High-Dimensional Covariates. Journal of the American Statistical Association, 101, 630-643. [Google Scholar] [CrossRef
[19] Wang, Q. and Yin, X. (2008) A Nonlinear Multi-Dimensional Variable Selection Method for High Dimensional Data: Sparse Mave. Computational Statistics & Data Analysis, 52, 4512-4520. [Google Scholar] [CrossRef
[20] Li, L., Li, B. and Zhu, L. (2010) Groupwise Dimension Reduction. Journal of the American Statistical Association, 105, 1188-1201. [Google Scholar] [CrossRef
[21] Guo, Z., Li, L., Lu, W. and Li, B. (2015) Groupwise Dimension Reduction via Envelope Method. Journal of the American Statistical Association, 110, 1515-1527. [Google Scholar] [CrossRef] [PubMed]
[22] Fernholz, L.T. (2012) Von Mises Calculus for Statistical Functionals (Vol. 19). Springer Science & Business Media.