高维成分数据的精度矩阵估计
Large Precision Matrix Estimation for Compositional Data
摘要: 高维成分数据在许多应用中均有出现,由于定和约束,统计方法通常不能产生合理的结果。高维协方差矩阵和精度(逆协方差)矩阵的估计是现代多元分析的基本问题,本文考虑高维成分数据的精度矩阵估计问题。已知样本协方差矩阵求逆对于估计精度矩阵是不稳定的,由于数据的样本量小于变量个数,高维数据矩阵的逆很难估计。本文利用中心对数比变换方法,处理高维成分数据,然后解决协方差矩阵奇异性问题,得到高维成分数据的精度矩阵估计。模拟实验和实际数据可以验证提出方法的合理性。
Abstract: High-dimensional compositional data arise in many applications, and statistical methods often fail to produce sensible results due to the unit-sum constraints. The estimation of high dimensional covariance matrix or precision (inverse covariance) matrix is the basic problem of modern multivariate analysis. In this paper, the precision matrix estimation problem for high-dimensional compositional data is considered. It is known that the inverse of the sample covariance matrix is unstable for the estimate precision matrix. Since the sample size of the data is smaller than the number of variables, the inverse of the high-dimensional data matrix is difficult to estimate. In this paper, we use the centered log-ratio transformation method to process high-dimensional compositional data, and then solve the singularity problem of covariance matrix, and obtain the precision matrix estimation of high-dimensional compositional data. Simulation experiments and actual data can verify the rationality of the proposed method.
文章引用:张轩轩, 何凤霞. 高维成分数据的精度矩阵估计[J]. 统计学与应用, 2019, 8(5): 777-783. https://doi.org/10.12677/SA.2019.85088

参考文献

[1] Ferrers, N.M. (1866) An Elementary Treatise on Trilinear Coordinates. Macmillan, London.
[2] Aitchison, J. (1968) The Statistical Analysis of Compositional Data. Chapman and Hall, London.
[3] Aitchison, J. (1994) A Concise Guide to Compositional Data Analysis. Institute of Mathematical Statistics Lecture Notes—Monograph Series, Vol. 24, 73-81. [Google Scholar] [CrossRef
[4] Aitchison, J. and Egozcue, J.J. (2005) Compositional Data Analysis: Where Are We and Where Should We Be Heading. Mathematical Geology, 37, 829-850. [Google Scholar] [CrossRef
[5] Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G., et al. (2003) Isometric Logratio Transformations for Compositional Data Analysis. Mathematical Geology, 35, 279-300. [Google Scholar] [CrossRef
[6] Wang, H., Liu, Q., Henry, M.K., et al. (2007) A Hyperspherical Transformation Forecasting Model for Compositional Data. European Journal of Operational Research, 179, 459-468. [Google Scholar] [CrossRef
[7] Bickel, P.J. and Levina, E. (2008) Covariance Regularization by Thresholding. Annals of Statistics, 36, 2577-2604. [Google Scholar] [CrossRef
[8] Rothman, A.J., Levina, E. and Zhu, J. (2009) Generalized Thresholding of Large Covariance Matrices. Journal of the American Statistical Association, 104, 177-186. [Google Scholar] [CrossRef
[9] Cai, T. and Liu, W. (2011) Adaptive Thresholding for Sparse Covariance Matrix Estimation. Journal of the American Statistical Association, 106, 672-684. [Google Scholar] [CrossRef
[10] Friedman, J., Hastie, T. and Tibshirani, R. (2008) Sparse Inverse Covariance Estimation with the Graphical Lasso. Biostatistics, 9, 432-441. [Google Scholar] [CrossRef] [PubMed]
[11] Cai, T., Liu, W. and Luo, X. (2011) A Constrained L1 Minimization Approach to Sparse Precision Matrix Estimation. Journal of the American Statistical Association, 106, 594-607. [Google Scholar] [CrossRef
[12] Liu, W. and Luo, X. (2015) Fast and Adaptive Sparse Precision Matrix Estimation in High Dimensions. Journal of Multivariate Analysis, 135, 153-162. [Google Scholar] [CrossRef] [PubMed]
[13] Fan, J., Liao, Y. and Liu, H. (2016) An Overview of the Estimation of Large Covariance and Precision Matrices. The Econometrics Journal, 19, C1-C32. [Google Scholar] [CrossRef
[14] Cao, Y., Lin, W. and Li, H. (2018) Large Covariance Estimation for Compositional Data via Composition-Adjusted Thresholding. Journal of the American Statistical Association, 114, 759-772.
[15] Lu, J.R., Shi, P.X. and Li, H.Z. (2018) Generalized Linear Models with Linear Constraints for Microbiome Compositional Data. Biometrics.