高维数据降维中的PCA与CUR分解对比分析
PCA and CUR Decomposition Analysis in Dimensionality Reduction of High-Dimensional Data
摘要:
在大数据的时代,使高维数据降低维度有很多种方法。其中,在线性降维方法中最典型的方法是PCA和CUR分解方法,但是目前我们利用这两种方法对高维数据降维的研究成果还不够,因此,本文通过对这主成分分析和CUR分解方法的探讨和研究,分析了这两种降维方法的使用条件和实际效果,得出:与传统的主成分分析的矩阵分解方法相比较,在特征选择方面,CUR分解方法不仅具有很高的准确度,而且还具有很好的可解释性;在矩阵恢复方面,CUR矩阵分解方法具有很高的稳定性同时还具有很高的准确度,其准确度有时候能够达到90%以上,因此我认为CUR矩阵分解具有很好的应用价值,值得我们利用CUR矩阵分解法去对高维数据进行降维。
Abstract:
In the era of big data, there are many ways to reduce the dimension of high-dimensional data. Among them, the linear dimension reduction method is the most typical method in PCA and CUR decomposition method, but at present, using the two methods on the research achievements of high-dimensional data dimension reduction is not enough. Therefore, through the discussion and research of the principal component analysis and CUR decomposition method, this paper analyzes the use conditions and practical effects of the two methods. It is concluded that: with the traditional principal component analysis method of the matrix decomposition, in terms of feature selection, CUR decomposition method not only has high accuracy, but also has good interpretability. In terms of matrix recovery, CUR matrix decomposition method has high stability and accuracy, and its accuracy can sometimes reach more than 90%. Therefore, I think CUR matrix decomposition has good application value, and it is worth using CUR matrix decomposition method to reduce the dimension of high-dimensional data.
参考文献
|
[1]
|
Bobadilla, J., Ortega, F. and Hernando, A. (2013) Recommender Systems Survey. Knowledge-Based Systems, 46, 109-132. [Google Scholar] [CrossRef]
|
|
[2]
|
林海明. 对主成分分析法运用中十个问题的解析[J]. 统计与决策, 2007(16): 16-18.
|
|
[3]
|
雷恒鑫, 刘惊雷. 基于行列联合选择矩阵分解的偏好特征提取[J]. 模式识别与人工智能, 2017, 30(3): 279-288.
|
|
[4]
|
雷恒鑫, 刘惊雷. 利用CUR矩阵分解提高特征选择与矩阵恢复能力[J]. 计算机应用, 2017, 37(3): 640-646.
|
|
[5]
|
Kumar, R., Verma, B.K. and Rastogi, S.S. (2014) Social Popularity Based SVD++ Recommender System. International Journal of Computer Applications, 87, 33-37. [Google Scholar] [CrossRef]
|
|
[6]
|
张梦阳. 矩阵分解的常用方法[J]. 成才之路, 2012(36): 39-39.
|
|
[7]
|
Jia, Y., Zhang, C., Lu, Q., et al. (2014) Users’ Brands Preference Based on SVD++ in Recommender Systems. Proceedings of the 2014 IEEE Workshop on Advanced Research and Technology in Industry Applications, Ottawa, ON, 29-30 September 2014, 1175-1178.
|
|
[8]
|
王群英. 矩阵分解方法的探究[J]. 长春工业大学学报, 2011, 32(1): 95-101.
|
|
[9]
|
Xu, F., Gu, G., Kong, X., et al. (2016) Object Tracking Based on Two-Dimensional PCA. Optical Review, 23, 231-243. [Google Scholar] [CrossRef]
|
|
[10]
|
Drineas, P., Mahoney, M.W. and Muthukrishnan, S. (2008) Relative-Error CUR Matrix Decompositions. SIAM Journal on Matrix Analysis and Applications, 30, 844-881. [Google Scholar] [CrossRef]
|
|
[11]
|
李明. 基于矩阵分解理论学习的数据降维算法研究[D]: [硕士学位论文]. 大连: 辽宁师范大学, 2011.
|