基于非参数方法的分类模型交叉验证结果比较

doi:10.12677/CSA.2016.63017

期刊菜单

基于非参数方法的分类模型交叉验证结果比较
Comparison of Cross Validation Results of Classification Model Based on Nonparametric Method

DOI: 10.12677/CSA.2016.63017, PDF, HTML, XML, 下载: 1,961 浏览: 10,613
作者: 徐奇钊：云南财经大学，云南昆明
关键词: 交叉验证；模型比较；非参数；假设检验；Cross Validation； Model Comparison； Nonparametric； Hypothesis Test

摘要: 本文主要研究了基于非参数方法的分类模型交叉验证结果比较，主要是对实例通过非参数的方法进行模型比较的假设检验，检验两分类模型是否存在显著差异。模型的真实泛化误差是一个较为科学的模型比较标准，对于分类模型而言，模型的真实泛化误差表现为分类模型的误判率，而基于交叉验证得到的结果是模型误判率的一个优良估计，可以通过交叉验证结果对模型进行比较。交叉验证结果是随机变量，存在分布，而对于此随机变量而言，其分布是很难观测的，因此，对于交叉验证结果的比较，本文通过非参数的方法进行模型比较的假设检验，检验两分类模型是否存在显著差异。

Abstract: The true generalization error is a scientific evaluation criterion to model selection. For the classi-fication model, the rate of miscarriage of justice, which is an excellent estimation, is based on cross validation to the true generalization error. So we compare models through the cross validation results. Cross validation results are random variables, which have its distribution. For the random variable, its distribution is very hard to detect. Therefore, based on the comparison of cross validation results, this paper designs a hypothesis testing through the nonparametric method to inspect whether a significant difference exists between two classification models.

文章引用：徐奇钊. 基于非参数方法的分类模型交叉验证结果比较[J]. 计算机科学与应用, 2016, 6(3): 132-136. http://dx.doi.org/10.12677/CSA.2016.63017

参考文献

[1]	Wasseman, L. (2000) Bayesian Model Selection and Model Averaging. Journal of Mathematical Psychology, 44, 92- 107. http://dx.doi.org/10.1006/jmps.1999.1278
[2]	吴喜之. 复杂数据统计方法[M]. 北京: 中国人民大学出版社, 2012.
[3]	高红. 基于交叉验证的错误率估计分析[J]. 科技信息, 2011(25): I0149.
[4]	Fushiki, T. (2011) Estima-tion of Prediction Error by Using K-Fold Cross-Validation. Statistics & Computing, 21, 137- 146. http://dx.doi.org/10.1007/s11222-009-9153-8
[5]	Conover, W.J. (2012) Practical Nonparametric Statistics. Techno-metrics, 14, 977-979.
[6]	吴喜之. 非参数统计[M]. 北京: 中国统计出版社, 2013.

为你推荐

友情链接