一种用于分类问题的属性加权方法
A Weighting Attribute Method for Classification Problems
摘要:
属性加权调整通常用于机器学习方法中以提高这些方法的性能。在本文中,我们提出了一种基于互信息的新型属性加权方法,并将该方法应用于两种经典的机器学习分类方法中。我们通过在威斯康星州乳腺癌数据集进行实验来研究加权方法的性能。我们的实验结果表明,针对分类任务,我们的加权机器学习方法往往优于相应的传统机器学习方法,从而证明了本文提出的加权方法的合理性和实用性。
Abstract:
Attribute weighting adjustments are used in machine learning models to improve performance. In this paper, we propose a novel attribute weighting method based on mutual information and apply this method to two classical machine learning models for classification. We study the performance of our weighting method by conducting experiments on the Wisconsin Breast Cancer database. In both machine learning models, our weighted attribute models tend to outperform the corresponding conventional machine learning models in classification which also approves that our weighting method is reasonable and applicable.
参考文献
|
[1]
|
李航. 统计学习方法[M]. 北京: 清华大学出版社, 2012: 122.
|
|
[2]
|
Karabatak, M. (2015) A New Classifier for Breast Cancer Detection Based on Naïve Bayesian. Measurement, 72, 32-36. [Google Scholar] [CrossRef]
|
|
[3]
|
Zaidi, N.A., Cerquides, J., Carman, M.J., et al. (2013) Alleviating Naive Bayes Attribute Independence Assumption by Attribute Weighting. Journal of Machine Learning Research, 14, 1947-1988.
|
|
[4]
|
Wu, J., Pan, S., Cai, Z., et al. (2014) Dual Instance and Attribute Weighting for Naive Bayes Classification. International Joint Conference on Neural Networks (IJCNN), Beijing, 1675-1679. [Google Scholar] [CrossRef]
|
|
[5]
|
Wettschereck, D., Aha, D.W. and Mohri, T. (1977) A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithm. Artificial Intelligence Review, 11, 273-314. [Google Scholar] [CrossRef]
|
|
[6]
|
Gupta, M. (2012) Dynamic k-NN with Attribute Weighting for Automatic Web Page Classification (Dk-NNwAW). International Journal of Computer Applications, 58, 34-40. [Google Scholar] [CrossRef]
|
|
[7]
|
周志华. 机器学习[M]. 北京: 清华大学出版社, 2016: 121-128.
|
|
[8]
|
Mangasarian, O.L. and Wolberg, W.H. (1990) Cancer Diagnosis via Linear Programming. SIAM News, 23, 1-18.
|
|
[9]
|
Bagui, S.C., Bagui, S., Pal, K. and Pal, N.R. (2003) Breast Cancer Detection Using Rank Nearest Neighbor Classification Rules. Pattern Recognition, 36, 25-34. [Google Scholar] [CrossRef]
|