一种用于分类问题的属性加权方法
A Weighting Attribute Method for Classification Problems
DOI: 10.12677/AAM.2020.95089, PDF,    国家自然科学基金支持
作者: 王庭芳, 李东喜:太原理工大学数学学院,山西 晋中
关键词: 属性加权分类问题朴素贝叶斯k近邻方法Weighted Attribute Classification Problem Na?ve Bayes k-Nearest Neighbor
摘要: 属性加权调整通常用于机器学习方法中以提高这些方法的性能。在本文中,我们提出了一种基于互信息的新型属性加权方法,并将该方法应用于两种经典的机器学习分类方法中。我们通过在威斯康星州乳腺癌数据集进行实验来研究加权方法的性能。我们的实验结果表明,针对分类任务,我们的加权机器学习方法往往优于相应的传统机器学习方法,从而证明了本文提出的加权方法的合理性和实用性。
Abstract: Attribute weighting adjustments are used in machine learning models to improve performance. In this paper, we propose a novel attribute weighting method based on mutual information and apply this method to two classical machine learning models for classification. We study the performance of our weighting method by conducting experiments on the Wisconsin Breast Cancer database. In both machine learning models, our weighted attribute models tend to outperform the corresponding conventional machine learning models in classification which also approves that our weighting method is reasonable and applicable.
文章引用:王庭芳, 李东喜. 一种用于分类问题的属性加权方法[J]. 应用数学进展, 2020, 9(5): 752-758. https://doi.org/10.12677/AAM.2020.95089

参考文献

[1] 李航. 统计学习方法[M]. 北京: 清华大学出版社, 2012: 122.
[2] Karabatak, M. (2015) A New Classifier for Breast Cancer Detection Based on Naïve Bayesian. Measurement, 72, 32-36. [Google Scholar] [CrossRef
[3] Zaidi, N.A., Cerquides, J., Carman, M.J., et al. (2013) Alleviating Naive Bayes Attribute Independence Assumption by Attribute Weighting. Journal of Machine Learning Research, 14, 1947-1988.
[4] Wu, J., Pan, S., Cai, Z., et al. (2014) Dual Instance and Attribute Weighting for Naive Bayes Classification. International Joint Conference on Neural Networks (IJCNN), Beijing, 1675-1679. [Google Scholar] [CrossRef
[5] Wettschereck, D., Aha, D.W. and Mohri, T. (1977) A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithm. Artificial Intelligence Review, 11, 273-314. [Google Scholar] [CrossRef
[6] Gupta, M. (2012) Dynamic k-NN with Attribute Weighting for Automatic Web Page Classification (Dk-NNwAW). International Journal of Computer Applications, 58, 34-40. [Google Scholar] [CrossRef
[7] 周志华. 机器学习[M]. 北京: 清华大学出版社, 2016: 121-128.
[8] Mangasarian, O.L. and Wolberg, W.H. (1990) Cancer Diagnosis via Linear Programming. SIAM News, 23, 1-18.
[9] Bagui, S.C., Bagui, S., Pal, K. and Pal, N.R. (2003) Breast Cancer Detection Using Rank Nearest Neighbor Classification Rules. Pattern Recognition, 36, 25-34. [Google Scholar] [CrossRef