嵌入属性加权的实例加权朴素贝叶斯算法
Embedded-Attribute Weighted Instance-Weighted Naive Bayes
DOI: 10.12677/AAM.2023.125241, PDF,  被引量    国家自然科学基金支持
作者: 杨 柳, 胡桂开*, 彭 萍, 曾嘉琪:东华理工大学理学院,江西 南昌
关键词: 朴素贝叶斯实例加权属性加权Naive Bayes Case Weighting Attribute Weighting
摘要: 朴素贝叶斯是一类应用广泛的分类算法,它是根据贝叶斯定理和属性条件独立来实现的。然而,属性条件独立性假设在现实生活中难以满足,为减少该假设对朴素贝叶斯算法效果的影响,本文提出了一种将属性加权嵌入到实例加权过程中的朴素贝叶斯算法。首先,基于相关性属性加权算法计算各个属性的权重;其次,将实例众数与训练实例的相似度进行属性加权,并按照不同实例众数对加权后的相似度进行算术平均得到实例权重;然后,利用实例权重构建加权朴素贝叶斯分类器;最后,采用标准UCI数据集将我们提出的算法和朴素贝叶斯算法、实例加权朴素贝叶斯算法进行仿真实验,结果表明我们提出的算法在准确率以及F1值上优于其它两种算法。
Abstract: Naive Bayes is a widely used classification algorithm, which is independently implemented based on Bayesian theorem and attribute conditions. However, the assumption of attribute conditional independence is difficult to meet in real life. To reduce the impact of this assumption on the per-formance of naïve Bayesian algorithms, we propose a naive Bayes algorithm by embedding attrib-ute weighting into instance weighting process. Firstly, the weight of each attribute is calculated based on the correlation attribute weighting algorithm. Secondly, the similarity between the in-stance mode and the training instance is weighted by attribute, and the weighted similarity is arithmetically averaged according to the different mode instances to get the instance weight. Then, a weighted naive Bayes classifier is constructed using case weights. Finally, the standard UCI data set is used to simulate the proposed algorithm, naive Bayes algorithm and case weighted naive Bayes algorithm. The results show that the proposed algorithm is superior to the other two algo-rithms in accuracy and F1 value.
文章引用:杨柳, 胡桂开, 彭萍, 曾嘉琪. 嵌入属性加权的实例加权朴素贝叶斯算法[J]. 应用数学进展, 2023, 12(5): 2392-2401. https://doi.org/10.12677/AAM.2023.125241

参考文献

[1] Zhang, H., Jiang, L.X. and Yu, L.J. (2021) Attribute and Instance Weighted Naive Bayes. Pattern Recognition, 111, 107674-107684. [Google Scholar] [CrossRef
[2] Friedman, N., Geiger, D. and Goldszmidt, M. (1997) Bayesian Network Classifiers. Machine Learning, 29, 131-163. [Google Scholar] [CrossRef
[3] Webb, G.I., Boughton, J.R. and Wang, Z.H. (2005) Not So Naive Bayes: Aggregating One-Dependence Estimators. Machine Learning, 58, 5-24. [Google Scholar] [CrossRef
[4] Jiang, L.X., Zhang, H. and Cai, Z.H. (2008) A Novel Bayes Model: Hidden Naive Bayes. IEEE Transactions on Knowledge and Data Engineering, 21, 1361-1371. [Google Scholar] [CrossRef
[5] Wu, J., Pan, S.R., Zhu, X.Q., et al. (2016) Sode: Self-Adaptive One-Dependence Estimators for Classification. Pattern Recognition, 51, 358-377. [Google Scholar] [CrossRef
[6] Yu, L.J., Jiang, L.X., Wang, D.H., et al. (2017) Attribute Value Weighted Average of One-Dependence Estimators. Entropy, 19, 501-517. [Google Scholar] [CrossRef
[7] Harzevili, N.S. and Alizadeh, S.H. (2018) Mixture of Latent Multinomial Naive Bayes Classifier. Applied Soft Computing, 69, 516-527. [Google Scholar] [CrossRef
[8] El Hindi, K.M. (2014) Fine Tuning the Naïve Bayesian Learning Algorithm. AI Communications, 27, 133-141. [Google Scholar] [CrossRef
[9] El Hindi, K.M. (2018) Combining Instance Weighting and Fine Tuning for Training Naïve Bayesian Classifiers with Scan Training Data. The International Arab Journal of Information Tech-nology, 15, 1099-1106.
[10] Zhang, H. and Jiang, L.X. (2022) Fine Tuning Attribute Weighted Naive Bayes. Neuro-computing, 488, 402-411. [Google Scholar] [CrossRef
[11] 余良俊. 属性加权的贝叶斯网络分类算法及其应用研究[D]: [博士学位论文]. 武汉: 中国地质大学, 2018.
[12] Zhang, H. and Sheng, S.L. (2004) Learning Weighted Naive Bayes with Accurate Ranking. 4th IEEE International Conference on Data Mining (ICDM’04), Brighton, 1-4 November 2004, 567-570.
[13] Hall, M. (2007) A Decision Tree-Based Attribute Weighting Filter for Naive Bayes. In: Research and Development in Intelligent Systems XXIII: Proceedings of AI-2006, the 26th SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, Springer, London, 59-70. [Google Scholar] [CrossRef
[14] Lee, C.H., Gutierrez, F. and Dou, D. (2011) Calculating Feature Weights in Naive Bayes with Kullback-Leibler Measure. 2011 IEEE 11th International Conference on Data Mining, Vancouver, 11 December 2011, 1146-1151. [Google Scholar] [CrossRef
[15] Jiang, L.X., Zhang, L.G., Li, C.Q., et al. (2018) A Correlation-Based Feature Weighting Filter for Naive Bayes. IEEE Transactions on Knowledge and Data Engineering, 31, 201-213. [Google Scholar] [CrossRef
[16] Hall, M.A. (2000) Correlation-Based Feature Selection of Dis-crete and Numeric Class Machine Learning. 2000 Working Papers, 5: 00/08.
[17] Ratanamahatana, C. and Gunopulos, D. (2003) Feature Selection for the Naive Bayesian Classifier Using Decision Trees. Applied Artificial Intelligence, 17, 475-487. [Google Scholar] [CrossRef
[18] Jiang, L.X. and Zhang, H. (2006) Learning Naive Bayes for Probability Estimation by Feature Selection. Advances in Artificial Intelligence: 19th Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2006, Québec City, 7-9 June 2006, 503-514. [Google Scholar] [CrossRef
[19] Xie, Z.P., Hsu, W., Liu, Z.T., et al. (2002) Snnb: A Selective Neigh-borhood Based Naive Bayes for Lazy Learning. Advances in Knowledge Discovery and Data Mining: 6th Pacific-Asia Conference, PAKDD 2002, Taipei, 6-8 May 2002, 104-114. [Google Scholar] [CrossRef
[20] Frank, E., Hall, M. and Pfahringer, B. (2003) Locally Weighted Naive Bayes. The Conference on Uncertainty in Artificial Intel-ligence, Acapulco, 7-10 August 2003, 249-256.
[21] Jiang, L.X., Cai, Z.H. and Wang, D.H. (2010) Improving Naive Bayes for Classification. International Journal of Computers and Applications, 32, 328-332. [Google Scholar] [CrossRef
[22] Xu, W.Q., Jiang, L.X. and Yu, L.J. (2019) An Attribute Value Frequency-Based Instance Weighting Filter for Naive Bayes. Journal of Experimental & Theoretical Artificial In-telligence, 31, 225-236. [Google Scholar] [CrossRef