混合决策信息系统下基于加权k近邻的属性约简
Attribute Reduction Based on Weighted k-Nearest Neighborhood in Hybrid Decision Information System
摘要: 属性约简是粗糙集理论的一项重要应用,大多数的邻域粗糙集模型只关注邻域粒完全包含在某些决策类中的对象,而忽略邻域粒不包含在任何决策类中的边界对象的可分性。本文采用k近邻来描述混合信息系统对象的粒化,利用边界对象邻域粒中对象的决策信息以及在各自决策类中的分布情况构造对象的评价函数,并基于此定义近似正域。进而在混合型信息系统上提出基于加权k近邻的约简定义,给出属性的重要性度量,以及基于加权k近邻的属性约简方法。最后在3个混合数据集上对约简结果的分类精度进行比较。与传统邻域粗糙集的属性约简算法相比,该算法能保证约简后数据有较高的分类精度。
Abstract: Attribute reduction is an important application of rough set theory. Most neighborhood rough set models only focus on the objects whose neighborhood granules are completely contained in some decision classes, but ignore the divisibility of boundary objects whose neighborhood granules are not contained in any decision classes. In this paper, the k-nearest neighborhood is used to describe the granulation of hybrid information system objects. The evaluation function of objects is constructed by using the decision information of objects in the neighborhood of boundary objects and their distribution in their decision classes. Then, the definition of reduction based on weighted k-nearest neighborhood is proposed in hybrid information system, the importance measure of attribute and the attribute reduction method based on weighted k-nearest neighborhood are given. Finally, the classification accuracy of the reduction results is compared on 3 mixed data sets. Compared with the traditional neighborhood rough set attribute reduction algorithm, the proposed algorithm in this paper can ensure the high classification accuracy of the reduced data.
文章引用:马达. 混合决策信息系统下基于加权k近邻的属性约简[J]. 应用数学进展, 2025, 14(4): 310-322. https://doi.org/10.12677/aam.2025.144164

参考文献

[1] Qian, Y., Liang, J., Pedrycz, W. and Dang, C. (2010) Positive Approximation: An Accelerator for Attribute Reduction in Rough Set Theory. Artificial Intelligence, 174, 597-618. [Google Scholar] [CrossRef
[2] Lin, Y., Hu, Q., Liu, J., Li, J. and Wu, X. (2017) Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information. IEEE Transactions on Fuzzy Systems, 25, 1491-1507. [Google Scholar] [CrossRef
[3] Zhu, P., Zhu, W., Hu, Q., Zhang, C. and Zuo, W. (2017) Subspace Clustering Guided Unsupervised Feature Selection. Pattern Recognition, 66, 364-374. [Google Scholar] [CrossRef
[4] Yao, S., Xu, F., Zhao, P., et al. (2017) Feature Selection Algorithm Based on Neighborhood Valued Tolerance Relation Rough Set Model. Pattern Recognition and Artificial Intelligence, 30, 416-428.
[5] Pawlak, Z. (1982) Rough Sets. International Journal of Computer & Information Sciences, 11, 341-356. [Google Scholar] [CrossRef
[6] Guan, Y. and Wang, H. (2006) Set-Valued Information Systems. Information Sciences, 176, 2507-2525. [Google Scholar] [CrossRef
[7] Leung, Y., Fischer, M.M., Wu, W. and Mi, J. (2008) A Rough Set Approach for the Discovery of Classification Rules in Interval-Valued Information Systems. International Journal of Approximate Reasoning, 47, 233-246. [Google Scholar] [CrossRef
[8] Dubois, D. and Prade, H. (1990) Rough Fuzzy Sets and Fuzzy Rough Sets. International Journal of General Systems, 17, 191-209. [Google Scholar] [CrossRef
[9] Zeng, A., Li, T., Liu, D., Zhang, J. and Chen, H. (2015) A Fuzzy Rough Set Approach for Incremental Feature Selection on Hybrid Information Systems. Fuzzy Sets and Systems, 258, 39-60. [Google Scholar] [CrossRef
[10] HU, Q., YU, D. and XIE, Z. (2008) Neighborhood Classifiers. Expert Systems with Applications, 34, 866-876. [Google Scholar] [CrossRef
[11] Hu, Q., Yu, D., Liu, J. and Wu, C. (2008) Neighborhood Rough Set Based Heterogeneous Feature Subset Selection. Information Sciences, 178, 3577-3594. [Google Scholar] [CrossRef
[12] Fan, X., Zhao, W., Wang, C. and Huang, Y. (2018) Attribute Reduction Based on Max-Decision Neighborhood Rough Set Model. Knowledge-Based Systems, 151, 16-23. [Google Scholar] [CrossRef
[13] Chen, H., Li, T., Cai, Y., Luo, C. and Fujita, H. (2016) Parallel Attribute Reduction in Dominance-Based Neighborhood Rough Set. Information Sciences, 373, 351-368. [Google Scholar] [CrossRef
[14] Wang, C., Shao, M., He, Q., Qian, Y. and Qi, Y. (2016) Feature Subset Selection Based on Fuzzy Neighborhood Rough Sets. Knowledge-Based Systems, 111, 173-179. [Google Scholar] [CrossRef
[15] Shu, W., Qian, W. and Xie, Y. (2020) Incremental Feature Selection for Dynamic Hybrid Data Using Neighborhood Rough Set. Knowledge-Based Systems, 194, Article 105516. [Google Scholar] [CrossRef
[16] Hu, M., Tsang, E.C.C., Guo, Y., Chen, D. and Xu, W. (2021) A Novel Approach to Attribute Reduction Based on Weighted Neighborhood Rough Sets. Knowledge-Based Systems, 220, Article 106908. [Google Scholar] [CrossRef
[17] Sewwandi, M.A.N.D., Li, Y. and Zhang, J. (2024) Granule-Specific Feature Selection for Continuous Data Classification Using Neighborhood Rough Sets. Expert Systems with Applications, 238, Article 121765. [Google Scholar] [CrossRef
[18] Chen, H., Li, T., Fan, X. and Luo, C. (2019) Feature Selection for Imbalanced Data Based on Neighborhood Rough Sets. Information Sciences, 483, 1-20. [Google Scholar] [CrossRef
[19] Zhang, X., Mei, C., Chen, D. and Li, J. (2016) Feature Selection in Mixed Data: A Method Using a Novel Fuzzy Rough Set-Based Information Entropy. Pattern Recognition, 56, 1-15. [Google Scholar] [CrossRef
[20] Wang, C., Shi, Y., Fan, X. and Shao, M. (2019) Attribute Reduction Based on K-Nearest Neighborhood Rough Sets. International Journal of Approximate Reasoning, 106, 18-31. [Google Scholar] [CrossRef
[21] 张文修, 梁怡, 吴伟志. 信息系统与知识发现[M]. 北京: 科学出版社, 2003.
[22] Wang, N. and Zhao, E. (2024) A New Method for Feature Selection Based on Weighted k-Nearest Neighborhood Rough Set. Expert Systems with Applications, 238, Article 122324. [Google Scholar] [CrossRef