序决策系统下近似约简的启发式算法
Heuristic Algorithm to Attribute Reduction for Lower Approximation Preservation in Ordered Decision Systems
DOI: 10.12677/CSA.2021.111013, PDF,  被引量    科研立项经费支持
作者: 孙祖文, 唐玉凯:烟台大学计算机与控制工程学院,山东 烟台
关键词: 属性约简启发式算法粗糙集序决策系统Attribute Reduction Heuristic Algorithm Rough Set Theory Ordered Decision Systems
摘要: 随着网络的进步,社会中产生了大量的高维数据,但很多统计方法难以直接应用到高维数据上。如何获得去噪简化且保存关键信息的低维度数据是一项急需解决的问题。粗糙集理论提供了一种数据降维的方法,被称为属性约简。属性约简的目标是保证原数据集的一种分类特征不变,获得其最小的属性子集。目前,基于不同的分类特征,已提出了许多不同的属性约简算法,如下近似保持、分布保持等。在序决策系统中,传统的下近似约简算法是基于差别矩阵的,计算复杂度高。为了解决这一问题,本文使用依赖度,设计了一个后向贪婪的启发式算法来计算下近似约简。实验使用6组UCI数据集。实验结果表明本文设计的算法可以得到正确的下近似约简,并在时间效率上优于传统的差别矩阵算法。
Abstract: With the development of network, a large number of high-dimensional data are generated in the society, but many statistical methods are difficult to be directly applied to high-dimensional data. How to obtain the denoising simplified data with low dimensions and key information is an urgent problem. Rough set theory provides a method for reducing data dimensions, called attribute reduction. The purpose of attribute reduction is to obtain a minimal subset of attributes without changing a classification property of the original data. Until now, different attribute reduction methods are proposed for different classification properties, such as lower approximation preservation and distribution preservation. In an ordered decision system, the traditional attribute reduction algorithm is based on discernibility matrices, and the computation complexity is high. To solve this problem, we design a backward greedy heuristic algorithm to compute a reduct for lower approximation preservation. The experiments are conducted in six UCI data sets. The experimental results show that the algorithm gets a correct reduct and is better than the traditional discernibility matrix algorithm in time efficiency.
文章引用:孙祖文, 唐玉凯. 序决策系统下近似约简的启发式算法[J]. 计算机科学与应用, 2021, 11(1): 113-120. https://doi.org/10.12677/CSA.2021.111013

参考文献

[1] Pawlak, Z. (1982) Rough Sets. International Journal of Computer & Information Sciences, 11, 341-356. [Google Scholar] [CrossRef
[2] Zhang, J.B., Li, T.R., Ruan, D. and Liu, D. (2012) Neighborhood Rough Sets for Dynamic Data Mining. International Journal of Intelligent Systems, 27, 317-342. [Google Scholar] [CrossRef
[3] Huang, H., Meng, F.Z., Zhou, S.H., Jiang, F. and Manogaran, G. (2019) Brain Image Segmentation Based on FCM Clustering Algorithm and Rough Set. IEEE Access, 7, 12386-12396. [Google Scholar] [CrossRef
[4] Sai, Y., Yao, Y.Y. and Zhong, N. (2001) Data Analysis and Mining in Ordered Information Tables. 2001 IEEE International Conference of Data Mining, San Jose, 29 November-2 December 2001, 497-504. [Google Scholar] [CrossRef
[5] Swiniarski, R.W. and Skowron, A. (2003) Rough Set Method in Feature Selection and Recognition. Pattern Recognition Letters, 24, 833-849. [Google Scholar] [CrossRef
[6] 唐玉凯, 张楠, 童向荣, 张小峰. 不完备决策系统下的多特定类广义决策约简[J]. 智能系统学报, 2019, 14(6): 1199-1208. http://dx.doi.org/10.11992/tis.201905059 [Google Scholar] [CrossRef
[7] 许鑫. 连续值信息系统的不确定性度量[J]. 计算机科学与应用, 2017, 7(4): 388-397. [Google Scholar] [CrossRef
[8] 苗夺谦, 胡桂荣. 知识约简的一种启发式算法[J]. 计算机研究与发展, 1999, 36(6): 681-684.
[9] Greco, S., Matarazzo, B. and Slowinski, R. (1998) A New Rough Set Approach to Multicriteria and Multiattribute Classification. 1998 International Conference on Rough Sets and Current Trends in Computing, Warsaw, 22-26 June 1998, 60-67. [Google Scholar] [CrossRef
[10] 袁修久, 何华灿. 优势关系下广义决策约简和上近似约简[J]. 计算机工程与应用, 2006, 42(5): 4-7. http://dx.chinadoi.cn/10.3321/j.issn:1002-8331.2006.05.002
[11] 徐伟华, 张晓燕, 张文修. 优势关系下不协调目标信息系统的下近似约简[J]. 计算机工程与应用, 2009, 45(16): 66-69, 76. http://dx.chinadoi.cn/10.3778/j.issn.1002-8331.2009.16.018
[12] Qian, Y.H., Dang C.Y., Liang, J.Y. and Tang, D.W. (2009) Set-Valued Ordered Information Systems. Information Sciences, 179, 2809-2832. [Google Scholar] [CrossRef
[13] Qian, Y.H., Liang, J.Y. and Dang C.Y. (2008) Interval Ordered In-formation Systems. Computers & Mathematics with Applications, 56, 1994-2009. [Google Scholar] [CrossRef
[14] Skowron, A. and Rauszer, C. (1992) The Discernibility Matri-ces and Functions in Information Systems. In: Słowiński, R., Ed., Intelligent Decision Support, Springer, Dordrecht, 331-362. [Google Scholar] [CrossRef
[15] Hu, Q.H., Yu, D.R. and Guo, M.Z. (2010) Fuzzy Preference Based Rough Sets. Information Sciences, 180, 2003-2022. [Google Scholar] [CrossRef
[16] Du, W.S. and Hu, B.Q. (2018) A Fast Heuristic Attribute Reduction Approach to Ordered Decision Systems. European Journal of Operational Research, 264, 440-452. [Google Scholar] [CrossRef
[17] 赵立威, 张楠, 张中喜. 基于特征粒的序决策系统快速约简研究[J]. 山西大学学报(自然科学版), 2020, 43(4): 897-905.
[18] Du, W.S. and Hu, B.Q. (2014) Approximate Distri-bution Reducts in Inconsistent Interval-Valued Information Systems. Information Sciences, 271, 93-114. [Google Scholar] [CrossRef