关联规则挖掘中闭频繁项集的理解与探索
Understanding and Exploration of Closed Frequent Itemsets in Association Rule Mining
DOI: 10.12677/AAM.2023.1211482, PDF,    科研立项经费支持
作者: 万 鑫*, 张慧娜*, 李裕梅, 王 鑫:北京工商大学数学与统计学院,北京
关键词: 关联规则挖掘频繁项集支持度闭包闭频繁项集Association Rule Mining Frequent Itemsets Support Closure Close Frequent Itemsets
摘要: 关联规则挖掘,通过数据挖掘事务之间的关联关系,被广泛应用到各个领域,主要是通过频繁项集产生关联规则,而频繁项集的挖掘又归结到闭频繁项集的挖掘,由此可见闭频繁项集在关联规则挖掘种的重要作用。本文以购物篮关联规则分析为场景,对闭频繁项集的理论进行了梳理,针对闭包算子定义中的函数i和t进行了单调性证明;对闭包算子定义满足的三条性质进行了证明;对频繁项集和其闭包的支持度进行了探讨;对封闭频繁项集和其超集之间的支持度进行了关系讨论。最后,按照频繁项集及其闭包所形成的等价类的性质进行了研究,给出了有关定理和证明,以及结论等。
Abstract: Association rule mining, is widely used in various fields by mining the association relationships between transactions through data mining, in which, association rules are mainly generated through frequent itemsets, and the mining of frequent itemsets is reduced to the mining of closed frequent itemsets, and this shows the important role of closed frequent itemsets in association rule mining. This article takes the analysis of shopping basket association rules as a scenario to sort out the theory of closed frequent itemsets, and proves the monotonicity of functions i and t in the defi-nition of closure operators; proves three properties that the closure operator definition satisfies; explores the support of frequent itemsets and their closures; discusses the relationship between the support of closed frequent itemsets and their supersets. Finally, the properties of the equiva-lence class formed by the frequent itemsets and their closure are studied, and relevant theorems, proofs, and conclusions are provided.
文章引用:万鑫, 张慧娜, 李裕梅, 王鑫. 关联规则挖掘中闭频繁项集的理解与探索[J]. 应用数学进展, 2023, 12(11): 4898-4905. https://doi.org/10.12677/AAM.2023.1211482

参考文献

[1] Agrawal, R. (1994) A Fast Algorithm for Mining Association Rules in Image. 2014 IEEE 5th International Conference on Software Engineering and Service Science, Beijing, 27-29 June 2014, 513-516. [Google Scholar] [CrossRef
[2] Wong, M.-H., Sze-To, H.-Y., Leung-Yau, L., et al. (2015) Discovering Binding Cores in Protein-DNA Binding Using Association Rule Mining with Statistical Measures. IEEE/ACM Transactions on Computational Biology & Bioinformatics, 12, 142-154. [Google Scholar] [CrossRef
[3] Xiang, Y., Shuai, C., Li, Y. and Zhang, Y. (2022) Information Reconstruction of Student Management Work Based on Association Rules Mining. Computational Intelligence and Neuroscience, 2022, Article ID: 2318515. [Google Scholar] [CrossRef] [PubMed]
[4] Shrestha, A., Zikos, D. and Fegaras, L. (2021) An Annotated Associa-tion Mining Approach for Extracting and Visualizing Interesting Clinical Events. International Journal of Medical In-formatics, 148, 104366. [Google Scholar] [CrossRef] [PubMed]
[5] Hu, K., Qiu, L., Zhang, S., et al. (2023) An Animal Dynamic Migration Optimization Method for Directional Association Rule Mining. Expert Systems with Applications, 211, 118617. [Google Scholar] [CrossRef
[6] Bezdek, J.C. (1974) Numerical Taxonomy with Fuzzy Sets. Journal of Mathematical Biology, 7, 57-71.
[7] Bezdek, J.C. (1974) Cluster Validity with Fuzzy Sets. Journal of Cy-bernetics, 3, 58-72. [Google Scholar] [CrossRef
[8] 李广璞, 黄妙华. 频繁项集挖掘的研究进展及主流方法[J]. 计算机科学, 2018, 45(11): 1-11, 26.
https://www.jsjkx.com/CN/Y2018/V45/I11A/1
[9] 谭军, 卜英勇, 杨勃. 一种高效的闭频繁模式挖掘算法[J]. 计算机工程与应用, 2010, 46(6): 130-132. [Google Scholar] [CrossRef
[10] [美]穆罕默德∙扎基, 小瓦格纳∙梅拉. 数据挖掘与分析-概念与算法[M]. 吴诚堃, 译. 北京: 人民邮电出版社, 2017.
[11] 毛伊敏, 陈志刚. 在线挖掘数据流闭频繁项集的高效算法[J]. 计算机科学, 2013, 40(2): 229-234. [Google Scholar] [CrossRef
[12] 柴玉梅, 张卓, 王黎明. 基于频繁概念直乘分布的全局闭频繁项集挖掘算法[J]. 计算机学报, 2012, 35(5): 990-1001. [Google Scholar] [CrossRef
[13] Aryabarzan, N. and Minaei-Bidgoli, B. (2021) NEclatClosed: A Vertical Algorithm for Mining Frequent Closed Itemsets. Expert Systems with Applications, 174, 114738. [Google Scholar] [CrossRef
[14] Li, B., Pei, Z., Qin, K., et al. (2019) TT-Miner: Topolo-gy-Transaction Miner for Mining Closed Itemset. Institute of Electrical and Electronics Engineers (IEEE), 7, 10798-10810. [Google Scholar] [CrossRef
[15] Wu, Y., Wang, M., Chu, W., et al. (2023) Or-ganization Preference Knowledge Acquisition of Multi-Platform Aircraft Mission System Utilizing Frequent Closed Itemset Mining. Aerospace, 10, Article 166. [Google Scholar] [CrossRef
[16] Liu, J.Q., Ye, Z.S., et al. (2021) Efficient Strategies for Incremen-tal Mining of Frequent Closed Itemsets over Data Streams. Expert Systems with Applications, 191, 116220. [Google Scholar] [CrossRef
[17] Pasquier, N., Bastide, Y., Taouil, R. and Lakhal, L. (1999) Effi-cient Mining of Association Rules Using Closed Itemset Lattices. Information Systems, 24, 25-46. [Google Scholar] [CrossRef
[18] Ganter, B. and Wille, R. (1999) Formal Concept Analysis: Mathematical Foundations. Springer-Verlag, Berlin. [Google Scholar] [CrossRef