Abstract: It is worthwhile to predict how many association rules can be mined from a given database without taking support and confidence into consideration. To this end, the concept of an expecting association rule is proposed in this paper, which turns the problem into computing the base number (cardinality) of an expecting association rule set. Computing formulas are presented for categorical and for continuous data respectively. The exclusive property of items in an itemset is discussed for continuous data that have been transformed into categorical data. From this property an expanding matrix and an expanding method are derived, and the method is used to compute the base number of the expecting association rule set of a continuous dataset concisely. The analysis and test results show that the size of an expecting association rule set decreases as the number of exclusive items increases. These results help in understanding the essence of association rule mining and in developing more efficient mining algorithms.
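The abstract does not reproduce the paper's formulas, so the following is only an illustrative sketch of the counting idea, not the authors' method. It assumes that a candidate rule is a pair X -> Y of non-empty, disjoint itemsets, and that "exclusive" items are the values obtained by discretizing one continuous attribute, so that at most one item per attribute group can appear in a rule. The function names and the `group_sizes` parameter are hypothetical.

```python
from itertools import product
from math import prod


def count_rules_closed_form(group_sizes):
    """Count rules X -> Y (X, Y non-empty and disjoint) when each group of
    mutually exclusive items contributes at most one item to the rule.

    Assumption (not taken from the paper): group_sizes[i] is the number of
    mutually exclusive items derived from attribute i.
    """
    # Each group is either unused, or puts one of its k items in X or in Y.
    all_assignments = prod(1 + 2 * k for k in group_sizes)
    # Assignments in which one side (X or Y) stays empty.
    one_side_empty = prod(1 + k for k in group_sizes)
    # Inclusion-exclusion: the all-unused assignment was subtracted twice.
    return all_assignments - 2 * one_side_empty + 1


def count_rules_brute_force(group_sizes):
    """Enumerate every assignment explicitly to cross-check the closed form."""
    choices = [
        [None] + [(side, item) for side in ("X", "Y") for item in range(k)]
        for k in group_sizes
    ]
    count = 0
    for combo in product(*choices):
        sides_used = {c[0] for c in combo if c is not None}
        if sides_used == {"X", "Y"}:  # both antecedent and consequent non-empty
            count += 1
    return count


if __name__ == "__main__":
    # Four items, grouped with increasing exclusivity.
    for groups in ([1, 1, 1, 1], [1, 1, 2], [2, 2], [4]):
        print(groups, count_rules_closed_form(groups))
```

Under these assumptions, four independent items give 50 candidate rules, which matches the familiar count 3^n - 2^(n+1) + 1 for n = 4. Grouping the same four items as [1, 1, 2], [2, 2] and [4] gives 22, 8 and 0 rules, which is consistent with the abstract's observation that the rule set shrinks as more items become exclusive.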
李凯里,王立宏,童向荣. 预期关联规则集及其基数的定量分析[J]. 模式识别与人工智能, 2010, 23(3): 402-407.
LI Kai-Li, WANG Li-Hong, TONG Xiang-Rong. Expecting Association Rule Set and Quantitative Analysis for Its Base Number. Pattern Recognition and Artificial Intelligence, 2010, 23(3): 402-407.