Abstract: An aggregate constraint with items of varying values (ACV) is introduced. The ACV constraint expresses users' requirements on the aggregate features of target patterns. An algorithm for mining frequent sequential patterns under the ACV constraint is proposed; it exploits the computational properties of ACV to prune the search space effectively. Experimental results on synthetic sequential data generated by the IBM data generator and on a real-world data set show that the proposed algorithm uses ACV constraints to prune useless candidate sequential patterns, which reduces the redundant search space and improves mining efficiency.
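The general idea of pushing an aggregate constraint into pattern growth can be illustrated with a minimal sketch. It is not the algorithm proposed here: it assumes, for simplicity, that every item has one fixed non-negative value (whereas ACV allows varying values), that each sequence element is a single item, and that the constraint is an upper bound on the sum of item values. The names `mine`, `values`, `min_sup` and `max_sum` are hypothetical. Under these assumptions the sum can only increase as a pattern is extended, so any prefix that exceeds the bound can be discarded together with all of its extensions.

```python
from collections import defaultdict


def mine(db, values, min_sup, max_sum):
    """Return {pattern: support} for frequent subsequences with total value <= max_sum.

    Illustrative assumptions: db is a list of item lists, values maps each
    item to one fixed non-negative number, and support counts sequences.
    """
    results = {}

    def grow(prefix, prefix_sum, projected):
        # projected holds (sequence index, start position) pairs: the suffix
        # of each sequence that follows the earliest match of the prefix.
        supporters = defaultdict(set)
        for sid, start in projected:
            for item in set(db[sid][start:]):
                supporters[item].add(sid)

        for item in sorted(supporters):
            support = len(supporters[item])
            if support < min_sup:
                continue  # infrequent extension
            new_sum = prefix_sum + values[item]
            if new_sum > max_sum:
                # Values are non-negative, so every longer pattern with this
                # prefix also violates the bound: prune the whole branch.
                continue
            pattern = prefix + (item,)
            results[pattern] = support

            # Project each supporting sequence past the first occurrence
            # of the new item.
            new_projected = []
            for sid, start in projected:
                try:
                    pos = db[sid].index(item, start)
                except ValueError:
                    continue
                new_projected.append((sid, pos + 1))
            grow(pattern, new_sum, new_projected)

    grow((), 0, [(sid, 0) for sid in range(len(db))])
    return results


# Toy data, invented for illustration only.
example_db = [["a", "b", "c"], ["a", "c"], ["b", "a", "c"]]
example_values = {"a": 1, "b": 5, "c": 2}
print(mine(example_db, example_values, min_sup=2, max_sum=3))
```

On this toy input, item b is frequent but its value alone exceeds the bound, so b and every pattern containing it are skipped without further projection; only a, c and the sequence a followed by c remain.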