Interestingness Rule Mining Algorithm Based on Information Entropy
JIN Zhou1,2, WANG Ru-Jing1
1 Bionic Computing and Intelligent Decision Laboratory, Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei 230031
2 Department of Automation, University of Science and Technology of China, Hefei 230026
Abstract: With the development of data collection and storage techniques, traditional association rule mining generates an excessive number of disordered rules, which fail to match the interests of users. To solve this problem, an interestingness measure of association rules based on information entropy is proposed to mine interestingness rules. Correlation analysis for categorical variables is adopted to eliminate false and erroneous rules from the primitive rule set, and a framework for evaluating the interestingness degree of rules based on information entropy is proposed. Since the method does not depend on the prior knowledge of users, it represents the information hidden in the data accurately. Simulation results on both real and synthetic datasets show that the proposed algorithm outperforms traditional algorithms and discovers interestingness rules from large databases efficiently.
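The abstract names only the two ingredients of the approach: a correlation test for categorical variables that prunes false rules, and an entropy-based score that ranks the remaining rules by interestingness. The sketch below is one plausible reading of these ingredients, not the paper's actual formulas; the function names chi_square and entropy_interest, the binary-entropy scoring form, and the 3.84 pruning threshold are illustrative assumptions.

```python
import math

def chi_square(n, n_a, n_b, n_ab):
    """Chi-square statistic of the 2x2 contingency table for itemsets A and B.

    n: total transactions; n_a, n_b: support counts of A and B; n_ab: joint count.
    Rules whose antecedent and consequent are uncorrelated (e.g. chi2 < 3.84,
    the 95% critical value with one degree of freedom) can be pruned as false rules.
    """
    chi2 = 0.0
    for a_present in (True, False):
        for b_present in (True, False):
            observed = (n_ab if a_present and b_present
                        else n_a - n_ab if a_present
                        else n_b - n_ab if b_present
                        else n - n_a - n_b + n_ab)
            expected = ((n_a if a_present else n - n_a)
                        * (n_b if b_present else n - n_b)) / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    return chi2

def entropy_interest(p_a, p_b, p_ab):
    """Illustrative entropy-based interestingness of rule A -> B:
    the relative reduction in uncertainty about B once A is known to hold.
    p_a, p_b, p_ab are relative supports of A, B and A U B.
    The score can be negative when conditioning on A makes B less predictable.
    """
    def h(p):  # binary (Shannon) entropy in bits
        if p <= 0.0 or p >= 1.0:
            return 0.0
        return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

    p_b_given_a = p_ab / p_a          # confidence of the rule A -> B
    h_b = h(p_b)
    return 0.0 if h_b == 0.0 else (h_b - h(p_b_given_a)) / h_b
```

For instance, with supp(A) = 0.4, supp(B) = 0.5 and supp(A U B) = 0.35, the rule A -> B has confidence 0.875 and entropy_interest(0.4, 0.5, 0.35) ≈ 0.46, i.e. observing A removes roughly half of the uncertainty about B.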
金洲, 王儒敬. 基于信息熵的兴趣度规则挖掘算法[J]. 模式识别与人工智能, 2014, 27(6): 524-532.
JIN Zhou, WANG Ru-Jing. Interestingness Rule Mining Algorithm Based on Information Entropy[J]. Pattern Recognition and Artificial Intelligence, 2014, 27(6): 524-532.