Abstract: The assumption that all data features are equally important in the categorical data sequential information bottleneck (CD-sIB) algorithm lowers the quality of the data transformation. A weighted binary transformation method is proposed to reveal the features of non-co-occurrence data by highlighting the representative features and suppressing the redundant ones. Two weighting rules, applicability to stochastically distributed data and non-supervision of the weighting scheme, are introduced. Then, the weighted categorical data sequential information bottleneck (WCD-sIB) algorithm is presented based on the concept of weighting granularity. Experimental results show that the weighted binary transformation method generates a good co-occurrence data representation, and that the WCD-sIB algorithm is superior to the compared algorithms.
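The following is a minimal, hedged sketch of the kind of weighted binary transformation the abstract describes: categorical attributes are expanded into binary (one-hot) indicator features, each feature is scaled by an unsupervised weight, and the result is normalized into a co-occurrence-style joint distribution suitable as sIB input. The function name, the entropy-based weighting rule, and the example data are illustrative assumptions, not the paper's actual scheme.

```python
# Sketch only: the entropy-based weights below are an assumed stand-in for the
# paper's weighting rules; any unsupervised per-attribute weight could be used.
import numpy as np

def weighted_binary_transform(X, weights=None):
    """X: (n_objects, n_attributes) array of categorical values.
    Returns a normalized matrix usable as a joint distribution p(object, feature)."""
    n, m = X.shape
    columns, col_weights = [], []
    for j in range(m):
        values = np.unique(X[:, j])
        # Binary (one-hot) indicator for each value of attribute j.
        onehot = (X[:, j][:, None] == values[None, :]).astype(float)
        # Illustrative unsupervised weight: attributes whose values are spread
        # evenly (high entropy) are treated as less discriminative.
        p = onehot.mean(axis=0)
        entropy = -(p * np.log(p + 1e-12)).sum()
        w = 1.0 / (1.0 + entropy) if weights is None else weights[j]
        columns.append(onehot)
        col_weights.append(np.full(len(values), w))
    B = np.hstack(columns) * np.concatenate(col_weights)[None, :]
    return B / B.sum()  # normalize so entries form a joint distribution

# Example: 4 objects described by 2 categorical attributes.
X = np.array([["red", "small"], ["red", "large"], ["blue", "small"], ["blue", "small"]])
P = weighted_binary_transform(X)
print(P.shape)  # (4, 4): one column per attribute value, weighted and normalized
```

A representation of this form can then be clustered by a sequential IB procedure, which is where the weighting granularity mentioned in the abstract would come into play.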