1. Key Laboratory of Computer Network and Information Integration, Ministry of Education, School of Computer Science and Engineering, Southeast University, Nanjing 211189
2. National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 211189
Abstract: Stochastic under-sampling ignores potentially useful information in the majority class, and this problem becomes even more severe when under-sampling is applied to multi-class imbalance problems. In this paper, EasyEnsemble.M is proposed for the multi-class imbalance problem. The potentially useful information in the majority classes, which under-sampling would otherwise discard, is explored by stochastically sampling the majority classes multiple times. Sub-classifiers are then learned from these samples, and a strong classifier is obtained with hybrid ensemble techniques. Experimental results show that EasyEnsemble.M is superior to other frequently used multi-class imbalance learning methods when G-mean is used as the performance measure.
[1] Ye Z F, Wen Y M, Lü B L. A Survey of Imbalanced Pattern Classification Problems. CAAI Trans on Intelligent Systems, 2009, 4(2): 148-156 (in Chinese)
[2] Dong Y J. Random-SMOTE Method for Imbalanced Data Sets. Master Dissertation. Dalian, China: Dalian University of Technology, 2009 (in Chinese)
[3] Chawla N V. Data Mining for Imbalanced Datasets: An Overview [EB/OL]. [2013-03-10]. http://link.springer.com/chapter/10.1007%2F0-387-25465-X_40#page-1
[4] Davenport M. Introduction to Modern Information Retrieval. Journal of the Medical Library Association, 2012. DOI: 10.3163/1536-5050.100
[5] Bradley A P. The Use of the Area under the ROC Curve in the Evaluation of Machine Learning Algorithms. Pattern Recognition, 1997, 30(7): 1145-1159
[6] Drummond C, Holte R C. C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling Beats Over-Sampling [EB/OL]. [2013-03-10]. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.68.6858&repl&type=pdf
[7] Wang S, Yao X. Multiclass Imbalance Problems: Analysis and Potential Solutions. IEEE Trans on Systems, Man and Cybernetics, 2012, 42(4): 1119-1130
[8] Zhao X M, Li X, Chen L N, et al. Protein Classification with Imbalanced Data. Proteins: Structure, Function and Bioinformatics, 2008, 70(4): 1125-1132
[9] Chen K, Lü B L, Kwok J T. Efficient Classification of Multi-label and Imbalanced Data Using Min-Max Modular Classifiers // Proc of the International Joint Conference on Neural Networks. Vancouver, Canada, 2006: 1170-1775
[10] Tan A C, Gilbert D, Deville Y. Multi-class Protein Fold Classification Using a New Ensemble Machine Learning Approach. Genome Informatics, 2003, 14: 206-217
[11] Liao T W. Classification of Weld Flaws with Imbalanced Class Data. Expert Systems with Applications, 2008, 35(3): 1041-1052
[12] Zhou Z H, Liu X Y. Training Cost-Sensitive Neural Networks with Methods Addressing the Class Imbalance Problem. IEEE Trans on Knowledge and Data Engineering, 2006, 18(1): 63-77
[13] Rifkin R, Klautau A. In Defense of One-vs-All Classification. The Journal of Machine Learning Research, 2004, 5: 101-141
[14] Hastie T, Tibshirani R. Classification by Pairwise Coupling. The Annals of Statistics, 1998, 26(2): 451-471
[15] Alejo R, Sotoca J, Valdovinos R, et al. The Multi-class Imbalance Problem: Cost Functions with Modular and Non-Modular Neural Networks // Proc of the 6th International Symposium on Neural Networks. Berlin, Germany: Springer, 2009: 421-431
[16] Sun Y, Kamel M S, Wang Y. Boosting for Learning Multiple Classes with Imbalanced Class Distribution // Proc of the 6th IEEE International Conference on Data Mining. Hong Kong, China, 2006: 592-602
[17] Freund Y, Schapire R. A Decision-Theoretic Generalization of On-line Learning and an Application to Boosting. Journal of Computer and System Sciences, 1997, 55(1): 119-139
[18] Wang S, Chen H H, Yao X. Negative Correlation Learning for Classification Ensembles // Proc of the International Joint Conference on Neural Networks. Barcelona, Spain, 2010: 1-8
[19] Hoens T, Qian Q, Chawla N, et al. Building Decision Trees for the Multi-class Imbalance Problem. Lecture Notes in Computer Science, 2012. DOI: 10.1007/978-3-642-30217-6_11
[20] Cieslak D A, Chawla N V. Learning Decision Trees for Unbalanced Data // Proc of the European Conference on Machine Learning and Knowledge Discovery in Databases. Antwerp, Belgium, 2008, I: 241-256
[21] Liu X Y, Wu J X, Zhou Z H. Exploratory Undersampling for Class-Imbalance Learning. IEEE Trans on Systems, Man and Cybernetics, 2009, 39(2): 539-550
[22] Wang S, Yao X. Theoretical Study of the Relationship between Diversity and Single-Class Measures for Class Imbalance Learning // Proc of the IEEE International Conference on Data Mining. Miami, USA, 2009: 76-81
[23] Breiman L, Friedman J, Stone C J, et al.
Classification and Regression Trees. London, UK: Chapman & Hall, 1984
[24] Hand D J, Till R J. A Simple Generalisation of the Area under the ROC Curve for Multiple Class Classification Problems. Machine Learning, 2001, 45(2): 171-186

The difficulty of multi-class imbalance problems lies in the diversity of the sample distributions across classes and the increased concept complexity: each class accounts for only a small fraction of the whole training set, so even when the classes are balanced, the correct concept of a class is hard to learn successfully.

Random under-sampling is widely applied to two-class imbalance problems. Its advantage is that it is simple and efficient; its drawback is that it discards much potentially useful information in the majority-class samples. Studies have shown [7] that random under-sampling is comparable to, or even better than, random over-sampling. For multi-class imbalance problems, however, especially when the minority classes contain very few samples, the information loss of under-sampling causes more severe problems, and applying under-sampling directly can hardly achieve good classification performance [10].
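The under-sampling-plus-ensemble idea discussed above can be sketched in a few lines. The code below is a minimal illustrative sketch, not the authors' EasyEnsemble.M implementation: it draws several balanced subsets by randomly under-sampling every majority class down to the smallest class size, trains one sub-classifier per subset (a toy nearest-centroid learner stands in for the base learner), and combines the sub-classifiers by majority vote. All class and function names here are hypothetical.

```python
import numpy as np


def undersample_majority(X, y, rng):
    """Draw one balanced subset: every class is randomly under-sampled
    down to the size of the smallest class. Repeated draws explore
    different majority-class samples, which is the key idea above."""
    classes, counts = np.unique(y, return_counts=True)
    n_min = counts.min()
    idx = np.concatenate([
        rng.choice(np.where(y == c)[0], size=n_min, replace=False)
        for c in classes
    ])
    return X[idx], y[idx]


class CentroidClassifier:
    """Toy base learner: predicts the class of the nearest class centroid."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        return self

    def predict(self, X):
        # Squared Euclidean distance from each sample to each centroid.
        d = ((X[:, None, :] - self.centroids_[None, :, :]) ** 2).sum(axis=-1)
        return self.classes_[d.argmin(axis=1)]


class EasyEnsembleSketch:
    """Train one sub-classifier per balanced subset, then majority-vote."""

    def __init__(self, n_subsets=10, seed=0):
        self.n_subsets = n_subsets
        self.rng = np.random.default_rng(seed)

    def fit(self, X, y):
        self.learners_ = []
        for _ in range(self.n_subsets):
            Xs, ys = undersample_majority(X, y, self.rng)
            self.learners_.append(CentroidClassifier().fit(Xs, ys))
        return self

    def predict(self, X):
        # votes[i, j] = prediction of sub-classifier i on sample j.
        votes = np.stack([m.predict(X) for m in self.learners_])
        return np.array([np.bincount(col).argmax() for col in votes.T])
```

In the full method described in the abstract, the base learners and their combination are more elaborate (hybrid ensemble techniques rather than a plain vote), but the structure — multiple stochastic samplings of the majority classes, one sub-classifier each, one aggregated strong classifier — is the same.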