基于成对差异性度量的选择性集成方法

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (506 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要有效地产生泛化能力强、差异大的个体学习器，是集成学习算法的关键。为了提高学习器的差异性和精度，文中提出一种基于成对差异性度量的选择性集成方法。同时研究一种改进方法，进一步提高方法的运算速度，且支持并行计算。最后通过使用BP神经网络作为基学习器，在UCI数据集上进行实验，并与Bagging、基于遗传算法的选择性集成(GASEN)算法进行比较。实验结果表明，该改进算法在性能上与GASEN算法相近的前提下，训练速度得到大幅提高。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	杨长盛
	陶亮
	曹振田
	汪世义

关键词 ：集成学习, 选择性集成, 差异性, 成对差异性度量, 并行计算

Abstract：Effective generating individual learners with strong generalization ability and great diversity is the key issue of ensemble learning. To improve diversity and accuracy of learners, Pairwise Diversity Measures based Selective Ensemble (PDMSEN) is proposed in this paper. Furthermore, an improved method is studied to advance the speed of the algorithm and support parallel computing. Finally, through applying BP neural networks as base learners, the experiment is carried out on selected UCI database and the improved algorithm is compared with Bagging and GASEN (Genetic Algorithm based Selected Ensemble) algorithms. Experimental results demonstrate that the learning speed of the proposed algorithm is superior to that of the GASEN algorithm with the same learning performance.

Key words： Ensemble Learning Selective Ensemble Diversity Pairwise Diversity Measures Parallel Computing

收稿日期: 2009-04-22

ZTFLH:

TP181

基金资助:国家自然科学基金(No.60572128)、安徽省人才发展基金(No.2005Z029)资助项目

作者简介: 杨长盛，男，1989年生，硕士研究生，主要研究方向为模式识别、机器学习、图像处理.E-mail:changsheng_100@yahoo.com.cn.陶亮，男，1963年生，教授，博士生导师，主要研究方向为数字信号与图像处理、模式识别.曹振田，男，1981年生，硕士研究生，主要研究方向为神经网络、机器学习.汪世义，男，1974年生，博士研究生，主要研究方向为模式识别、机器学习，网络安全.

引用本文:

杨长盛，陶亮，曹振田，汪世义. 基于成对差异性度量的选择性集成方法[J]. 模式识别与人工智能, 2010, 23(4): 565-571. YANG Chang-Sheng,TAO Liang,CAO Zhen-Tian,WANG Shi-Yi. Pairwise Diversity Measures Based Selective Ensemble Method. , 2010, 23(4): 565-571.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2010/V23/I4/565

[1] Krogh A, Vedelsby J. Neural Network Ensembles, Cross Validation, and Active Learning // Tesauro G, Touretzky D S, Leen T K, eds. Advances in Neural Information Processing Systems. Cambridge, USA: MIT Press, 1995, Ⅶ: 231-238
[2] Zhou Zhihua, Wu Jianxin, Tang Wei. Ensembling Neural Networks: Many Could Be Better Than All. Artificial Intelligence, 2002, 137(1/2): 239-263
[3] Castro P D, Coelho G P, Caetano M F, et al. Designing Ensembles of Fuzzy Classification Systems: An Immune-Inspired Approach // Proc of the 4th International Conference on Artificial Immune Systems. Banff, Canada, 2005: 469-482
[4] Li Kai, Huang Houkuan, Ye Xiuchen, et al. A Selective Approach to Neural Network Ensemble Based on Clustering Technology // Proc of the International Conference on Machine Learning and Cybernetics. Banff, Canada, 2004: 3229-3233
[5] Dietterich T G. Machine Learning Research: Four Current Directions. AI Magazine, 1997, 18(4): 97-136
[6] Schapire R E. The Strength of Weak Learnability. Machine Learning, 1990, 5(2): 197-227
[7] Breiman L. Bagging Predictors. Machine Learning, 1996, 24(2): 123-140
[8] Ho T K. The Random Subspace Method for Constructing Decision Forests. IEEE Trans on Pattern Analysis and Machine Intelligence, 1998, 20(8): 832-844
[9] Opitz D W. Feature Selection for Ensembles // Proc of the 16th National Conference on Artificial Intelligence. Orlando, USA, 1999: 379-384
[10] Huang Y S, Suen C Y. A Method of Combining Multiple Experts for the Recognition of Unconstrained Handwritten Numerals. IEEE Trans on Pattern Analysis and Machine Intelligence, 1995, 17(1): 90-94
[11] Jie Z, Qiang J, Nagy G. A Comparative Study of Local Matching Approach for Face Recognition. IEEE Trans on Image Processing, 2007, 16(10): 2617-2628
[12] Verlinde P, Chollet G. Comparing Decision Fusion Paradigms Using k-NN Based Classifiers, Decision Trees and Logistic Regression in A Multi-Modal Identity Verification Application // Proc of the 2nd International Conference on Audio and Video-Based Biometric Person Authentication. Washington, USA: 1999: 188-193
[13] Kittler J, Hatef M, Duin R P W, et al. On Combining Classifiers. IEEE Trans on Pattern Analysis and Machine Intelligence, 1998, 20(3): 226-239
[14] Kuncheva L I, Bezdek J, Duin R. Decision Templates for Multiple Classifier Fusion: An Experimental Comparison. Pattern Recognition, 2001, 34(2): 299-314
[15] Cho S B, Kim J H. Multiple Network Fusion Using Fuzzy Logic. IEEE Trans on Neural Networks, 1995, 6(2): 497-501
[16] Kuncheva L I, Whitaker C J. Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy. Machine Learning, 2003, 51(2): 181-207
[17] Freund Y, Schapire R E. A Decision-Theoretic Generalization of On-line Learning and an Application to Boosting // Proc of the 2nd European Conference on Computational Learning Theory. Barcelona, Spain, 1995: 23-37
[18] Yule G U. On the Association of Attributes in Statistics. Philosophy Trans of the Royal Society of London, 1900, 194: 257-319
[19] Sneath P H, Sokal R R. Numerical Taxonomy. London, UK: Freeman W H, Co, 1973
[20] Skalak D. The Sources of Increased Accuracy for Two Proposed Boosting Algorithms [EB/OL]. [2009-04-15]. https://eprints.kfupm.edu.sa/71328/1/71328.pdf
[21] Giacinto G, Roli F. Design of Effective Neural Network Ensembles for Image Classification Purposes. Image and Vision Computing, 2001, 19(9): 699-707
[22] Kuncheva L I. Combining Pattern Classifiers: Methods and Algorithms. Malden, USA: Wiley Interscience, 2004