Abstract:To improve the scalability and adaptability of traditional decision tree learning algorithm, a novel multiple subclassifier integration method of decision forest is proposed based on general information theory. It adopts down-top learning strategy and combines discretization with logical representation of decision tree naturally. The learning procedure does not require any human intervention. The number and structures of subtrees can be set automatically. Experimental results and instance analysis on UCI machine learning data sets prove the feasibility and effectiveness of the proposed method.
王利民,徐沛娟,李雄飞. 基于广义信息论的决策森林多重子模型集成方法*[J]. 模式识别与人工智能, 2009, 22(2): 325-329.
WANG Li-Min, XU Pei-Juan, LI Xiong-Fei. Multiple Subclassifier Integration Method of Decision Forest Based on General Information Theory. , 2009, 22(2): 325-329.
[1] Dietterich T G. Machine Learning Research: Four Current Direction. AI Magazine, 1997, 18(4): 97-136 [2] Li Aijun, Luo Siwei, Huang Hua. et al. Decision Tree Based Neural Network Design. Journal of Computer Research and Development, 2005, 42(8): 1312-1317 (in Chinese) (李爱军,罗四维,黄 华,等.基于决策树的神经网络.计算机研究与发展, 2005, 42(8), 1312-1317) [3] Schapire R E. The Strength of Week Learnability. Machine Learning, 1990, 5(2): 197-227 [4] Tin K H. The Random Subspace Method for Constructing Decision Forests. Pattern Analysis and Machine Intelligence, 2005, 20(8): 832-844 [5] Todorovski L, Dzeroski S. Combining Multiple Models with Meta Decision Trees // Proc of the 4th European Conference on Principles of Data Mining and Knowledge Discovery. Lyon, France, 2000: 54-64 [6] Wang Limin, Li Xiaolin. Combining Decision Tree and Nave Bayes for Classification. Knowledge-Based Systems, 2006, 19(7): 511-515 [7] Wang Limin, Yuan Senmiao. Induction of Hybrid Decision Tree Based on Post-Discretization Strategy. Progress in Natural Science, 2004, 14(6): 541-545 [8] Wang Xizhao, Yang Chenxiao. Merging-Branches Impact on Decision Tree Induction. Chinese Journal of Computers, 2007, 30(8): 1251-1258 (in Chinese) (王熙照,杨晨晓.分支合并对决策树归纳学习的影响.计算机学报, 2007, 30(8): 1251-1258) [9] Liang Daokei, Huang Guoxing, Jin Jian. A New Multivariate Decision Tree Algorithm. Computer Science, 2008, 35(1): 211-212 (in Chinese) (梁道雷,黄国兴,金 健.一种多变量决策树方法研究.计算机科学, 2008, 35(1): 211-212)