School of Mathematics and Computer Science,Fujian Normal University,Fuzhou 350007 Key Laboratory of Network Security and Cryptography,Fujian Normal University,Fuzhou 350007
Abstract:KNN Model is an improved version of the k-nearest neighbor method. However, KNN Model is a non-incremental learning method, which restricts it from some real applications. A KNN Model based incremental learning method is proposed by introducing level concept for created clusters. It constructs few clusters for new coming data with different levels assignment to adjust and optimize previous generated KNN Model. Experimental results show the effectiveness of the proposed method.
[1] Luo Changsheng, Duan Jianguo, Guo Li. Research on Incremental Learning of DragPush-Based Text Classification. Journal of Chinese Information Processing, 2008, 22(1): 37-44 (in Chinese) (罗长升,段建国,郭 莉.基于推拉策略的文本分类增量学习研究.中文信息学报, 2008, 22(1): 37-44) [2] Fu Changlong, Du Xuhui, Yao Quanzhu. An Incremental Rules Learning Algorithm Based on Probabilistic Rough Set Model. Computer Science, 2008, 35(5): 143-146 (in Chinese) (付长龙,杜旭辉,姚全珠.一种基于概率粗糙集模型的增量式规则学习算法.计算机科学, 2008, 35(5): 143-146) [3] Xiang Tao, Gong Shaogang. Incremental and Adaptive Abnormal Behavior Detection. Computer Vision and Image Understanding, 2008, 111(1): 59-73 [4] Xiao Jianpeng, Zhang Laishun, Ren Xing. Transductive Support Vector Machines Based on Incremental Learning. Journal of Computer Applications, 2008, 28(7): 1642-1644 (in Chinese) (肖建鹏,张来顺,任 星.基于增量学习的直推式支持向量机算法.计算机应用, 2008, 28(7): 1642-1644) [5] Liu Bo, Pan Jiuhui. Incremental Classification Method Based on Ensemble. Computer Engineering, 2008, 34(19):187-188,191 (in Chinese) (刘 波,潘久辉.基于Ensemble的增量分类方法.计算机工程, 2008, 34(19): 187-188,191) [6] Wang Xiujun, Shen Hong. Improved Growing Learning Vector Quantification for Text Classification. Chinese Journal of Computers, 2007, 30(8): 1277-1285 (in Chinese) (王修君,沈 鸿.一种基于增量学习型矢量量化的有效文本分类算法.计算机学报, 2007, 30(8): 1277-1285) [7] Guo Gongde, Wang Hui, Bell D, et al. KNN Model Based Approach in Classification // Proc of the OTM Confederated International Conferences on CoopIS, DOA and ODBASE. Catania, Italy,
2003: 986- 996 [8] Guo Gongde, Wang Hui, Bell D, et al. Using KNN Model for Automatic Text Categorization. Soft Computing: A Fusion of Foundations, 2006, 10(5): 423-430 [9] Ye Nong, Li Xiangyang. A Machine Learning Algorithm Based on Supervised Clustering and Classification // Proc of the 6th International Computer Science Conference on Active Media Technology. Hongkong, China, 2001: 327-334 [10] Bian H Y. Fuzzy-Rough Nearest Neighbor Classification: An Integrated Framework // Proc of the IASTED International Symposium on Artificial Intelligence and Applications. Malaga, Spain, 2002: 160-164 [11] Rosa J L A, Ebecken N F F. Data Mining for Data Classification Based on the KNN-Fuzzy Method Supported by Genetic Algorithm // Proc of the 5th International Conference on High Performance Computing for Computational Science. Porte, Portugal, 2003: 126-133 [12] Keller J M, Gray M R, Jr Givens J A. A Fuzzy K-Nearest Neighbor Algorithm. IEEE Trans on Systems, Man and Cybernetics, 1985, 15(4): 580-585 [13] Teng Yueyang, Tang Huanwen, Zhang Haixia. A New Algorithm to Incremental Learning with Support Vector Machine. Computer Engineering and Applications, 2004, 40(36): 77-80 (in Chinese) (滕月阳,唐焕文,张海霞.一种新的支持向量机增量学习算法.计算机工程与应用, 2004, 40(36): 77-80) [14] Tan Songbo, Cheng Xueqi, Ghanem M M, et al. A Novel Refinement Approach for Text Categorization // Proc of the 14th ACM International Conference on Information and Knowledge Management. Bremen, Germany, 2005: 469-476 [15] Guan S U, Zhu Fangming. An Incremental Approach to Genetic-Algorithms-Based Classification. IEEE Trans on Systems, Man and Cybernetics, 2005, 35(2): 227-239 [16] Sang Nong, Zhang Rong, Zhang Tianxu. Incremental Learning Algorithm of a Modified Minimum-Distance Classifier. Pattern Recognition and Artificial Intelligence, 2007, 20(3): 358-364 (in Chinese) (桑 农,张 荣,张天序.一类改进的最小距离分类器的增量学习算法.模式识别与人工智能, 2007, 20(3): 358-364) [17] UCI Repository of Machine Learning Databases [DB/OL]. [2009-04-03]. http://www.ics.uci.edu/~mlearn/MLRository.html [18] Witten I H, Frank G. Data Mining: Practical Machine Learning Tools with Java Implementations. San Francisco, USA: Morgan Kaufmann, 2000 [19] KDD Cup 1999 Data. Information and Computer Science [EB/OL]. [2006-10-09]. http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html.