模式识别与人工智能
Thursday, Apr. 3, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2009, Vol. 22 Issue (5): 709-717    DOI:
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Documents Sampling Based on Feature Selection and Condensing Techniques
HAO Xiu-Lan1,2, TAO Xiao-Peng1, WANG Shu-Yun1, XU He-Xiang1,3, HU Yun-Fa1
1.School of Computer Science and Technology, Fudan University, Shanghai 200433
2.School of Information Engineering, Huzhou Teachers College, Huzhou 313000
3.Shanghai Tele-Education Group, Shanghai 200092

Download: PDF (612 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  As an instance based classifier, kNN has many computational and store requirements. Meanwhile, the poor performance of kNN classifier is caused by the imbalance distribution of training data. Aiming at these defects of kNN classifier, a technique, combining feature selection and condensing, is proposed to reduce the time cost and the space of kNN classifier. The proposed algorithm is divided into two steps. Firstly, several traditional methods of feature selection are combined to form features for each class. Then, redundant cases are removed by combination of class features contained in samples with Condensing algorithm. Experimental results indicate when the sample set acquired by the proposed method is used as training set, the classifier saves the time cost and the space dramatically, and the performance of the kNN classifier is improved because noisy data are removed from the training set.
Key wordsText Categorization      k-Nearest Neighbor (kNN)      Sampling      Feature Selection      Condensing Algorithm     
Received: 18 December 2007     
ZTFLH: TP391  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
HAO Xiu-Lan
TAO Xiao-Peng
WANG Shu-Yun
XU He-Xiang
HU Yun-Fa
Cite this article:   
HAO Xiu-Lan,TAO Xiao-Peng,WANG Shu-Yun等. Documents Sampling Based on Feature Selection and Condensing Techniques[J]. , 2009, 22(5): 709-717.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2009/V22/I5/709
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn