模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2016, Vol. 29 Issue (9): 780-789    DOI: 10.16451/j.cnki.issn1003-6059.201609002
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Unstable Cut-Points Based Sample Selection for Large Data Classification
WANG Xizhao1, XING Sheng2,3, ZHAO Shixin2,4
1.College of Mathematics and Information Science, Hebei University, Baoding 071002.2.School of Management, Hebei University, Baoding 071002.3.College of Computer Science and Engineering, Cangzhou Normal University, Cangzhou 061001.4.Department of Mathematics and Physics, Shijiazhuang Tiedao University, Shijiazhuang 050043

Download: PDF (475 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  When the traditional sample selection methods are used to compress the large data, the computational complexity and large time consumption are high. Aiming at this problem, a sample selection method based on unstable cuts for the compression of large data sets is proposed in this paper. The extreme value is obtained at the interval endpoint for convex function, and therefore the endpoint degree of a sample is measured by making the unstable cuts of all attributes according to the basic property. The samples with higher endpoint degree are selected,and the calculation of the distance between the samples is avoided. The efficiency of the computation is improved without affecting the classification accuracy. The experimental results show a significant effect of the proposed algorithm on the compression for the large data set with high imbalance ratio and strong ability of anti-noise.
Key wordsLarge Data Classification      Sample Selection      Unstable cut-points      Decision Tree     
Received: 03 May 2016     
ZTFLH: TP 181  
About author:: WANG Xizhao, born in 1963, Ph.D., professor. His research interests include machine learning and pattern recognition.XING Sheng(Corresponding author), born in 1982, Ph.D. candidate, lecturer.His research interests include machine learning.ZHAO Shixin, born in 1978, Ph.D. candidate, lecturer. Her research interests include machine learning.)
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Xizhao
XING Sheng
ZHAO Shixin
Cite this article:   
WANG Xizhao,XING Sheng,ZHAO Shixin. Unstable Cut-Points Based Sample Selection for Large Data Classification[J]. , 2016, 29(9): 780-789.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201609002      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2016/V29/I9/780
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn