模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2017, Vol. 30 Issue (1): 1-10    DOI: 10.16451/j.cnki.issn1003-6059.201701001
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
Clustering Assumption Based Classification Algorithm for Stream Data
LI Nan
College of Computer and Information Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002

Download: PDF (530 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Labeling all the instances is unpractical due to the high cost of acquiring labeled data in a real streaming environment. However, labeling part of the instances leads to model instability. Aiming at these problem, a clustering assumption based classification algorithm for stream data(CASD) is proposed. It is assumed that the instances divided into the same cluster may come from the same class. Based on the clustering assumption, the clustering result is utilized to fit the distribution of each class. The instances difficult to be classified or from concept drift class are selected to update the current model. Maintaining several base learners for each class and dynamical updating them is another innovation of the proposed algorithm. When instances from a specific class disappear or reappear, the corresponding base learners are frozen or activated instead of relearning the prior knowledge. Experimental results show that with a few labeled instances, the accuracy of CASD is comparable to that of state-of-the-art algorithms and the model can adapt to concept drift rapidly.
Key wordsConcept Drift      Stream Data      Classification      Clustering     
Received: 30 May 2016     
ZTFLH: TP 311  
About author:: LI Nan, born in 1987, master, assistant professor. His research interests include pattern recognition and artificial intelligence.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
LI Nan
Cite this article:   
LI Nan. Clustering Assumption Based Classification Algorithm for Stream Data[J]. , 2017, 30(1): 1-10.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201701001      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2017/V30/I1/1
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn