模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2018, Vol. 31 Issue (7): 634-642    DOI: 10.16451/j.cnki.issn1003-6059.201807006
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Semi-supervised Short Text Stream Classification Based on Vector Representation and Label Propagation
WANG Haiyan1 , HU Xuegang1,2 , LI Peipei1,2
1.School of Computer and Information, Hefei University of Technology, Hefei 230601
2.Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei University of Technology, Hefei 230009

Download: PDF (0 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The huge volume of short text streams produced by social Network is fast, high-volume and it contains concept drift, short length of texts and massive unlabeled data. Therefore, a semi-supervised short text stream classification algorithm based on vector representation and label propagation is proposed in this paper to classify short text stream with a few labeled data. Besides, to adapt to the concept drift, a concept drift detection algorithm based on clusters is proposed. Experimental results on real short text streams show that the proposed algorithm improves the classification accuracy and the macro average compared with classical semi-supervised classification algorithms and semi-supervised data stream classification algorithms, and it adapts to the concept drift quickly in data stream.
Key wordsShort Text Stream      Semi-supervised Classification      Label Propagation      Concept Drift     
Received: 11 April 2018     
ZTFLH: TP 391.1  
Fund:Supported by National Key Research and Development Program of China(No.2016YFC0801406), National Natural Science Foundation of China(No.61503112,61673152)
Corresponding Authors: LI Peipei(Corresponding author), Ph.D., associate professor. Her research interests include data stream mining and knowledge engineering.   
About author:: WANG Haiyan, master student. Her research interests include short text stream cla-ssification.HU Xuegang, Ph.D., professor. His re-search interests include data mining and know-ledge engineering.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Haiyan
HU Xuegang
LI Peipei
Cite this article:   
WANG Haiyan,HU Xuegang,LI Peipei. Semi-supervised Short Text Stream Classification Based on Vector Representation and Label Propagation[J]. , 2018, 31(7): 634-642.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201807006      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2018/V31/I7/634
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn