模式识别与人工智能
Tuesday, Apr. 22, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2024, Vol. 37 Issue (8): 741-754    DOI: 10.16451/j.cnki.issn1003-6059.202408007
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Semi-supervised Online Classification Method for Multi-label Data Stream Based on Kernel Extreme Learning Machine
WANG Yuchen1,2, QIU Shiyuan1,2, LI Peipei1,2,3, HU Xuegang1,2,4
1. School of Computer Science and Information Engineering, He-fei University of Technology, Hefei 230601;
2. Key Laboratory of Knowledge Engineering with Big Data of Ministry of Education of China, Hefei University of Technology, Hefei 230009;
3. Institute of Health Big Data and Population Medicine, Institute of Health and Medicine, Hefei Comprehensive National Science Center, Hefei 230032;
4. Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei University of Technology, Hefei 230009

Download: PDF (853 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  In practical applications, a large amount of streaming data emerges, and it is characterized of high arrival speed, massive volume and dynamic variation. Moreover, the data streams often contain multiple labels but only a small amount of data in the streams is labeled, causing the problems of concept drift and label missing in the multi-label data. To solve these problems, a semi- supervised online classification method for multi-label data stream based on kernel extreme learning machine is proposed in this paper. Firstly, the data stream is divided into k blocks according to the sliding window to tackle the label missing problem in multi-label data stream. A feature similarity matrix and a label similarity matrix are constructed for each piece of data and they are added to the training of kernel extreme learning machine model. An incremental update mechanism is designed to construct a semi-supervised online kernel extreme learning machine to adapt to the characteristics of streaming data. Secondly, to address the issue of the concept drift problem in data stream, the timestamp mechanism is adopted for discarding update. The data size is preset in advance. When the data reaches the specified size, the oldest unlabeled data is discarded and new data is added for updating. Finally, experiments on 10 multi-label datasets demonstrate that the proposed method possesses strong adaptability to the problems of label missing and concept drift, while maintaining good classification performance.
Key wordsData Stream Classification      Semi-supervised Classification      Multi-label Classification      Kernel Extreme Learning Machine      Concept Drift     
Received: 15 June 2024     
ZTFLH: TP181  
Fund:National Natural Science Foundation of China(No.62376085,62076085,62120106008), Research Funds of Center for Big Data and Population Health of Institute of Health and Medicine of Hefei Comprehensive National Science Center(No.JKS2023003)
Corresponding Authors: LI Peipei, Ph.D., professor. Her research interests include data mining.   
About author:: WANG Yuchen, Master student. His research interests include semi-supervised multi-label data stream classification. QIU Shiyuan, Master. Her research inte-rests include semi-supervised multi-label data stream classification. HU Xuegang, Ph.D., professor. His research interests include data mining and know-ledge engineering.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Yuchen
QIU Shiyuan
LI Peipei
HU Xuegang
Cite this article:   
WANG Yuchen,QIU Shiyuan,LI Peipei等. Semi-supervised Online Classification Method for Multi-label Data Stream Based on Kernel Extreme Learning Machine[J]. Pattern Recognition and Artificial Intelligence, 2024, 37(8): 741-754.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202408007      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2024/V37/I8/741
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn