Pattern Recognition and Artificial Intelligence
2020, Vol. 33, Issue 9: 811-819. DOI: 10.16451/j.cnki.issn1003-6059.202009005
Papers and Reports
Adaptive Undersampling Based on Density Peak Clustering
CUI Caixia (1, 2), CAO Fuyuan (1, 3), LIANG Jiye (1, 3)
1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006
2. Computer Science and Technology Department, Taiyuan Normal University, Jinzhong 030619
3. Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan 030006

Abstract  Undersampling based on K-means clustering is suitable only for hyperspherical data, does not account for the effect of overlapping regions on classification, and ignores the density of samples within clusters. To address these limitations, an adaptive undersampling method based on density peak clustering is proposed. First, majority-class samples in the overlapping region are identified by a nearest neighbor search algorithm and deleted. Second, clusters of different shapes, sizes and densities are obtained automatically by an improved density peak clustering algorithm. Then, undersampling is performed according to sampling weights calculated from the density of the samples in the subclusters, and bagging ensemble classification is conducted on the resulting balanced dataset. Experiments indicate that the proposed method performs better on most datasets.
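The pipeline described in the abstract can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: it uses the basic density peak clustering rule (Gaussian-kernel density and distance to the nearest denser point) rather than the improved variant the paper refers to, it takes the number of clusters as an input instead of finding it automatically, and the weighting rule in `sampling_weights` is a guess at one plausible choice. All function names and the parameters `k`, `d_c`, and `n_clusters` are hypothetical. The bagging step is omitted.

```python
import math    # math.dist requires Python 3.8+
import random


def remove_overlap(majority, minority, k=3):
    """Keep majority points whose k nearest neighbours are not mostly minority."""
    points = [(p, 0) for p in majority] + [(p, 1) for p in minority]
    kept = []
    for i, p in enumerate(majority):
        # Rank every other point by distance to p (index i is p itself).
        neighbours = sorted(
            (math.dist(p, q), label)
            for j, (q, label) in enumerate(points) if j != i
        )[:k]
        minority_votes = sum(label for _, label in neighbours)
        if 2 * minority_votes <= k:
            kept.append(p)
    return kept


def density_peaks(points, d_c, n_clusters):
    """Basic density peak clustering; returns (rho, cluster labels)."""
    n = len(points)
    d = [[math.dist(a, b) for b in points] for a in points]
    # Gaussian-kernel density. The self term adds 1 to every rho,
    # which keeps rho strictly positive without changing the ranking.
    rho = [sum(math.exp(-(x / d_c) ** 2) for x in row) for row in d]
    order = sorted(range(n), key=lambda i: -rho[i])  # densest first
    delta = [0.0] * n
    parent = [-1] * n
    for rank, i in enumerate(order):
        if rank == 0:
            delta[i] = max(d[i])  # usual convention for the densest point
            continue
        # Nearest point among those ranked as denser.
        parent[i] = min(order[:rank], key=lambda j: d[i][j])
        delta[i] = d[i][parent[i]]
    # Centres are the points with the largest rho * delta.
    centres = sorted(order, key=lambda i: -rho[i] * delta[i])[:n_clusters]
    cluster = [-1] * n
    for c, i in enumerate(centres):
        cluster[i] = c
    for i in order:  # a parent is always labelled before its children
        if cluster[i] == -1:
            cluster[i] = cluster[parent[i]]
    return rho, cluster


def sampling_weights(rho, cluster):
    """Assumed rule: cluster share by size, within-cluster share by density."""
    n = len(rho)
    total, size = {}, {}
    for r, c in zip(rho, cluster):
        total[c] = total.get(c, 0.0) + r
        size[c] = size.get(c, 0) + 1
    return [(size[c] / n) * (r / total[c]) for r, c in zip(rho, cluster)]


def undersample(points, weights, n_keep, seed=0):
    """Draw n_keep distinct points with probability proportional to weight."""
    rng = random.Random(seed)
    pool = list(range(len(points)))
    chosen = []
    for _ in range(min(n_keep, len(pool))):
        pick = rng.choices(pool, weights=[weights[i] for i in pool])[0]
        pool.remove(pick)
        chosen.append(points[pick])
    return chosen


# Toy data: two majority blobs plus one majority point inside the minority region.
majority = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3),
            (5.0, 5.0), (5.1, 5.2), (5.2, 5.0), (2.9, 3.0)]
minority = [(3.0, 3.0), (3.1, 2.9), (2.8, 3.1)]

kept = remove_overlap(majority, minority, k=3)
rho, cluster = density_peaks(kept, d_c=1.0, n_clusters=2)
weights = sampling_weights(rho, cluster)
balanced_majority = undersample(kept, weights, n_keep=len(minority))
```

In this toy run the majority point at (2.9, 3.0) sits among the minority samples and is dropped in the first step. The remaining majority points are then reduced to the size of the minority class, with denser points more likely to be retained.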
Key words: Imbalanced Data, Classification, Undersampling, Density Peak Clustering, Overlapping Region
Received: 15 June 2020     
CLC number (ZTFLH): TP 391
Fund: National Natural Science Foundation of China (No. 61876103), Key Research and Development Project of Shanxi Province (No. 201903D121162)
Corresponding Author: LIANG Jiye, Ph.D., professor. His research interests include artificial intelligence, granular computing, data mining and machine learning.
About authors: CUI Caixia, Ph.D. candidate. Her research interests include data mining and machine learning. CAO Fuyuan, Ph.D., professor. His research interests include data mining and machine learning.
Cite this article:
CUI Caixia, CAO Fuyuan, LIANG Jiye. Adaptive Undersampling Based on Density Peak Clustering[J]. Pattern Recognition and Artificial Intelligence, 2020, 33(9): 811-819.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202009005      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2020/V33/I9/811