模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2016, Vol. 29 Issue (1): 82-89    DOI: 10.16451/j.cnki.issn1003-6059.201601010
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Clustering Algorithm for Mixed Data Based on Dimensional Frequency Dissimilarity and Strongly Connected Fusion
QIAN Chaokai, HUANG Decai
College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310014

Download: PDF (484 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  The clustering result of k-Prototypes algorithm is unpredictable due to the sensitivity of the initial prototypes selection. Moreover, the whole diversity between data points and clusters is ignored. Therefore, a clustering algorithm based on dimensional frequency dissimilarity and strongly connected fusion is proposed. Plenty of sub-clusters are produced by multiple pre-clustering. According to the connectivity of those sub-clusters, strongly connected fusion is used to generate the final clusters. The proposed clustering algorithm is validated on three different UCI datasets. Meanwhile, it is compared with three mixed data clustering algorithms. The experimental results show that the proposed algorithm can yield better clustering precision and purity.
Key wordsDimensional Frequency Dissimilarity      Mixed Attribute      Clustering      Strongly Connected Fusion     
Received: 18 August 2014     
ZTFLH: TP 391.4  
Fund:Supported by Special Fund for Scientific Research in the Public Welfare of Ministry of Water Resources of China (No.201401044)
About author:: QIAN Chaokai, born in 1990, Master student. His research interests include data mining.HUANG Decai (Corresponding author), born in 1958, Ph.D., Professor. His research interests include data mining and artificial intelligence.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
QIAN Chaokai
HUANG Decai
Cite this article:   
QIAN Chaokai,HUANG Decai. Clustering Algorithm for Mixed Data Based on Dimensional Frequency Dissimilarity and Strongly Connected Fusion[J]. , 2016, 29(1): 82-89.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.201601010      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2016/V29/I1/82
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn