Abstract:A hybrid constrained semi supervised clustering algorithm(HCC) is proposed based on consistency algorithm. To get a better clustering result, both labeled data and pairwise constraints are considered in clustering to make use of two types of prior knowledge supplementary to each other. The theoretical derivation and the algorithm are presented in detail. Experimental results show that labeled data outperform pairwise constraints in promoting the quality of clustering. Additionally, for many indices, such as CRI, number of clusters and running time, HCC is better than comparative algorithms.