模式识别与人工智能
Thursday, Apr. 10, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2014, Vol. 27 Issue (7): 663-672    DOI:
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
A Semi-Structured Tibetan Text Clustering Algorithm Based on Swarm Intelligence
KANG Jian1, QIAO Shao-Jie1, GESANG Duoji2, HAN Nan3, HONG Xi-Jin1, NIMA Zhaxi2, FAN Xiao-Gang1
1School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031
2College of Engineering, Tibet University, Lhasa 850000
3School of Life Science and Engineering, Southwest Jiaotong University, Chengdu 610031

Download: PDF (1001 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  

To apply swarm intelligence techniques to cluster semi-structured Tibetan Web texts, a semi-structured Tibetan text clustering algorithm based on swarm Intelligence (SCAST) is proposed. Taking into a full consideration of accuracy and efficiency of Tibetan text clustering, a vector space model is used to express Tibetan texts, and the Tibetan texts and intelligent ants are randomly put in a two dimensional text vector space. Then, intelligent ants randomly select a Tibetan text, calculate the similarity between this text and others in the local area,and compute the probability of pick-up operation or drop-down operation to determine whether to pick up, move, or drop down the text. Finally, Tibetan texts are accurately clustered according to their similarities by iterative training of the proposed algorithm. The experimental results on real Tibetan Web text datasets show that the proposed algorithm is more accurate than the traditional k-means clustering algorithm with average increase of 8.0%.

Key wordsSwarm Intelligence      Tibetan Text      Clustering Analysis      Swarm Similarity     
Received: 26 June 2013     
ZTFLH: TP311  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
KANG Jian
QIAO Shao-Jie
GESANG Duoji
HAN Nan
HONG Xi-Jin
NIMA Zhaxi
FAN Xiao-Gang
Cite this article:   
KANG Jian,QIAO Shao-Jie,GESANG Duoji等. A Semi-Structured Tibetan Text Clustering Algorithm Based on Swarm Intelligence[J]. , 2014, 27(7): 663-672.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2014/V27/I7/663
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn