模式识别与人工智能
Friday, May. 2, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2014, Vol. 27 Issue (3): 226-234    DOI:
Researches and Applications Current Issue| Next Issue| Archive| Adv Search |
Texts Similarity Algorithm Based on Subtrees Matching
ZHANG Pei-Yun1,2,CHEN Chuan-Ming1,HUANG Bo3
1.School of Mathematics and Computer Science,Anhui Normal University,Wuhu 241003
2.School of Computer Science and Technology,University of Science and Technology of China,Hefei 230027
3 .School of Computer Science and Technology,Nanjing University of Science and Technology,Nanjing 210094

Download: PDF (0 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  To reduce the dimensionality of text vectors and improve the performance of semantic similarity measurement,an algorithm for texts similarity computation is proposed,which combines the advantages of the statistical methods and semantic dictionary. The texts are utilized to generate metadata feature vectors,so that it reduces the dimensionality of text vectors space. The algorithm for computing texts similarity is designed based on subtrees matching and the speed of computing texts similarity is improved. The accuracy of texts semantic similarity measurement is improved by utilizing the semantic matching of metadata feature vectors and subtrees. The synonyms widely existing in metadata are processed by the proposed method,and the semantic coverage ability for similarity computation of texts is also enhanced. The experimental results show that the proposed method is feasible and effective.
Received: 06 May 2013     
ZTFLH: TP 311  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
ZHANG Pei-Yun
CHEN Chuan-Ming
HUANG Bo
Cite this article:   
ZHANG Pei-Yun,CHEN Chuan-Ming,HUANG Bo. Texts Similarity Algorithm Based on Subtrees Matching[J]. , 2014, 27(3): 226-234.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2014/V27/I3/226
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn