An Ontology Concept-Based Cluster Partition Approach for Computing the Semantic Distance between Concepts
PENG Zhi-Ping1, LI Xiao-Ming2, KE Wen-De1
1.Department of Computer Science and Technology, Guangdong University of Petrochemical Technology, Maoming 525000 2.College of Computer and Software, Taiyuan University of Technology, Taiyuan 030000
Abstract:The semantic similarity computing between concepts is an important component in natural language processing etc., and the semantic similarity computing between concepts based on semantic distance is currently dominant technique. In this paper, the ontology based cluster partition approach for computing the semantic distance between concepts is proposed on the basis of the analysis of the lacks in the existing algorithms. The rules for computing the semantic distance between concepts are given under the situation of multi-concept clusters, and then the approach for computing the semantic distance between concepts within single cluster as well as cross-cluster is put forward. In the proposed approach, the non-symmetry of semantic similarities in the pairs of hyponymy concepts is worked out by introducing the forward semantic distance and the reverse semantic distance, and the other binary relationships of the pairs of non-hyponymy concepts are deal with by dynamically allocating the relation weights in the light of the locations of concept nodes. Experimented results shows that the proposed approach is effective and it is preferable to other typical similar ones.
彭志平,李晓明,柯文德. 基于本体概念群组划分的语义距离计算方法[J]. 模式识别与人工智能, 2011, 24(2): 194-200.
PENG Zhi-Ping, LI Xiao-Ming, KE Wen-De. An Ontology Concept-Based Cluster Partition Approach for Computing the Semantic Distance between Concepts. , 2011, 24(2): 194-200.
[1] Berners-Lee T, Hendler J, Lassila O. The Semantic Web. Scientific American, 2001, 284(5): 34-43 [2] Francisco M C, Mario J S, Pedro M C. Measuring Semantic Similarity between Gene Ontology Terms. Data Knowledge Engineering, 2007, 61(1): 137-152 [3] Gao Shu, Rana O F, Avis N J, et al. Ontology-Based Semantic Matchmaking Approach. Advances in Engineering Software, 2007, 38(1): 59-67 [4] Budanitsky A, Hirst G. Evaluating WordNet-Based Measures of Lexical Semantic Relatedness. Computational Linguistics, 2004, 32(1): 13-47 [5] Niles I, Pease A. Towards a Standard Upper Ontology // Proc of the International Conference on Formal Ontology in Information Systems. Ogunquit, USA, 2001: 2-9 [6] Zhao Jinguo. Research on Concept Sementic Similarity Based on Semantic Distance. Master Dissertation. Hunan: Central South University. School of Information Science and Engineering, 2008 (in Chinese) (赵巾帼.基于语义距离的概念语义相似度研究.硕士学位论文.长沙:中南大学信息科学与工程学院, 2008) [7] Liu Qun, Li Sujian. Word Similarity Computing Based on How-net. Computational Linguistics and Chinese Language Processing, 2002, 17(2): 59-76 (in Chinese) (刘 群,李素建.基于《知网》的词汇语义相似度计算. 中文计算语言学, 2002, 17(2): 59-76) [8] Bernstein A, Kiefer C. RDQL-Imprecise Queries Using Similarity Joins for Retrieval in Ontologies // Proc of the 4th International Semantic Web Conference. Galway, Ireland, 2005 [9] Kiefer C, Bernstein A, Lee H J, et al. Semantic Process Retrieval with ISPARQL // Proc of the 4th European Conference on the Semantic Web: Research and Applications. Innsbruck, Austria, 2007: 609-623 [10] Peng Hui, Shi Zhongzhi, Chang Liang, et al. Semantic Web Services Matchmaking Based on Non-Symmetry of Semantic Similarities // Proc of China National Conference on Artificial Intelligence. Beijing: Beijing University of Posts and Telecommunication Press, 2007: 133-138 (in Chinese) (彭 晖,史忠植,常 亮,等.基于非对称语义相似度的语义Web服务匹配//中国人工智能大会论文集.北京:北京邮电大学出版社, 2007: 133-138) [11] Al-Mubaid H, Nguyen H A. A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain // Proc of the 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. New York, USA, 2006: 2713-2717 [12] Cheng Yong, Huang He, Qiu Lirong, et al. Similarity-Based Dynamic Multi-Dimension Concept Mapping Algorithm. Journal of Chinese Computer System, 2006, 26(6): 975-979 (in Chinese) (程 勇,黄 河,邱莉榕,等.一个基于相似度计算的动态多维概念映射算法.小型微型计算机系统, 2006, 26(6): 975-977)