Abstract:A representation method for generalized web sessions based on ontology is proposed, called semantic web session, and the corresponding semantic web session clustering and visualization method is presented. For the sessions clustering, the similarity measure on semantic common paths (SMSCP) for semantic web sessions is defined on the semantic common paths of users navigation. The validity of the similarity measure is verified through the clustering accuracy using the improved k medoids algorithm. Stratograms are employed to visualize the clustering results. Experimental results show that the proposed clustering and visualization methods are effective and understandable.
杨钤雯,寇纪淞,陈富赞,李敏强. 基于本体的语义网络会话聚类和可视化方法[J]. 模式识别与人工智能, 2011, 24(1): 111-116.
YANG Qian-Wen, KOU Ji-Song, CHEN Fu-Zan, LI Min-Qiang. Semantic Web Session Clustering and Visualization Method Based on Ontology. , 2011, 24(1): 111-116.
[1] Srivastava J, Cooley R, Deshpande M, et al. Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. ACM SIGKDD Explorations Newsletter, 2000, 1(2): 12-23 [2] Xiao Jitian, Zhang Yanchun. Clustering of Web Users Using Session Based Similarity Measures // Proc of the International Conference on Computer Networks and Mobile Computing. Beijing, China, 2001: 223-228 [3] Shahabi C, Zarkesh M, Adibi J, et al. Knowledge Discovery from Users Web-Page Navigation // Proc of the 7th International Workshop on Research Issues of Data Mining. Birmingham, USA, 1997: 20-29 [4] Banerjee A, Ghosh J. Clickstream Clustering Using Weighted Longest Common Subsequences // Proc of the Web Mining Workshop at the 1st SIAM Conference on Data Mining. Chicago, USA, 2001: 33-40 [5] Baldi P, Frasconi P, Smyth P. Modeling the Internet and the Web. New York, USA: Wiley, 2003 [6] Bettina B. Detail and Context in Web Usage Mining: Coarsening and Visualizing Sequences // Proc of the 3rd International Workshop on Mining Web Log Data across all Customers Touch Points. London, UK, 2002: 1-24 [7] Oberlander J, Cox R, Monaghan P, et al. Individual Differences in Proof Structures Following Multimodal Logic Teaching // Proc of the Conference on Congnitive Science. San Diego, USA, 1996: 201-206 [8] Berendt B. Using Site Semantics to Analyze, Visualize and Support Navigation. Data Mining and Knowledge Discovery, 2002, 6(1): 37-59 [9] Stumme G. Using Ontologies and Formal Concept Analysis for Organizing Business Knowledge // Proc of Referenzmodellierung. Dresden, Germany, 2001: 163-174 [10] Sheng Weiguo, Liu Xiaohui. A Hybrid Algorithm for k-medoid Clustering of Large Data Sets // Proc of the Congress on Evolutionary Computation. London, UK, 2004: 77-82 [11] Rousseeuw P J. Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. Journal of Computational and Applied Mathematics, 1987, 20(1): 53-65