Identification of Query Intents via Combining Multiple Features
WU Da-Yong1,ZHAO Shi-Qi1,2,LIU Ting1,ZHANG Yu1
1. Research Center for Social Computing and Information Retrieval,Harbin Institute of Technology,Harbin 150001 2.Baidu Online Network Technology Co.Ltd,Beijing 100085
Abstract:Identifying underlying user intents of search engine queries is a hotspot in the field of web information retrieval. An approach to identifying user intents of search engine queries is proposed based on features from various sources. Specifically, the query intent identification is regarded as a classification problem. The classification features are extracted from various sources including query texts, search engine feedbacks and query logs. The method is evaluated on the real web query data. The experimental results show that the exploited features are helpful to improve the identification performance. Furthermore, about 88.5% of the test queries can be correctly identified with the classification framework via combining all the features.
[1] Broder A.A Taxonomy of Web Search.SIGIR Forum,2002,36(2): 3-10 [2] Rose D E,Levinson D.Understanding User Goals in Web Search // Proc of the 13th International Conference on World Wide Web.New York,USA,2004: 13-19 [3] Liu Yiqun,Zhang Min,Ru Liyun,et al.Automatic Query Type Identification Based on Click through Information // Proc of the 3rd Asia Information Retrieval Symposium.Singapore,Singapore,2006: 593-600 [4] Lee U,Liu Zhenyu,Cho J.Automatic Identification of User Goals in Web Search // Proc of the 14th International Conference on World Wide Web.Chiba,Japan,2005: 391-400 [5] Baeza-Yates R,Calderon-Benavides L,Gonzalez-Caro C.The Intention behind Web Queries // Proc of the 13th International Conference on String Processing and Information Retrieval.Glasgow,UK,2006: 98-109 [6] Zhang Sen,Wang Bin.A Survey of Web Search Query Intention Classification.Journal of Chinese Information Processing,2008,22(4): 75-82 (in Chinese) (张 森,王 斌.Web检索查询意图分类技术综述.中文信息学报,2008,22(4): 75-82) [7] Yuan Xiaojie,Dou Zhicheng,Zhang Lu,et al.Automatic User Goals Identification Based on Anchor Text and Click-Through Data.Wuhan University Journal of Natural Sciences,2008,13(4): 495-500 [8] Brenes D J,Gayo-Avello D.Automatic Detection of Navigational Queries according to Behavioural Characteristics // Proc of the Lernen-Wissen-Adaptivitat Workshop on Information Retrieval.Würzburg,Germany,2008: 41-48 [9] Herrera M R,Moura E S,Cristo M,et al.Exploring Features for the Automatic Identification of User Goals in Web Search.Information Processing and Management: An International Journal,2010,46(2): 131-142 [10] Jansen B J,Booth D L,Spink A.Determining the Informational,Navigational,and Transactional Intent of Web Queries.Information Processing and Management: An International Journal,2008,44(3): 1251-1266 [11] Vapnik V N.Statistical Learning Theory.New York,USA: Wiley,1998 [12] Landis J R,Koch G G.The Measurement of Observer Agreement for Categorical Data.Biometrics,1977,33(1): 159-174