Content area

Abstract

dentifying ambiguous queries is crucial to research on personalized Web search and search result diversity. Intuitively, query logs contain valuable information on how many intentions users have when issuing a query. However, previous work showed user clicks alone are misleading in judging a query as being ambiguous or not. In this paper, we address the problem of learning a query ambiguity model by using search logs. First, we propose enriching a query by mining the documents clicked by users and the relevant follow up queries in a session. Second, we use a text classifier to map the documents and the queries into predefined categories. Third, we propose extracting features from the processed data. Finally, we apply a state-of-the-art algorithm, Support Vector Machine (SVM), to learn a query ambiguity classifier. Experimental results verify that the sole use of click based features or session based features perform worse than the previous work based on top retrieved documents. When we combine the two sets of features, our proposed approach achieves the best effectiveness, specifically 86% in terms of accuracy. It significantly improves the click based method by 5.6% and the session based method by 4.6%.

Details

Title
Learning Query Ambiguity Models by Using Search Logs
Author
Song, Ruihua 1 ; Dou, Zhicheng 2 ; Hon, Hsiao-Wuen 2 ; Yu, Yong 3 

 Shanghai Jiao Tong University, Department of Computer Science, Shanghai, China (GRID:grid.16821.3c) (ISNI:0000000403688293); Microsoft Research Asia, Beijing, China (GRID:grid.466946.f) (ISNI:0000000122165314) 
 Microsoft Research Asia, Beijing, China (GRID:grid.466946.f) (ISNI:0000000122165314) 
 Shanghai Jiao Tong University, Department of Computer Science, Shanghai, China (GRID:grid.16821.3c) (ISNI:0000000403688293) 
Pages
728-738
Publication year
2010
Publication date
Jul 2010
Publisher
Springer Nature B.V.
ISSN
10009000
e-ISSN
18604749
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
872095479
Copyright
© Springer 2010.