Cohort-based personalized query auto-completion

Dan-yang JIANG; Hong-hui CHEN

doi:10.1631/FITEE.1800010

Front. Inform. Technol. Electron. Eng, 2019, Vol. 20, Issue 9: 1246-1258. DOI: 10.1631/FITEE.1800010

Original Article

Abstract

Query auto-completion (QAC) facilitates query formulation by predicting completions for a given query prefix. Most web search engines use behavioral signals to customize query completion lists for users. To be effective, such personalized QAC models rely on access to sufficient context about each user's interests and intentions. Hence, they often suffer from data sparseness problems. For this reason, we propose the construction and application of cohorts to address context sparsity and to enhance QAC personalization. We build an individual's interest profile by learning his/her topic preferences through topic models and then aggregate users who share similar profiles. As conventional topic models are unable to automatically learn cohorts, we propose two cohort topic models that handle topic modeling and cohort discovery in the same framework. We present four cohort-based personalized QAC models that employ four different cohort discovery strategies. Our proposals use cohorts' contextual information together with query frequency to rank completions. We perform extensive experiments on the publicly available AOL query log and compare the ranking effectiveness with that of models that discard cohort contexts. Experimental results suggest that our cohort-based personalized QAC models can solve the sparseness problem and yield significant relevance improvements over competitive baselines.
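
The general idea of ranking completions with cohort context plus query frequency can be illustrated with a minimal Python sketch. It is not the authors' method: the paper's cohort topic models are replaced here by a much simpler stand-in (grouping users by their dominant topic), and the scoring rule, the weight `lam`, all function names, and the toy data are assumptions made for illustration only.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length topic vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def assign_cohorts(profiles):
    """Group users by dominant topic (a simple stand-in for cohort discovery)."""
    cohorts = defaultdict(list)
    for user, dist in profiles.items():
        dominant = max(range(len(dist)), key=dist.__getitem__)
        cohorts[dominant].append(user)
    return dict(cohorts)


def cohort_profile(members, profiles):
    """Average the topic distributions of all cohort members."""
    n_topics = len(profiles[members[0]])
    return [sum(profiles[m][i] for m in members) / len(members)
            for i in range(n_topics)]


def rank_completions(prefix, candidates, cohort_vec, lam=0.5):
    """Rank completions by a hypothetical mix of frequency and cohort similarity.

    candidates maps each query to (frequency, topic distribution).
    lam weights normalized frequency against topical similarity.
    """
    matching = {q: v for q, v in candidates.items() if q.startswith(prefix)}
    if not matching:
        return []
    max_freq = max(freq for freq, _ in matching.values())
    scored = []
    for query, (freq, topics) in matching.items():
        score = lam * freq / max_freq + (1 - lam) * cosine(topics, cohort_vec)
        scored.append((score, query))
    # Highest score first; ties broken alphabetically.
    return [q for _, q in sorted(scored, key=lambda item: (-item[0], item[1]))]


# Toy data (invented): two topics, three users, three candidate queries.
profiles = {"u1": [0.9, 0.1], "u2": [0.8, 0.2], "u3": [0.1, 0.9]}
candidates = {
    "java tutorial": (100, [0.95, 0.05]),
    "java island": (120, [0.05, 0.95]),
    "jaguar": (50, [0.5, 0.5]),
}

cohorts = assign_cohorts(profiles)
for topic, members in cohorts.items():
    vec = cohort_profile(members, profiles)
    print(topic, members, rank_completions("java", candidates, vec))
```

In this toy setting, a user with little search history of their own would still receive a cohort-specific ranking, because the cohort's averaged profile supplies the missing context; the most frequent query no longer wins automatically.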

Keywords

Query auto-completion / Cohort-based retrieval / Topic models

Cite this article

Dan-yang JIANG, Hong-hui CHEN. Cohort-based personalized query auto-completion. Front. Inform. Technol. Electron. Eng, 2019, 20(9): 1246‒1258 https://doi.org/10.1631/FITEE.1800010