Online ranker evaluation is one of the key challenges in information retrieval. Although the preferences between rankers can be inferred by interleaved comparison methods, deciding which pair of rankers to use for generating the result list, without degrading the user experience too much, can be formalized as a K-armed dueling bandit problem: an online partial-information learning framework in which feedback comes in the form of pairwise preferences. A commercial search system may evaluate a large number of rankers concurrently, and how to scale effectively in the presence of numerous rankers has not been fully studied. In this paper, we focus on solving the large-scale online ranker evaluation problem under the so-called Condorcet assumption, i.e., there exists an optimal ranker that is preferred to all other rankers. We propose Merge Double Thompson Sampling (MergeDTS), which first uses a divide-and-conquer strategy that localizes the comparisons carried out by the algorithm to small batches of rankers, and then employs Thompson Sampling (TS) to reduce the number of comparisons between suboptimal rankers inside these batches. The effectiveness (regret) and efficiency (time complexity) of MergeDTS are extensively evaluated using examples from the domain of online evaluation for web search. Our main finding is that, for large-scale Condorcet ranker evaluation problems, MergeDTS outperforms state-of-the-art dueling bandit algorithms.
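To make the two-stage idea concrete, the following is a minimal Python sketch of the batch-then-Thompson-Sampling structure, not the paper's exact algorithm: the `duel` callback (standing in for one interleaved comparison), the round-robin batch schedule, and the posterior-mean elimination rule are all simplifying assumptions introduced here for illustration.

```python
import random
from collections import defaultdict


def merge_dts_sketch(num_rankers, batch_size, horizon, duel,
                     elim_mean=0.95, min_duels=20, prior=1.0):
    """Illustrative sketch of the MergeDTS idea (simplified).

    Rankers are split into small batches; inside a batch, a double Thompson
    Sampling step picks the pair to compare; confidently beaten rankers are
    eliminated; shrunken batches are merged, in the spirit of merge sort.
    `duel(i, j)` is an assumed callback returning True iff i beats j once.
    """
    batches = [list(range(s, min(s + batch_size, num_rankers)))
               for s in range(0, num_rankers, batch_size)]
    wins = defaultdict(int)  # wins[(i, j)]: times ranker i beat ranker j

    for t in range(horizon):
        batch = batches[t % len(batches)]  # simplification: round-robin schedule
        if len(batch) < 2:
            continue
        # First TS draw: sample a win probability for every ordered pair in the
        # batch from its Beta posterior, then pick the ranker whose samples
        # suggest it beats the most batch members (a sampled Condorcet winner).
        theta = {(i, j): random.betavariate(prior + wins[i, j],
                                            prior + wins[j, i])
                 for i in batch for j in batch if i != j}
        first = max(batch, key=lambda i: sum(theta[i, j] > 0.5
                                             for j in batch if j != i))
        # Second TS draw: duel it against its strongest sampled opponent.
        second = max((j for j in batch if j != first),
                     key=lambda j: theta[j, first])
        winner, loser = (first, second) if duel(first, second) else (second, first)
        wins[winner, loser] += 1
        # Eliminate rankers that are confidently beaten inside their batch.
        # (MergeDTS uses confidence bounds; a posterior-mean test stands in here.)
        for i in list(batch):
            for j in list(batch):
                n = wins[i, j] + wins[j, i]
                if i != j and j in batch and n >= min_duels \
                        and wins[i, j] / n >= elim_mean:
                    batch.remove(j)
        # Merge step: fold a batch that has shrunk to one ranker into another.
        if len(batch) == 1 and len(batches) > 1:
            batches.remove(batch)
            batches[0].extend(batch)

    return [i for b in batches for i in b]  # surviving candidate rankers


# Hypothetical usage: ranker 0 is the Condorcet winner in this toy environment.
p = lambda i, j: 0.9 if i < j else (0.1 if i > j else 0.5)
survivors = merge_dts_sketch(num_rankers=16, batch_size=4, horizon=20000,
                             duel=lambda i, j: random.random() < p(i, j))
print(survivors)  # with enough duels, the surviving set tends toward ranker 0
```

The point of the batching is that each Thompson step only reasons about pairs inside one small batch, so the per-step cost grows with the batch size rather than with the total number of rankers, which is what makes the scheme attractive at large scale.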