Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mathew Jacob

Drowning in Documents: Consequences of Scaling Reranker Inference

Nov 18, 2024

Mathew Jacob, Erik Lindgren, Matei Zaharia, Michael Carbin, Omar Khattab, Andrew Drozdov

Figure 1 for Drowning in Documents: Consequences of Scaling Reranker Inference

Figure 2 for Drowning in Documents: Consequences of Scaling Reranker Inference

Figure 3 for Drowning in Documents: Consequences of Scaling Reranker Inference

Figure 4 for Drowning in Documents: Consequences of Scaling Reranker Inference

Abstract:Rerankers, typically cross-encoders, are often used to re-score the documents retrieved by cheaper initial IR systems. This is because, though expensive, rerankers are assumed to be more effective. We challenge this assumption by measuring reranker performance for full retrieval, not just re-scoring first-stage retrieval. Our experiments reveal a surprising trend: the best existing rerankers provide diminishing returns when scoring progressively more documents and actually degrade quality beyond a certain limit. In fact, in this setting, rerankers can frequently assign high scores to documents with no lexical or semantic overlap with the query. We hope that our findings will spur future research to improve reranking.

Via

Access Paper or Ask Questions