Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haixin Lin

Optimizing Guided Traversal for Fast Learned Sparse Retrieval

May 02, 2023

Yifan Qiao, Yingrui Yang, Haixin Lin, Tao Yang

Figure 1 for Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Figure 2 for Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Figure 3 for Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Figure 4 for Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Abstract:Recent studies show that BM25-driven dynamic index skipping can greatly accelerate MaxScore-based document retrieval based on the learned sparse representation derived by DeepImpact. This paper investigates the effectiveness of such a traversal guidance strategy during top k retrieval when using other models such as SPLADE and uniCOIL, and finds that unconstrained BM25-driven skipping could have a visible relevance degradation when the BM25 model is not well aligned with a learned weight model or when retrieval depth k is small. This paper generalizes the previous work and optimizes the BM25 guided index traversal with a two-level pruning control scheme and model alignment for fast retrieval using a sparse representation. Although there can be a cost of increased latency, the proposed scheme is much faster than the original MaxScore method without BM25 guidance while retaining the relevance effectiveness. This paper analyzes the competitiveness of this two-level pruning scheme, and evaluates its tradeoff in ranking relevance and time efficiency when searching several test datasets.

* In Proceedings of the ACM Web Conference 2023 (pp. 3375-3385)
* This paper is published in WWW'23

Via

Access Paper or Ask Questions

Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

Apr 23, 2022

Yifan Qiao, Yingrui Yang, Haixin Lin, Tianbo Xiong, Xiyue Wang, Tao Yang

Figure 1 for Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

Figure 2 for Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

Figure 3 for Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

Figure 4 for Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

Abstract:This paper proposes a dual skipping guidance scheme with hybrid scoring to accelerate document retrieval that uses learned sparse representations while still delivering a good relevance. This scheme uses both lexical BM25 and learned neural term weights to bound and compose the rank score of a candidate document separately for skipping and final ranking, and maintains two top-k thresholds during inverted index traversal. This paper evaluates time efficiency and ranking relevance of the proposed scheme in searching MS MARCO TREC datasets.

Via

Access Paper or Ask Questions