Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ximu Zeng

LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search

Mar 30, 2025

Ximu Zeng, Liwei Deng, Penghao Chen, Xu Chen, Han Su, Kai Zheng

Figure 1 for LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search

Figure 2 for LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search

Figure 3 for LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search

Figure 4 for LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search

Abstract:Approximate nearest neighbor search is fundamental in information retrieval. Previous partition-based methods enhance search efficiency by probing partial partitions, yet they face two common issues. In the query phase, a common strategy is to probe partitions based on the distance ranks of a query to partition centroids, which inevitably probes irrelevant partitions as it ignores data distribution. In the partition construction phase, all partition-based methods face the boundary problem that separates a query's nearest neighbors to multiple partitions, resulting in a long-tailed kNN distribution and degrading the optimal nprobe (i.e., the number of probing partitions). To address this gap, we propose LIRA, a LearnIng-based queRy-aware pArtition framework. Specifically, we propose a probing model to directly probe the partitions containing the kNN of a query, which can reduce probing waste and allow for query-aware probing with nprobe individually. Moreover, we incorporate the probing model into a learning-based redundancy strategy to mitigate the adverse impact of the long-tailed kNN distribution on search efficiency. Extensive experiments on real-world vector datasets demonstrate the superiority of LIRA in the trade-off among accuracy, latency, and query fan-out. The codes are available at https://github.com/SimoneZeng/LIRA-ANN-search.

* This paper is accepted by WWW 2025

Via

Access Paper or Ask Questions

Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Nov 26, 2024

Liwei Deng, Penghao Chen, Ximu Zeng, Tianfu Wang, Yan Zhao, Kai Zheng

Figure 1 for Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Figure 2 for Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Figure 3 for Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Figure 4 for Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

Abstract:High-dimensional approximate $K$ nearest neighbor search (AKNN) is a fundamental task for various applications, including information retrieval. Most existing algorithms for AKNN can be decomposed into two main components, i.e., candidate generation and distance comparison operations (DCOs). While different methods have unique ways of generating candidates, they all share the same DCO process. In this study, we focus on accelerating the process of DCOs that dominates the time cost in most existing AKNN algorithms. To achieve this, we propose an \underline{D}ata-\underline{A}ware \underline{D}istance \underline{E}stimation approach, called \emph{DADE}, which approximates the \emph{exact} distance in a lower-dimensional space. We theoretically prove that the distance estimation in \emph{DADE} is \emph{unbiased} in terms of data distribution. Furthermore, we propose an optimized estimation based on the unbiased distance estimation formulation. In addition, we propose a hypothesis testing approach to adaptively determine the number of dimensions needed to estimate the \emph{exact} distance with sufficient confidence. We integrate \emph{DADE} into widely-used AKNN search algorithms, e.g., \emph{IVF} and \emph{HNSW}, and conduct extensive experiments to demonstrate the superiority.

* Accepted by VLDB 2025

Via

Access Paper or Ask Questions