Vector search with small radiuses

Szilvasy, Gergely; Mazaré, Pierre-Emmanuel; Douze, Matthijs

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.10746 (cs)

[Submitted on 16 Mar 2024]

Title:Vector search with small radiuses

Authors:Gergely Szilvasy, Pierre-Emmanuel Mazaré, Matthijs Douze

View PDF HTML (experimental)

Abstract:In recent years, the dominant accuracy metric for vector search is the recall of a result list of fixed size (top-k retrieval), considering as ground truth the exact vector retrieval results. Although convenient to compute, this metric is distantly related to the end-to-end accuracy of a full system that integrates vector search. In this paper we focus on the common case where a hard decision needs to be taken depending on the vector retrieval results, for example, deciding whether a query image matches a database image or not. We solve this as a range search task, where all vectors within a certain radius from the query are returned.
We show that the value of a range search result can be modeled rigorously based on the query-to-vector distance. This yields a metric for range search, RSM, that is both principled and easy to compute without running an end-to-end evaluation. We apply this metric to the case of image retrieval. We show that indexing methods that are adapted for top-k retrieval do not necessarily maximize the RSM. In particular, for inverted file based indexes, we show that visiting a limited set of clusters and encoding vectors compactly yields near optimal results.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
Cite as:	arXiv:2403.10746 [cs.CV]
	(or arXiv:2403.10746v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.10746

Submission history

From: Matthijs Douze [view email]
[v1] Sat, 16 Mar 2024 00:34:25 UTC (1,177 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Vector search with small radiuses

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Vector search with small radiuses

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators