Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization

Aiger, Dror; Araujo, André; Lynen, Simon

arXiv:2306.09012 (cs)

[Submitted on 15 Jun 2023 (v1), last revised 29 Dec 2023 (this version, v3)]

Abstract: Large-scale visual localization systems continue to rely on 3D point clouds built from image collections using structure-from-motion. While the 3D points in these models are represented using local image features, directly matching a query image's local features against the point cloud is challenging due to the scale of the nearest-neighbor search problem. Many recent approaches to visual localization have thus proposed a hybrid method, where a global (per-image) embedding is first used to retrieve a small subset of database images, and the local features of the query are matched only against those. It seems to have become a common belief that global embeddings are critical for this image retrieval step in visual localization, despite the significant downside of having to compute two feature types for each query image. In this paper, we take a step back from this assumption and propose Constrained Approximate Nearest Neighbors (CANN), a joint solution of k-nearest-neighbors across both the geometry and appearance spaces using only local features. We first derive the theoretical foundation for k-nearest-neighbor retrieval across multiple metrics and then showcase how CANN improves visual localization. Our experiments on public localization benchmarks demonstrate that our method significantly outperforms both state-of-the-art global feature-based retrieval and approaches using local feature aggregation schemes. Moreover, it is an order of magnitude faster in both index and query time than feature aggregation schemes for these datasets. Code: \url{this https URL}
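
For orientation, the hybrid baseline that the abstract contrasts CANN with (global-embedding retrieval followed by local feature matching restricted to the retrieved images) can be sketched roughly as below. This is a toy illustration only and is not the CANN method or the authors' code: the function names (`retrieve_images`, `match_local`), the use of cosine similarity for global embeddings, the brute-force squared-Euclidean search over local descriptors, and the tiny hand-made data are all assumptions made for the sake of the example.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def retrieve_images(query_global, db_globals, top_k):
    """Stage 1 (assumed): rank database images by global-embedding similarity."""
    ranked = sorted(
        db_globals,
        key=lambda img: cosine(query_global, db_globals[img]),
        reverse=True,
    )
    return ranked[:top_k]

def match_local(query_locals, db_locals, image_ids):
    """Stage 2 (assumed): match each query local descriptor to its nearest
    database descriptor, searching only the shortlisted images."""
    matches = []
    for qi, q in enumerate(query_locals):
        best = None  # (distance, image id, descriptor index)
        for img in image_ids:
            for di, d in enumerate(db_locals[img]):
                dist = sq_dist(q, d)
                if best is None or dist < best[0]:
                    best = (dist, img, di)
        if best is not None:
            matches.append((qi, best[1], best[2]))
    return matches

# Hypothetical toy database: one global embedding and a few local
# descriptors per image (2-D vectors purely for readability).
db_globals = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [0.9, 0.1]}
db_locals = {
    "a": [[0.0, 0.0], [1.0, 1.0]],
    "b": [[5.0, 5.0]],
    "c": [[0.1, 0.0]],
}
query_global = [1.0, 0.05]
query_locals = [[0.9, 1.0], [0.1, 0.1]]

shortlist = retrieve_images(query_global, db_globals, top_k=2)
matches = match_local(query_locals, db_locals, shortlist)
print(shortlist)  # images kept after the global retrieval stage
print(matches)    # (query feature index, image id, database feature index)
```

The sketch makes the cost noted in the abstract visible: the query needs both a global embedding and local descriptors, and any image dropped in the first stage can never be matched in the second.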

Comments:	ICCV23 camera-ready + appendix
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.09012 [cs.CV]
	(or arXiv:2306.09012v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.09012

Submission history

From: Dror Aiger [view email]
[v1] Thu, 15 Jun 2023 10:12:10 UTC (21,143 KB)
[v2] Wed, 15 Nov 2023 10:16:55 UTC (21,154 KB)
[v3] Fri, 29 Dec 2023 10:35:52 UTC (21,213 KB)
