Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

Humenberger, Martin; Cabon, Yohann; Pion, Noé; Weinzaepfel, Philippe; Lee, Donghwan; Guérin, Nicolas; Sattler, Torsten; Csurka, Gabriela

doi:10.1007/s11263-022-01615-7

Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.15761 (cs)

[Submitted on 31 May 2022]

Title:Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

Authors:Martin Humenberger, Yohann Cabon, Noé Pion, Philippe Weinzaepfel, Donghwan Lee, Nicolas Guérin, Torsten Sattler, Gabriela Csurka

View PDF

Abstract:Visual localization, i.e., camera pose estimation in a known scene, is a core component of technologies such as autonomous driving and augmented reality. State-of-the-art localization approaches often rely on image retrieval techniques for one of two purposes: (1) provide an approximate pose estimate or (2) determine which parts of the scene are potentially visible in a given query image. It is common practice to use state-of-the-art image retrieval algorithms for both of them. These algorithms are often trained for the goal of retrieving the same landmark under a large range of viewpoint changes which often differs from the requirements of visual localization. In order to investigate the consequences for visual localization, this paper focuses on understanding the role of image retrieval for multiple visual localization paradigms. First, we introduce a novel benchmark setup and compare state-of-the-art retrieval representations on multiple datasets using localization performance as metric. Second, we investigate several definitions of "ground truth" for image retrieval. Using these definitions as upper bounds for the visual localization paradigms, we show that there is still sgnificant room for improvement. Third, using these tools and in-depth analysis, we show that retrieval performance on classical landmark retrieval or place recognition tasks correlates only for some but not all paradigms to localization performance. Finally, we analyze the effects of blur and dynamic scenes in the images. We conclude that there is a need for retrieval approaches specifically designed for localization paradigms. Our benchmark and evaluation protocols are available at this https URL.

Comments:	International Journal of Computer Vision (2022). arXiv admin note: text overlap with arXiv:2011.11946
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2205.15761 [cs.CV]
	(or arXiv:2205.15761v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.15761
Related DOI:	https://doi.org/10.1007/s11263-022-01615-7

Submission history

From: Martin Humenberger [view email]
[v1] Tue, 31 May 2022 12:59:01 UTC (24,603 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators