Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Peng, Jianyi; Lu, Fan; Li, Bin; Huang, Yuan; Qu, Sanqing; Chen, Guang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.11742 (cs)

[Submitted on 17 Feb 2025]

Title:Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Authors:Jianyi Peng, Fan Lu, Bin Li, Yuan Huang, Sanqing Qu, Guang Chen

View PDF HTML (experimental)

Abstract:Image-to-point cloud cross-modal Visual Place Recognition (VPR) is a challenging task where the query is an RGB image, and the database samples are LiDAR point clouds. Compared to single-modal VPR, this approach benefits from the widespread availability of RGB cameras and the robustness of point clouds in providing accurate spatial geometry and distance information. However, current methods rely on intermediate modalities that capture either the vertical or horizontal field of view, limiting their ability to fully exploit the complementary information from both sensors. In this work, we propose an innovative initial retrieval + re-rank method that effectively combines information from range (or RGB) images and Bird's Eye View (BEV) images. Our approach relies solely on a computationally efficient global descriptor similarity search process to achieve re-ranking. Additionally, we introduce a novel similarity label supervision technique to maximize the utility of limited training data. Specifically, we employ points average distance to approximate appearance similarity and incorporate an adaptive margin, based on similarity differences, into the vanilla triplet loss. Experimental results on the KITTI dataset demonstrate that our method significantly outperforms state-of-the-art approaches.

Comments:	Submmitted to IEEE IV 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.11742 [cs.CV]
	(or arXiv:2502.11742v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.11742

Submission history

From: Jianyi Peng [view email]
[v1] Mon, 17 Feb 2025 12:29:26 UTC (2,230 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators