Steerers: A framework for rotation equivariant keypoint descriptors

Bökman, Georg; Edstedt, Johan; Felsberg, Michael; Kahl, Fredrik

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.02152 (cs)

[Submitted on 4 Dec 2023 (v1), last revised 2 Apr 2024 (this version, v2)]

Title:Steerers: A framework for rotation equivariant keypoint descriptors

Authors:Georg Bökman, Johan Edstedt, Michael Felsberg, Fredrik Kahl

View PDF HTML (experimental)

Abstract:Image keypoint descriptions that are discriminative and matchable over large changes in viewpoint are vital for 3D reconstruction. However, descriptions output by learned descriptors are typically not robust to camera rotation. While they can be made more robust by, e.g., data augmentation, this degrades performance on upright images. Another approach is test-time augmentation, which incurs a significant increase in runtime. Instead, we learn a linear transform in description space that encodes rotations of the input image. We call this linear transform a steerer since it allows us to transform the descriptions as if the image was rotated. From representation theory, we know all possible steerers for the rotation group. Steerers can be optimized (A) given a fixed descriptor, (B) jointly with a descriptor or (C) we can optimize a descriptor given a fixed steerer. We perform experiments in these three settings and obtain state-of-the-art results on the rotation invariant image matching benchmarks AIMS and Roto-360. We publish code and model weights at this https URL.

Comments:	CVPR 2024 Camera ready
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.02152 [cs.CV]
	(or arXiv:2312.02152v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.02152

Submission history

From: Georg Bökman [view email]
[v1] Mon, 4 Dec 2023 18:59:44 UTC (6,986 KB)
[v2] Tue, 2 Apr 2024 09:40:33 UTC (46,071 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Steerers: A framework for rotation equivariant keypoint descriptors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Steerers: A framework for rotation equivariant keypoint descriptors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators