SHIC: Shape-Image Correspondences with no Keypoint Supervision

Shtedritski, Aleksandar; Rupprecht, Christian; Vedaldi, Andrea

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.18907 (cs)

[Submitted on 26 Jul 2024]

Title:SHIC: Shape-Image Correspondences with no Keypoint Supervision

Authors:Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

View PDF HTML (experimental)

Abstract:Canonical surface mapping generalizes keypoint detection by assigning each pixel of an object to a corresponding point in a 3D template. Popularised by DensePose for the analysis of humans, authors have since attempted to apply the concept to more categories, but with limited success due to the high cost of manual supervision. In this work, we introduce SHIC, a method to learn canonical maps without manual supervision which achieves better results than supervised methods for most categories. Our idea is to leverage foundation computer vision models such as DINO and Stable Diffusion that are open-ended and thus possess excellent priors over natural categories. SHIC reduces the problem of estimating image-to-template correspondences to predicting image-to-image correspondences using features from the foundation models. The reduction works by matching images of the object to non-photorealistic renders of the template, which emulates the process of collecting manual annotations for this task. These correspondences are then used to supervise high-quality canonical maps for any object of interest. We also show that image generators can further improve the realism of the template views, which provide an additional source of supervision for the model.

Comments:	ECCV 2024. Project website this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.18907 [cs.CV]
	(or arXiv:2407.18907v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.18907

Submission history

From: Aleksandar Shtedritski [view email]
[v1] Fri, 26 Jul 2024 17:58:59 UTC (16,509 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SHIC: Shape-Image Correspondences with no Keypoint Supervision

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SHIC: Shape-Image Correspondences with no Keypoint Supervision

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators