GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting

Di, Yan; Zhang, Ruida; Lou, Zhiqiang; Manhardt, Fabian; Ji, Xiangyang; Navab, Nassir; Tombari, Federico

arXiv:2203.07918 (cs)

[Submitted on 15 Mar 2022 (v1), last revised 17 Mar 2022 (this version, v2)]

Abstract: While 6D object pose estimation has recently made a huge leap forward, most methods can still only handle a single or a handful of different objects, which limits their applications. To circumvent this problem, category-level object pose estimation has recently been revamped, which aims at predicting the 6D pose as well as the 3D metric size for previously unseen instances from a given set of object classes. This is, however, a much more challenging task due to severe intra-class shape variations. To address this issue, we propose GPV-Pose, a novel framework for robust category-level pose estimation, harnessing geometric insights to enhance the learning of category-level pose-sensitive features. First, we introduce a decoupled confidence-driven rotation representation, which allows geometry-aware recovery of the associated rotation matrix. Second, we propose a novel geometry-guided point-wise voting paradigm for robust retrieval of the 3D object bounding box. Finally, leveraging these different output streams, we can enforce several geometric consistency terms, further increasing performance, especially for non-symmetric categories. GPV-Pose produces superior results to state-of-the-art competitors on common public benchmarks, whilst almost achieving real-time inference speed at 20 FPS.
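
To make the idea of a confidence-driven rotation representation more concrete, the following is a minimal sketch of one way a rotation matrix could be recovered from two predicted axis directions and their confidences. It is not taken from the paper or its code release: the function name `rotation_from_two_axes`, the inputs `rx`, `ry`, `cx`, `cy`, and the rule that splits the orthogonality correction in inverse proportion to confidence are all assumptions made for illustration.

```python
import numpy as np


def rotation_from_two_axes(rx, ry, cx, cy):
    """Hypothetical helper: build a rotation matrix from two predicted axes.

    rx, ry: predicted 3D directions, not necessarily unit length or orthogonal.
    cx, cy: non-negative confidences; the less trusted axis is moved more.
    """
    rx = np.asarray(rx, dtype=float)
    ry = np.asarray(ry, dtype=float)
    rx = rx / np.linalg.norm(rx)
    ry = ry / np.linalg.norm(ry)

    # The third axis is the normal of the plane spanned by the two predictions.
    normal = np.cross(rx, ry)
    sin_theta = np.linalg.norm(normal)
    if sin_theta < 1e-8:
        raise ValueError("predicted axes are (nearly) parallel")
    rz = normal / sin_theta

    # Orthonormal basis of that plane: e1 along rx, e2 on the side of ry.
    e1 = rx
    e2 = np.cross(rz, rx)

    # Angle between the predictions and its deviation from 90 degrees.
    theta = np.arctan2(sin_theta, np.dot(rx, ry))
    excess = theta - np.pi / 2

    total = cx + cy
    if total <= 0:
        raise ValueError("at least one confidence must be positive")

    # Assumed weighting: each axis absorbs a share of the correction
    # proportional to the confidence of the *other* axis.
    ax = excess * cy / total
    ay = excess * cx / total

    new_rx = np.cos(ax) * e1 + np.sin(ax) * e2
    new_ry = np.cos(theta - ay) * e1 + np.sin(theta - ay) * e2

    # Columns are the corrected x, y and z axes.
    return np.stack([new_rx, new_ry, rz], axis=1)
```

Under this assumed weighting, an axis with confidence 1 paired with an axis of confidence 0 is left untouched, while equal confidences split the correction evenly between the two axes. The result is orthonormal with determinant +1 by construction, since the two corrected axes end up exactly 90 degrees apart within their common plane.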

Comments:	CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.07918 [cs.CV]
	(or arXiv:2203.07918v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.07918

Submission history

From: Ruida Zhang
[v1] Tue, 15 Mar 2022 13:58:50 UTC (7,514 KB)
[v2] Thu, 17 Mar 2022 14:12:21 UTC (7,514 KB)
