Pose for Everything: Towards Category-Agnostic Pose Estimation

Xu, Lumin; Jin, Sheng; Zeng, Wang; Liu, Wentao; Qian, Chen; Ouyang, Wanli; Luo, Ping; Wang, Xiaogang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.10387 (cs)

[Submitted on 21 Jul 2022]

Title:Pose for Everything: Towards Category-Agnostic Pose Estimation

Authors:Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang

View PDF

Abstract:Existing works on 2D pose estimation mainly focus on a certain category, e.g. human, animal, and vehicle. However, there are lots of application scenarios that require detecting the poses/keypoints of the unseen class of objects. In this paper, we introduce the task of Category-Agnostic Pose Estimation (CAPE), which aims to create a pose estimation model capable of detecting the pose of any class of object given only a few samples with keypoint definition. To achieve this goal, we formulate the pose estimation problem as a keypoint matching problem and design a novel CAPE framework, termed POse Matching Network (POMNet). A transformer-based Keypoint Interaction Module (KIM) is proposed to capture both the interactions among different keypoints and the relationship between the support and query images. We also introduce Multi-category Pose (MP-100) dataset, which is a 2D pose dataset of 100 object categories containing over 20K instances and is well-designed for developing CAPE algorithms. Experiments show that our method outperforms other baseline approaches by a large margin. Codes and data are available at this https URL.

Comments:	ECCV 2022 Oral
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.10387 [cs.CV]
	(or arXiv:2207.10387v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.10387

Submission history

From: Lumin Xu [view email]
[v1] Thu, 21 Jul 2022 09:40:54 UTC (8,135 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Pose for Everything: Towards Category-Agnostic Pose Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Pose for Everything: Towards Category-Agnostic Pose Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators