SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Liang, Yujia; Ye, Zixuan; Liu, Wenze; Lu, Hao


arXiv:2407.13483 (cs)

[Submitted on 18 Jul 2024]



Abstract: Category-Agnostic Pose Estimation (CAPE) aims to localize keypoints on an object of any category given a few exemplars in an in-context manner. Prior arts involve sophisticated designs, e.g., sundry modules for similarity calculation and a two-stage framework, or take in extra heatmap generation and supervision. We notice that CAPE is essentially a feature-matching task, which can be solved within the attention process. Therefore, we first streamline the architecture into a simple baseline consisting of several pure self-attention layers and an MLP regression head -- this simplification means that one only needs to consider the attention quality to boost the performance of CAPE. Towards an effective attention process for CAPE, we further introduce two key modules: i) a global keypoint feature perceptor to inject global semantic information into support keypoints, and ii) a keypoint attention refiner to enhance inter-node correlation between keypoints. They jointly form a Simple and strong Category-Agnostic Pose Estimator (SCAPE). Experimental results show that SCAPE outperforms prior arts by 2.2 and 1.3 PCK under the 1-shot and 5-shot settings, respectively, with faster inference speed and lighter model capacity, excelling in both accuracy and efficiency. Code and models are available at this https URL
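
The baseline described in the abstract (pure self-attention layers followed by an MLP regression head) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`self_attention`, `baseline_forward`, `init_params`), the single-head attention, the residual connections, the sigmoid output, and all tensor shapes are assumptions made for illustration. The global keypoint feature perceptor and the keypoint attention refiner are not modeled here.

```python
import numpy as np


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over all tokens."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights


def init_params(dim, num_layers, rng):
    """Random weights for the sketch (hypothetical layout, not trained)."""
    scale = dim ** -0.5
    layers = [
        tuple(rng.normal(scale=scale, size=(dim, dim)) for _ in range(3))
        for _ in range(num_layers)
    ]
    return {
        "layers": layers,
        "w1": rng.normal(scale=scale, size=(dim, dim)),
        "w2": rng.normal(scale=scale, size=(dim, 2)),
    }


def baseline_forward(support_kpt_feats, query_feats, params):
    """Assumed shapes: support_kpt_feats (K, D), query_feats (N, D).

    Returns a (K, 2) array of normalized (x, y) keypoint coordinates.
    """
    num_kpts = support_kpt_feats.shape[0]
    # Support keypoint tokens and query image tokens share one sequence,
    # so matching between them happens inside the attention itself.
    tokens = np.concatenate([support_kpt_feats, query_feats], axis=0)
    for w_q, w_k, w_v in params["layers"]:
        attended, _ = self_attention(tokens, w_q, w_k, w_v)
        tokens = tokens + attended  # residual connection (assumption)
    kpt_tokens = tokens[:num_kpts]
    hidden = np.maximum(kpt_tokens @ params["w1"], 0.0)  # ReLU
    logits = hidden @ params["w2"]
    return 1.0 / (1.0 + np.exp(-logits))  # sigmoid maps to [0, 1]


rng = np.random.default_rng(0)
params = init_params(dim=16, num_layers=2, rng=rng)
coords = baseline_forward(
    rng.normal(size=(5, 16)),   # 5 support keypoint features
    rng.normal(size=(64, 16)),  # 64 query patch features
    params,
)
print(coords.shape)  # (5, 2)
```

With untrained random weights the predicted coordinates are meaningless; the sketch only shows the data flow, in which one regressed (x, y) pair is read off each keypoint token after the attention layers.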

Comments:	Accepted to ECCV 2024. Code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.13483 [cs.CV]
	(or arXiv:2407.13483v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.13483

Submission history

From: Hao Lu
[v1] Thu, 18 Jul 2024 13:02:57 UTC (18,818 KB)
