Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation

Yang, Yuchen; Qiao, Yu; Sun, Xiao

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.07051 (cs)

[Submitted on 12 Dec 2023 (v1), last revised 8 Jul 2024 (this version, v2)]

Title:Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation

Authors:Yuchen Yang, Yu Qiao, Xiao Sun

View PDF HTML (experimental)

Abstract:Automatic estimation of 3D human pose from monocular RGB images is a challenging and unsolved problem in computer vision. In a supervised manner, approaches heavily rely on laborious annotations and present hampered generalization ability due to the limited diversity of 3D pose datasets. To address these challenges, we propose a unified framework that leverages mask as supervision for unsupervised 3D pose estimation. With general unsupervised segmentation algorithms, the proposed model employs skeleton and physique representations that exploit accurate pose information from coarse to fine. Compared with previous unsupervised approaches, we organize the human skeleton in a fully unsupervised way which enables the processing of annotation-free data and provides ready-to-use estimation results. Comprehensive experiments demonstrate our state-of-the-art pose estimation performance on Human3.6M and MPI-INF-3DHP datasets. Further experiments on in-the-wild datasets also illustrate the capability to access more data to boost our model. Code will be available at this https URL.

Comments:	Accepted by ECCV2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.07051 [cs.CV]
	(or arXiv:2312.07051v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.07051

Submission history

From: Yuchen Yang [view email]
[v1] Tue, 12 Dec 2023 08:08:34 UTC (7,741 KB)
[v2] Mon, 8 Jul 2024 11:03:42 UTC (3,429 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators