MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

Rogez, Grégory; Schmid, Cordelia

Computer Science > Computer Vision and Pattern Recognition

arXiv:1607.02046 (cs)

[Submitted on 7 Jul 2016 (v1), last revised 28 Oct 2016 (this version, v2)]

Title:MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

Authors:Grégory Rogez, Cordelia Schmid

View PDF

Abstract:This paper addresses the problem of 3D human pose estimation in the wild. A significant challenge is the lack of training data, i.e., 2D images of humans annotated with 3D poses. Such data is necessary to train state-of-the-art CNN architectures. Here, we propose a solution to generate a large set of photorealistic synthetic images of humans with 3D pose annotations. We introduce an image-based synthesis engine that artificially augments a dataset of real images with 2D human pose annotations using 3D Motion Capture (MoCap) data. Given a candidate 3D pose our algorithm selects for each joint an image whose 2D pose locally matches the projected 3D pose. The selected images are then combined to generate a new synthetic image by stitching local image patches in a kinematically constrained manner. The resulting images are used to train an end-to-end CNN for full-body 3D pose estimation. We cluster the training data into a large number of pose classes and tackle pose estimation as a K-way classification problem. Such an approach is viable only with large training sets such as ours. Our method outperforms the state of the art in terms of 3D pose estimation in controlled environments (Human3.6M) and shows promising results for in-the-wild images (LSP). This demonstrates that CNNs trained on artificial images generalize well to real images.

Comments:	9 pages, accepted to appear in NIPS 2016
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1607.02046 [cs.CV]
	(or arXiv:1607.02046v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1607.02046

Submission history

From: Grégory Rogez [view email]
[v1] Thu, 7 Jul 2016 15:30:05 UTC (3,436 KB)
[v2] Fri, 28 Oct 2016 12:43:51 UTC (3,247 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators