Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation

Li, Sijin; Zhang, Weichen; Chan, Antoni B.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1508.06708 (cs)

[Submitted on 27 Aug 2015]

Title:Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation

Authors:Sijin Li, Weichen Zhang, Antoni B. Chan

View PDF

Abstract:This paper focuses on structured-output learning using deep neural networks for 3D human pose estimation from monocular images. Our network takes an image and 3D pose as inputs and outputs a score value, which is high when the image-pose pair matches and low otherwise. The network structure consists of a convolutional neural network for image feature extraction, followed by two sub-networks for transforming the image features and pose into a joint embedding. The score function is then the dot-product between the image and pose embeddings. The image-pose embedding and score function are jointly trained using a maximum-margin cost function. Our proposed framework can be interpreted as a special form of structured support vector machines where the joint feature space is discriminatively learned using deep neural networks. We test our framework on the Human3.6m dataset and obtain state-of-the-art results compared to other recent methods. Finally, we present visualizations of the image-pose embedding space, demonstrating the network has learned a high-level embedding of body-orientation and pose-configuration.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1508.06708 [cs.CV]
	(or arXiv:1508.06708v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1508.06708

Submission history

From: Sijin Li [view email]
[v1] Thu, 27 Aug 2015 03:21:15 UTC (1,861 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sijin Li
Weichen Zhang
Antoni B. Chan

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators