Self-supervised Learning of Pose Embeddings from Spatiotemporal Relations in Videos

Sümer, Ömer; Dencker, Tobias; Ommer, Björn

arXiv:1708.02179 (cs)

[Submitted on 7 Aug 2017]

Abstract: Human pose analysis is presently dominated by deep convolutional networks trained with extensive manual annotations of joint locations and beyond. To avoid the need for expensive labeling, we exploit spatiotemporal relations in training videos for self-supervised learning of pose embeddings. The key idea is to combine temporal ordering and spatial placement estimation as auxiliary tasks for learning pose similarities in a Siamese convolutional network. Since the self-supervised sampling of both tasks from natural videos can result in ambiguous and incorrect training labels, our method employs curriculum learning: training starts with the most reliable data samples and gradually increases the difficulty. To further refine the training process, we mine repetitive poses in individual videos, which provide reliable labels while removing inconsistencies. Our pose embeddings capture visual characteristics of human pose that can boost existing supervised representations in human pose estimation and retrieval. We report quantitative and qualitative results for these tasks on the Olympic Sports, Leeds Sports Pose and MPII Human Pose datasets.
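
As a rough illustration of the temporal-ordering task and the curriculum described in the abstract, the Python sketch below samples frame pairs from a single video and labels them by temporal distance. It is not the authors' implementation: the function names `sample_temporal_pairs` and `curriculum_schedule`, the parameters `pos_range` and `neg_gap`, the 50/50 positive/negative split, and the rule that nearby frames count as similar poses while distant frames count as dissimilar are all assumptions made for illustration.

```python
import random


def curriculum_schedule(num_stages, easy, hard):
    """Interpolate (pos_range, neg_gap) linearly from an easy to a hard setting.

    Hypothetical helper: `easy` and `hard` are (pos_range, neg_gap) tuples.
    Easy stages use very close positives and very distant negatives; harder
    stages widen the positive window and shrink the negative gap.
    """
    stages = []
    for s in range(num_stages):
        t = s / max(1, num_stages - 1)  # 0.0 at the first stage, 1.0 at the last
        pos_range = round(easy[0] + t * (hard[0] - easy[0]))
        neg_gap = round(easy[1] + t * (hard[1] - easy[1]))
        stages.append((pos_range, neg_gap))
    return stages


def sample_temporal_pairs(num_frames, pos_range, neg_gap, num_pairs, seed=0):
    """Sample (anchor, candidate, label) triples of frame indices from one video.

    Assumed labeling rule (not taken from the paper):
      label 1: candidate lies within `pos_range` frames of the anchor
      label 0: candidate lies at least `neg_gap` frames away from the anchor
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(num_pairs):
        anchor = rng.randrange(num_frames)
        if rng.random() < 0.5:
            lo = max(0, anchor - pos_range)
            hi = min(num_frames - 1, anchor + pos_range)
            candidates = [i for i in range(lo, hi + 1) if i != anchor]
            label = 1
        else:
            candidates = [i for i in range(num_frames) if abs(i - anchor) >= neg_gap]
            label = 0
        if not candidates:
            continue  # video too short for this setting; skip the draw
        pairs.append((anchor, rng.choice(candidates), label))
    return pairs


# Usage: draw pairs stage by stage, from the most reliable to the hardest.
for pos_range, neg_gap in curriculum_schedule(3, easy=(1, 30), hard=(5, 10)):
    stage_pairs = sample_temporal_pairs(100, pos_range, neg_gap, num_pairs=8)
```

In a full pipeline, each pair would be fed to the two branches of a Siamese network with the label as the binary target. The spatial placement task and the mining of repetitive poses mentioned in the abstract are not covered by this sketch.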

Comments:	To appear in ICCV 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1708.02179 [cs.CV]
	(or arXiv:1708.02179v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1708.02179

Submission history

From: Tobias Dencker [view email]
[v1] Mon, 7 Aug 2017 15:57:32 UTC (3,423 KB)
