ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

Jayaraman, Dinesh; Gao, Ruohan; Grauman, Kristen

Computer Science > Computer Vision and Pattern Recognition

arXiv:1709.00505 (cs)

[Submitted on 1 Sep 2017 (v1), last revised 31 Jul 2018 (this version, v4)]

Title:ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

Authors:Dinesh Jayaraman, Ruohan Gao, Kristen Grauman

View PDF

Abstract:We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation. The main idea is a self-supervised training objective that, given only a single 2D image, requires all unseen views of the object to be predictable from learned features. We implement this idea as an encoder-decoder convolutional neural network. The network maps an input image of an unknown category and unknown viewpoint to a latent space, from which a deconvolutional decoder can best "lift" the image to its complete viewgrid showing the object from all viewing angles. Our class-agnostic training procedure encourages the representation to capture fundamental shape primitives and semantic regularities in a data-driven manner---without manual semantic labels. Our results on two widely-used shape datasets show 1) our approach successfully learns to perform "mental rotation" even for objects unseen during training, and 2) the learned latent space is a powerful representation for object recognition, outperforming several existing unsupervised feature learning methods.

Comments:	To appear at ECCV 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1709.00505 [cs.CV]
	(or arXiv:1709.00505v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1709.00505

Submission history

From: Dinesh Jayaraman [view email]
[v1] Fri, 1 Sep 2017 23:15:28 UTC (2,910 KB)
[v2] Sat, 28 Apr 2018 03:34:11 UTC (4,963 KB)
[v3] Tue, 15 May 2018 04:17:28 UTC (4,964 KB)
[v4] Tue, 31 Jul 2018 03:02:06 UTC (2,733 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators