Data-Efficient Image Recognition with Contrastive Predictive Coding

Hénaff, Olivier J.; Razavi, Ali; Doersch, Carl; Eslami, S. M. Ali; Oord, Aaron van den

Computer Science > Computer Vision and Pattern Recognition

arXiv:1905.09272v1 (cs)

[Submitted on 22 May 2019 (this version), latest version 1 Jul 2020 (v3)]

Title:Data-Efficient Image Recognition with Contrastive Predictive Coding

Authors:Olivier J. Hénaff, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord

View PDF

Abstract:Large scale deep learning excels when labeled images are abundant, yet data-efficient learning remains a longstanding challenge. While biological vision is thought to leverage vast amounts of unlabeled data to solve classification problems with limited supervision, computer vision has so far not succeeded in this `semi-supervised' regime. Our work tackles this challenge with Contrastive Predictive Coding, an unsupervised objective which extracts stable structure from still images. The result is a representation which, equipped with a simple linear classifier, separates ImageNet categories better than all competing methods, and surpasses the performance of a fully-supervised AlexNet model. When given a small number of labeled images (as few as 13 per class), this representation retains a strong classification performance, outperforming state-of-the-art semi-supervised methods by 10% Top-5 accuracy and supervised methods by 20%. Finally, we find our unsupervised representation to serve as a useful substrate for image detection on the PASCAL-VOC 2007 dataset, approaching the performance of representations trained with a fully annotated ImageNet dataset. We expect these results to open the door to pipelines that use scalable unsupervised representations as a drop-in replacement for supervised ones for real-world vision tasks where labels are scarce.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1905.09272 [cs.CV]
	(or arXiv:1905.09272v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.09272

Submission history

From: Aäron van den Oord [view email]
[v1] Wed, 22 May 2019 17:57:49 UTC (7,632 KB)
[v2] Fri, 6 Dec 2019 18:35:23 UTC (1,930 KB)
[v3] Wed, 1 Jul 2020 11:22:05 UTC (1,869 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Data-Efficient Image Recognition with Contrastive Predictive Coding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Data-Efficient Image Recognition with Contrastive Predictive Coding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators