Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Žbontar, Jure; LeCun, Yann

arXiv:1510.05970 (cs)

[Submitted on 20 Oct 2015 (v1), last revised 18 May 2016 (this version, v2)]

Abstract: We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.
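
To make the matching cost stage concrete, the following is a minimal sketch of how a patch similarity score can initialize a cost volume for a rectified pair. It is not the authors' implementation: the `similarity` function here is a hand-written normalized cross-correlation standing in for the trained network, all function names are hypothetical, and the post-processing steps listed in the abstract are omitted. Disparities are read off with a plain winner-takes-all rule, which is an assumption made only to keep the example short.

```python
import math

def extract_patch(image, y, x, radius):
    """Return the (2*radius+1)^2 pixels centred on (y, x) as a flat list."""
    return [image[j][i]
            for j in range(y - radius, y + radius + 1)
            for i in range(x - radius, x + radius + 1)]

def similarity(patch_a, patch_b):
    """Stand-in for a learned score: normalized cross-correlation of two patches."""
    mean_a = sum(patch_a) / len(patch_a)
    mean_b = sum(patch_b) / len(patch_b)
    a = [v - mean_a for v in patch_a]
    b = [v - mean_b for v in patch_b]
    norm = math.sqrt(sum(v * v for v in a) * sum(v * v for v in b))
    if norm == 0.0:
        return 0.0  # flat patches carry no matching evidence
    return sum(p * q for p, q in zip(a, b)) / norm

def matching_cost(left, right, max_disp, radius=1):
    """cost[y][x][d] = -similarity(left patch at x, right patch at x - d)."""
    height, width = len(left), len(left[0])
    cost = [[[math.inf] * (max_disp + 1) for _ in range(width)]
            for _ in range(height)]
    for y in range(radius, height - radius):
        for x in range(radius, width - radius):
            left_patch = extract_patch(left, y, x, radius)
            for d in range(max_disp + 1):
                if x - d - radius < 0:
                    break  # the right patch would leave the image
                right_patch = extract_patch(right, y, x - d, radius)
                cost[y][x][d] = -similarity(left_patch, right_patch)
    return cost

def winner_takes_all(cost, y, x):
    """Pick the disparity with the lowest matching cost at pixel (y, x)."""
    costs = cost[y][x]
    return min(range(len(costs)), key=costs.__getitem__)

# Synthetic rectified pair: the right image is the left image shifted by
# two pixels, so the true disparity is 2 everywhere.
left = [[float(x * x + y) for x in range(12)] for y in range(5)]
right = [[float((x + 2) ** 2 + y) for x in range(12)] for y in range(5)]
cost = matching_cost(left, right, max_disp=4)
print(winner_takes_all(cost, 2, 5))  # prints 2
```

In the method the abstract describes, the hand-written score would be replaced by the output of the trained network, and the resulting cost volume would then pass through the aggregation and refinement steps rather than being read off directly.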

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1510.05970 [cs.CV]
	(or arXiv:1510.05970v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1510.05970
Journal reference:	JMLR 17(65):1-32, 2016

Submission history

From: Jure Žbontar
[v1] Tue, 20 Oct 2015 17:15:05 UTC (2,585 KB)
[v2] Wed, 18 May 2016 19:53:41 UTC (2,591 KB)
