Failing to Learn: Autonomously Identifying Perception Failures for Self-driving Cars

Ramanagopal, Manikandasriram Srinivasan; Anderson, Cyrus; Vasudevan, Ram; Johnson-Roberson, Matthew

doi:10.1109/LRA.2018.2857402

Computer Science > Computer Vision and Pattern Recognition

arXiv:1707.00051 (cs)

[Submitted on 30 Jun 2017 (v1), last revised 26 Jul 2018 (this version, v4)]

Title:Failing to Learn: Autonomously Identifying Perception Failures for Self-driving Cars

Authors:Manikandasriram Srinivasan Ramanagopal, Cyrus Anderson, Ram Vasudevan, Matthew Johnson-Roberson

View PDF

Abstract:One of the major open challenges in self-driving cars is the ability to detect cars and pedestrians to safely navigate in the world. Deep learning-based object detector approaches have enabled great advances in using camera imagery to detect and classify objects. But for a safety critical application, such as autonomous driving, the error rates of the current state of the art are still too high to enable safe operation. Moreover, the characterization of object detector performance is primarily limited to testing on prerecorded datasets. Errors that occur on novel data go undetected without additional human labels. In this letter, we propose an automated method to identify mistakes made by object detectors without ground truth labels. We show that inconsistencies in the object detector output between a pair of similar images can be used as hypotheses for false negatives (e.g., missed detections) and using a novel set of features for each hypothesis, an off-the-shelf binary classifier can be used to find valid errors. In particular, we study two distinct cues - temporal and stereo inconsistencies - using data that are readily available on most autonomous vehicles. Our method can be used with any camera-based object detector and we illustrate the technique on several sets of real world data. We show that a state-of-the-art detector, tracker, and our classifier trained only on synthetic data can identify valid errors on KITTI tracking dataset with an average precision of 0.94. We also release a new tracking dataset with 104 sequences totaling 80,655 labeled pairs of stereo images along with ground truth disparity from a game engine to facilitate further research. The dataset and code are available at this https URL

Comments:	8 pages, 4 figures and 4 tables. Accepted for publication in RA-L and will be presented in IROS 2018 in Madrid, Spain
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:1707.00051 [cs.CV]
	(or arXiv:1707.00051v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1707.00051
Related DOI:	https://doi.org/10.1109/LRA.2018.2857402

Submission history

From: Manikandasriram Srinivasan Ramanagopal [view email]
[v1] Fri, 30 Jun 2017 21:42:47 UTC (7,632 KB)
[v2] Wed, 12 Jul 2017 01:58:46 UTC (7,632 KB)
[v3] Mon, 26 Mar 2018 19:09:44 UTC (11,133 KB)
[v4] Thu, 26 Jul 2018 19:41:39 UTC (5,758 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Failing to Learn: Autonomously Identifying Perception Failures for Self-driving Cars

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Failing to Learn: Autonomously Identifying Perception Failures for Self-driving Cars

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators