Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

Schulam, Peter; Saria, Suchi

Statistics > Machine Learning

arXiv:1901.00403 (stat)

[Submitted on 2 Jan 2019 (v1), last revised 28 Feb 2019 (this version, v2)]

Title:Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

Authors:Peter Schulam, Suchi Saria

View PDF

Abstract:To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability often require new learning algorithms (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a model after it is trained. In this paper, we describe resampling uncertainty estimation (RUE), an algorithm to audit the pointwise reliability of predictions. Intuitively, RUE estimates the amount that a prediction would change if the model had been fit on different training data. The algorithm uses the gradient and Hessian of the model's loss function to create an ensemble of predictions. Experimentally, we show that RUE more effectively detects inaccurate predictions than existing tools for auditing reliability subsequent to training. We also show that RUE can create predictive distributions that are competitive with state-of-the-art methods like Monte Carlo dropout, probabilistic backpropagation, and deep ensembles, but does not depend on specific algorithms at train-time like these methods do.

Comments:	To appear in the proceedings of Artificial Intelligence and Statistics (AISTATS) 2019
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:1901.00403 [stat.ML]
	(or arXiv:1901.00403v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1901.00403

Submission history

From: Peter Schulam [view email]
[v1] Wed, 2 Jan 2019 14:53:33 UTC (548 KB)
[v2] Thu, 28 Feb 2019 21:33:19 UTC (549 KB)

Statistics > Machine Learning

Title:Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators