Disrupting Deep Uncertainty Estimation Without Harming Accuracy

Galil, Ido; El-Yaniv, Ran

arXiv:2110.13741 (cs)

[Submitted on 26 Oct 2021]

Abstract: Deep neural networks (DNNs) have proven to be powerful predictors and are widely used for various tasks. Credible uncertainty estimation of their predictions, however, is crucial for their deployment in many risk-sensitive applications. In this paper we present a novel and simple attack which, unlike adversarial attacks, does not cause incorrect predictions but instead cripples the network's capacity for uncertainty estimation. As a result, after the attack the DNN is more confident of its incorrect predictions than of its correct ones, while its accuracy is not reduced. We present two versions of the attack: the first targets a black-box regime (where the attacker has no knowledge of the target network) and the second targets a white-box setting. Perturbations of minuscule magnitude are enough to cause severe damage to uncertainty estimation, and larger magnitudes render the uncertainty estimates completely unusable. We demonstrate successful attacks on three of the most popular uncertainty estimation methods: the vanilla softmax score, Deep Ensembles and MC-Dropout. Additionally, we show an attack on SelectiveNet, the selective classification architecture. We test the proposed attack on several contemporary architectures such as MobileNetV2 and EfficientNetB0, all trained to classify ImageNet.
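To make the idea in the abstract concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a toy linear softmax classifier with known weights `W` (so it corresponds loosely to the white-box setting), uses the maximum softmax probability as the confidence score, and takes a signed-gradient step whose size is halved until the predicted label is preserved. The function names, the step rule and the backoff loop are all assumptions introduced here for illustration; the abstract does not specify them.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict(W, x):
    """Return (predicted class, softmax confidence) for the linear model W @ x."""
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in W]
    p = softmax(logits)
    k = max(range(len(p)), key=p.__getitem__)
    return k, p[k]

def confidence_gradient(W, x, k):
    """Gradient of p_k with respect to x: p_k * (W_k - sum_j p_j W_j)."""
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in W]
    p = softmax(logits)
    mean_row = [sum(p[j] * W[j][i] for j in range(len(W))) for i in range(len(x))]
    return [p[k] * (W[k][i] - mean_row[i]) for i in range(len(x))]

def perturb_confidence(W, x, label, eps):
    """Hypothetical helper: shift confidence without changing the prediction.

    Correct predictions are pushed toward lower confidence, incorrect ones
    toward higher confidence. The step is halved until the predicted class
    is unchanged, so accuracy is preserved by construction.
    """
    k, _ = predict(W, x)
    grad = confidence_gradient(W, x, k)
    direction = -1.0 if k == label else 1.0
    step = eps
    while step > 1e-12:
        x_adv = [xi + direction * step * ((g > 0) - (g < 0))
                 for xi, g in zip(x, grad)]
        if predict(W, x_adv)[0] == k:
            return x_adv
        step /= 2.0
    return list(x)  # no label-preserving step found; leave the input unchanged

# Toy usage: two classes, identity weights (all values are made up).
W = [[1.0, 0.0], [0.0, 1.0]]
x_correct, x_wrong = [2.0, 0.5], [0.6, 0.4]   # true labels: 0 and 1
adv_correct = perturb_confidence(W, x_correct, label=0, eps=0.7)
adv_wrong = perturb_confidence(W, x_wrong, label=1, eps=0.7)
print("correct sample:", predict(W, x_correct), "->", predict(W, adv_correct))
print("wrong sample:  ", predict(W, x_wrong), "->", predict(W, adv_wrong))
```

In this toy setup both predicted labels stay the same, but the confidence ordering flips: the misclassified sample ends up with a higher softmax score than the correctly classified one, which is the kind of effect the abstract describes.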

Comments:	To be published in NeurIPS 2021
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2110.13741 [cs.LG]
	(or arXiv:2110.13741v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.13741
Journal reference:	Neural Information Processing Systems Conference (2021)

Submission history

From: Ido Galil
[v1] Tue, 26 Oct 2021 14:44:00 UTC (3,552 KB)
