Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

Pomponi, Jary; Scardapane, Simone; Uncini, Aurelio

doi:10.1016/j.neucom.2021.01.090

Computer Science > Machine Learning

arXiv:2003.00952 (cs)

[Submitted on 2 Mar 2020 (v1), last revised 30 Sep 2020 (this version, v2)]

Title:Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

Authors:Jary Pomponi, Simone Scardapane, Aurelio Uncini

View PDF

Abstract:Bayesian Neural Networks (BNNs) are trained to optimize an entire distribution over their weights instead of a single set, having significant advantages in terms of, e.g., interpretability, multi-task learning, and calibration. Because of the intractability of the resulting optimization problem, most BNNs are either sampled through Monte Carlo methods, or trained by minimizing a suitable Evidence Lower BOund (ELBO) on a variational approximation. In this paper, we propose a variant of the latter, wherein we replace the Kullback-Leibler divergence in the ELBO term with a Maximum Mean Discrepancy (MMD) estimator, inspired by recent work in variational inference. After motivating our proposal based on the properties of the MMD term, we proceed to show a number of empirical advantages of the proposed formulation over the state-of-the-art. In particular, our BNNs achieve higher accuracy on multiple benchmarks, including several image classification tasks. In addition, they are more robust to the selection of a prior over the weights, and they are better calibrated. As a second contribution, we provide a new formulation for estimating the uncertainty on a given prediction, showing it performs in a more robust fashion against adversarial attacks and the injection of noise over their inputs, compared to more classical criteria such as the differential entropy.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2003.00952 [cs.LG]
	(or arXiv:2003.00952v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.00952
Related DOI:	https://doi.org/10.1016/j.neucom.2021.01.090

Submission history

From: Jary Pomponi [view email]
[v1] Mon, 2 Mar 2020 14:54:48 UTC (1,555 KB)
[v2] Wed, 30 Sep 2020 09:56:44 UTC (1,557 KB)

Computer Science > Machine Learning

Title:Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators