Dropout as a Structured Shrinkage Prior

Nalisnick, Eric; Hernández-Lobato, José Miguel; Smyth, Padhraic

arXiv:1810.04045 (stat)

[Submitted on 9 Oct 2018 (v1), last revised 29 May 2019 (this version, v3)]

Abstract: Dropout regularization of deep neural networks has been a mysterious yet effective tool to prevent overfitting. Explanations for its success range from the prevention of "co-adapted" weights to it being a form of cheap Bayesian inference. We propose a novel framework for understanding multiplicative noise in neural networks, considering continuous distributions as well as Bernoulli noise (i.e., dropout). We show that multiplicative noise induces structured shrinkage priors on a network's weights. We derive the equivalence through reparametrization properties of scale mixtures and without invoking any approximations. Given the equivalence, we then show that dropout's Monte Carlo training objective approximates marginal MAP estimation. We leverage these insights to propose a novel shrinkage framework for resnets, terming the prior 'automatic depth determination' as it is the natural analog of automatic relevance determination for network depth. Lastly, we investigate two inference strategies that improve upon the aforementioned MAP approximation in regression benchmarks.
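The scale-mixture reparametrization mentioned in the abstract can be illustrated with a small numerical sketch. It is not taken from the paper: it assumes a single scalar weight with a zero-mean Gaussian prior and Bernoulli multiplicative noise, and the function names, `sigma`, and `keep_prob` are hypothetical. It compares sampling the product z * v directly against sampling z first and then drawing the weight from a Gaussian whose scale is z * sigma; under these assumptions both routes should give matching summary statistics.

```python
import random
import statistics


def sample_multiplicative(n, sigma, keep_prob, rng):
    """Draw w = z * v with v ~ N(0, sigma^2) and z ~ Bernoulli(keep_prob)."""
    samples = []
    for _ in range(n):
        z = 1.0 if rng.random() < keep_prob else 0.0
        v = rng.gauss(0.0, sigma)
        samples.append(z * v)
    return samples


def sample_scale_mixture(n, sigma, keep_prob, rng):
    """Draw z first, then w | z ~ N(0, (z * sigma)^2).

    A scale of zero is treated as a point mass at zero.
    """
    samples = []
    for _ in range(n):
        z = 1.0 if rng.random() < keep_prob else 0.0
        scale = z * sigma
        samples.append(rng.gauss(0.0, scale) if scale > 0.0 else 0.0)
    return samples


def summarize(samples):
    """Return (fraction of exact zeros, population variance) of the samples."""
    zero_fraction = sum(1 for w in samples if w == 0.0) / len(samples)
    return zero_fraction, statistics.pvariance(samples)


if __name__ == "__main__":
    rng = random.Random(0)
    n, sigma, keep_prob = 200_000, 1.5, 0.8  # illustrative values only
    print("multiplicative noise:", summarize(sample_multiplicative(n, sigma, keep_prob, rng)))
    print("scale mixture:       ", summarize(sample_scale_mixture(n, sigma, keep_prob, rng)))
```

In this toy setting the zero fraction should be close to 1 - keep_prob and the variance close to keep_prob * sigma^2 for both samplers. The structured, per-unit priors over whole groups of weights that the abstract describes are beyond this scalar illustration.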

Comments:	ICML 2019
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1810.04045 [stat.ML]
	(or arXiv:1810.04045v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1810.04045

Submission history

From: Eric Nalisnick
[v1] Tue, 9 Oct 2018 14:44:08 UTC (137 KB)
[v2] Sun, 10 Feb 2019 14:35:37 UTC (862 KB)
[v3] Wed, 29 May 2019 14:01:20 UTC (1,170 KB)
