Learning Structural Weight Uncertainty for Sequential Decision-Making

Zhang, Ruiyi; Li, Chunyuan; Chen, Changyou; Carin, Lawrence

Statistics > Machine Learning

arXiv:1801.00085 (stat)

[Submitted on 30 Dec 2017 (v1), last revised 2 Apr 2018 (this version, v2)]

Title:Learning Structural Weight Uncertainty for Sequential Decision-Making

Authors:Ruiyi Zhang, Chunyuan Li, Changyou Chen, Lawrence Carin

View PDF

Abstract:Learning probability distributions on the weights of neural networks (NNs) has recently proven beneficial in many applications. Bayesian methods, such as Stein variational gradient descent (SVGD), offer an elegant framework to reason about NN model uncertainty. However, by assuming independent Gaussian priors for the individual NN weights (as often applied), SVGD does not impose prior knowledge that there is often structural information (dependence) among weights. We propose efficient posterior learning of structural weight uncertainty, within an SVGD framework, by employing matrix variate Gaussian priors on NN parameters. We further investigate the learned structural uncertainty in sequential decision-making problems, including contextual bandits and reinforcement learning. Experiments on several synthetic and real datasets indicate the superiority of our model, compared with state-of-the-art methods.

Comments:	Accepted by AISTATS 2018
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1801.00085 [stat.ML]
	(or arXiv:1801.00085v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1801.00085

Submission history

From: Ruiyi Zhang [view email]
[v1] Sat, 30 Dec 2017 04:34:34 UTC (1,759 KB)
[v2] Mon, 2 Apr 2018 01:06:13 UTC (2,398 KB)

Statistics > Machine Learning

Title:Learning Structural Weight Uncertainty for Sequential Decision-Making

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Learning Structural Weight Uncertainty for Sequential Decision-Making

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators