A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models

Arenz, Oleg; Dahlinger, Philipp; Ye, Zihan; Volpp, Michael; Neumann, Gerhard

arXiv:2209.11533 (cs)

[Submitted on 23 Sep 2022 (v1), last revised 17 Jul 2023 (this version, v2)]

Abstract: Variational inference with Gaussian mixture models (GMMs) enables learning of highly tractable yet multi-modal approximations of intractable target distributions with up to a few hundred dimensions. The two currently most effective methods for GMM-based variational inference, VIPS and iBayes-GMM, both employ independent natural gradient updates for the individual components and their weights. We show for the first time that their derived updates are equivalent, although their practical implementations and theoretical guarantees differ. We identify several design choices that distinguish the two approaches, namely with respect to sample selection, natural gradient estimation, stepsize adaptation, and whether trust regions are enforced or the number of components is adapted. We argue that for both approaches, the quality of the learned approximations can suffer heavily from the respective design choices: by updating the individual components using samples from the mixture model, iBayes-GMM often fails to produce meaningful updates to low-weight components, and by using a zero-order method for estimating the natural gradient, VIPS scales badly to higher-dimensional problems. Furthermore, we show that information-geometric trust regions (used by VIPS) are effective even when using first-order natural gradient estimates, and often outperform the improved Bayesian learning rule (iBLR) update used by iBayes-GMM. We systematically evaluate the effects of design choices and show that a hybrid approach significantly outperforms both prior works. Along with this work, we publish our highly modular and efficient implementation for natural gradient variational inference with Gaussian mixture models, which supports 432 different combinations of design choices, facilitates the reproduction of all our experiments, and may prove valuable for the practitioner.
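
One possible intuition for the abstract's remark about low-weight components: when samples are drawn from the mixture as a whole, a component with a small weight contributes only a small share of them. The sketch below is an illustration only, not the estimator used by iBayes-GMM or VIPS. The function name `samples_per_component`, the example weights, and the sample budget are all hypothetical choices made for this example.

```python
import numpy as np

def samples_per_component(weights, n_samples, seed=0):
    """Draw component indices from a mixture and count the draws per component.

    Hypothetical helper for illustration; not part of the paper's code.
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize, in case the inputs do not sum to 1
    rng = np.random.default_rng(seed)
    # Sampling from a mixture first picks a component according to its weight.
    idx = rng.choice(len(weights), size=n_samples, p=weights)
    return np.bincount(idx, minlength=len(weights))

# Assumed example: two dominant components and one low-weight component.
weights = [0.499, 0.499, 0.002]
counts = samples_per_component(weights, n_samples=1000)
print("expected draws per component:", 1000 * np.asarray(weights))
print("actual draws per component:  ", counts)
```

With these assumed weights, only about two of the 1000 draws are expected to come from the third component, so any per-component estimate built from mixture samples has very little data near that component.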

Comments:	This version corresponds to the camera-ready version published in Transactions on Machine Learning Research (TMLR).
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:2209.11533 [cs.LG]
	(or arXiv:2209.11533v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.11533
Journal reference:	Transactions on Machine Learning Research (2023) ISSN: 2835-8856

Submission history

From: Oleg Arenz
[v1] Fri, 23 Sep 2022 11:43:27 UTC (585 KB)
[v2] Mon, 17 Jul 2023 10:02:55 UTC (825 KB)
