Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax

Potapczynski, Andres; Loaiza-Ganem, Gabriel; Cunningham, John P.

Statistics > Machine Learning

arXiv:1912.09588v2 (stat)

[Submitted on 19 Dec 2019 (v1), revised 7 Feb 2020 (this version, v2), latest version 29 Aug 2022 (v5)]

Title:Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax

Authors:Andres Potapczynski, Gabriel Loaiza-Ganem, John P. Cunningham

View PDF

Abstract:The Gumbel-Softmax is a continuous distribution over the simplex that is often used as a relaxation of discrete distributions. Because it can be readily interpreted and easily reparameterized, it enjoys widespread use. Unfortunately, we show that the cost of this aesthetic interpretability is material: the temperature hyperparameter must be set too high, KL estimates are noisy, and as a result, performance suffers. We circumvent the previous issues by proposing a much simpler and more flexible reparameterizable family of distributions that transforms Gaussian noise into a one-hot approximation through an invertible function. This invertible function is composed of a modified softmax and can incorporate diverse transformations that serve different specific purposes. For example, the stick-breaking procedure allows us to extend the reparameterization trick to distributions with countably infinite support, or normalizing flows let us increase the flexibility of the distribution. Our construction improves numerical stability and outperforms the Gumbel-Softmax in a variety of experiments while generating samples that are closer to their discrete counterparts and achieving lower-variance gradients.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1912.09588 [stat.ML]
	(or arXiv:1912.09588v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1912.09588

Submission history

From: Andres Potapczynski [view email]
[v1] Thu, 19 Dec 2019 23:11:39 UTC (535 KB)
[v2] Fri, 7 Feb 2020 19:35:17 UTC (660 KB)
[v3] Thu, 11 Jun 2020 23:07:40 UTC (773 KB)
[v4] Mon, 26 Oct 2020 18:26:19 UTC (784 KB)
[v5] Mon, 29 Aug 2022 13:35:13 UTC (794 KB)

Statistics > Machine Learning

Title:Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators