Neural Discrete Representation Learning

Oord, Aaron van den; Vinyals, Oriol; Kavukcuoglu, Koray

Computer Science > Machine Learning

arXiv:1711.00937 (cs)

[Submitted on 2 Nov 2017 (v1), last revised 30 May 2018 (this version, v2)]

Title:Neural Discrete Representation Learning

Authors:Aaron van den Oord, Oriol Vinyals, Koray Kavukcuoglu

View PDF

Abstract:Learning useful representations without supervision remains a key challenge in machine learning. In this paper, we propose a simple yet powerful generative model that learns such discrete representations. Our model, the Vector Quantised-Variational AutoEncoder (VQ-VAE), differs from VAEs in two key ways: the encoder network outputs discrete, rather than continuous, codes; and the prior is learnt rather than static. In order to learn a discrete latent representation, we incorporate ideas from vector quantisation (VQ). Using the VQ method allows the model to circumvent issues of "posterior collapse" -- where the latents are ignored when they are paired with a powerful autoregressive decoder -- typically observed in the VAE framework. Pairing these representations with an autoregressive prior, the model can generate high quality images, videos, and speech as well as doing high quality speaker conversion and unsupervised learning of phonemes, providing further evidence of the utility of the learnt representations.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1711.00937 [cs.LG]
	(or arXiv:1711.00937v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1711.00937

Submission history

From: Aäron van den Oord [view email]
[v1] Thu, 2 Nov 2017 21:14:44 UTC (6,182 KB)
[v2] Wed, 30 May 2018 14:58:27 UTC (7,283 KB)

Computer Science > Machine Learning

Title:Neural Discrete Representation Learning

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Neural Discrete Representation Learning

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators