Theory and Experiments on Vector Quantized Autoencoders

Roy, Aurko; Vaswani, Ashish; Neelakantan, Arvind; Parmar, Niki

Computer Science > Machine Learning

arXiv:1805.11063 (cs)

[Submitted on 28 May 2018 (v1), last revised 20 Jul 2018 (this version, v2)]

Title:Theory and Experiments on Vector Quantized Autoencoders

Authors:Aurko Roy, Ashish Vaswani, Arvind Neelakantan, Niki Parmar

View PDF

Abstract:Deep neural networks with discrete latent variables offer the promise of better symbolic reasoning, and learning abstractions that are more useful to new tasks. There has been a surge in interest in discrete latent variable models, however, despite several recent improvements, the training of discrete latent variable models has remained challenging and their performance has mostly failed to match their continuous counterparts. Recent work on vector quantized autoencoders (VQ-VAE) has made substantial progress in this direction, with its perplexity almost matching that of a VAE on datasets such as CIFAR-10. In this work, we investigate an alternate training technique for VQ-VAE, inspired by its connection to the Expectation Maximization (EM) algorithm. Training the discrete bottleneck with EM helps us achieve better image generation results on CIFAR-10, and together with knowledge distillation, allows us to develop a non-autoregressive machine translation model whose accuracy almost matches a strong greedy autoregressive baseline Transformer, while being 3.3 times faster at inference.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.11063 [cs.LG]
	(or arXiv:1805.11063v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.11063

Submission history

From: Aurko Roy [view email]
[v1] Mon, 28 May 2018 17:16:20 UTC (310 KB)
[v2] Fri, 20 Jul 2018 06:55:09 UTC (310 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Aurko Roy
Ashish Vaswani
Arvind Neelakantan
Niki Parmar

export BibTeX citation

Computer Science > Machine Learning

Title:Theory and Experiments on Vector Quantized Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Theory and Experiments on Vector Quantized Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators