Coverage and Quality Driven Training of Generative Image Models

Shmelkov, Konstantin; Lucas, Thomas; Alahari, Karteek; Schmid, Cordelia; Verbeek, Jakob

Computer Science > Computer Vision and Pattern Recognition

arXiv:1901.01091v1 (cs)

[Submitted on 4 Jan 2019 (this version), latest version 3 Jan 2020 (v3)]

Title:Coverage and Quality Driven Training of Generative Image Models

Authors:Konstantin Shmelkov, Thomas Lucas, Karteek Alahari, Cordelia Schmid, Jakob Verbeek

View PDF

Abstract:Generative modeling of natural images has been extensively studied in recent years, yielding remarkable progress. Current state-of-the-art methods are either based on maximum likelihood estimation or adversarial training. Both methods have their own drawbacks, which are complementary in nature. The first leads to over-generalization as the maximum likelihood criterion encourages models to cover the support of the training data by heavily penalizing small masses assigned to training data. Simplifying assumptions in such models limits their capacity and makes them spill mass on unrealistic samples. The second leads to mode-dropping since adversarial training encourages high quality samples from the model, but only indirectly enforces diversity among the samples. To overcome these drawbacks we make two contributions. First, we propose a novel extension to the variational autoencoders model by using deterministic invertible transformation layers to map samples from the decoder to the image space. This induces correlations among the pixels given the latent variables, improving over commonly used factorial decoders. Second, we propose a training approach that leverages coverage and quality based criteria. Our models obtain likelihood scores competitive with state-of-the-art likelihood-based models, while achieving sample quality typical of adversarially trained networks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1901.01091 [cs.CV]
	(or arXiv:1901.01091v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1901.01091

Submission history

From: Thomas Lucas [view email]
[v1] Fri, 4 Jan 2019 13:43:18 UTC (2,845 KB)
[v2] Fri, 1 Mar 2019 14:47:20 UTC (1,892 KB)
[v3] Fri, 3 Jan 2020 15:03:37 UTC (7,126 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Coverage and Quality Driven Training of Generative Image Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Coverage and Quality Driven Training of Generative Image Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators