Combating Mode Collapse in GANs via Manifold Entropy Estimation

Liu, Haozhe; Li, Bing; Wu, Haoqian; Liang, Hanbang; Huang, Yawen; Li, Yuexiang; Ghanem, Bernard; Zheng, Yefeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2208.12055v5 (cs)

[Submitted on 25 Aug 2022 (v1), revised 1 Feb 2023 (this version, v5), latest version 8 Apr 2023 (v6)]

Title:Combating Mode Collapse in GANs via Manifold Entropy Estimation

Authors:Haozhe Liu, Bing Li, Haoqian Wu, Hanbang Liang, Yawen Huang, Yuexiang Li, Bernard Ghanem, Yefeng Zheng

View PDF

Abstract:Generative Adversarial Networks (GANs) have shown compelling results in various tasks and applications in recent years. However, mode collapse remains a critical problem in GANs. In this paper, we propose a novel training pipeline to address the mode collapse issue of GANs. Different from existing methods, we propose to generalize the discriminator as feature embedding and maximize the entropy of distributions in the embedding space learned by the discriminator. Specifically, two regularization terms, i.e., Deep Local Linear Embedding (DLLE) and Deep Isometric feature Mapping (DIsoMap), are designed to encourage the discriminator to learn the structural information embedded in the data, such that the embedding space learned by the discriminator can be well-formed. Based on the well-learned embedding space supported by the discriminator, a non-parametric entropy estimator is designed to efficiently maximize the entropy of embedding vectors, playing as an approximation of maximizing the entropy of the generated distribution. By improving the discriminator and maximizing the distance of the most similar samples in the embedding space, our pipeline effectively reduces the mode collapse without sacrificing the quality of generated samples. Extensive experimental results show the effectiveness of our method, which outperforms the GAN baseline, MaF-GAN on CelebA (9.13 vs. 12.43 in FID) and surpasses the recent state-of-the-art energy-based model on the ANIME-FACE dataset (2.80 vs. 2.26 in Inception score). The code is available at this https URL

Comments:	Accepted by AAAI'2023 (Oral); Code is released at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.12055 [cs.CV]
	(or arXiv:2208.12055v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2208.12055

Submission history

From: Haozhe Liu [view email]
[v1] Thu, 25 Aug 2022 12:33:31 UTC (3,460 KB)
[v2] Wed, 23 Nov 2022 09:26:33 UTC (3,460 KB)
[v3] Fri, 9 Dec 2022 11:40:02 UTC (2,976 KB)
[v4] Wed, 11 Jan 2023 17:10:10 UTC (2,976 KB)
[v5] Wed, 1 Feb 2023 09:56:45 UTC (2,976 KB)
[v6] Sat, 8 Apr 2023 11:03:02 UTC (2,976 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Combating Mode Collapse in GANs via Manifold Entropy Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Combating Mode Collapse in GANs via Manifold Entropy Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators