Self-Attention Generative Adversarial Networks

Zhang, Han; Goodfellow, Ian; Metaxas, Dimitris; Odena, Augustus

Statistics > Machine Learning

arXiv:1805.08318 (stat)

[Submitted on 21 May 2018 (v1), last revised 14 Jun 2019 (this version, v2)]

Title:Self-Attention Generative Adversarial Networks

Authors:Han Zhang, Ian Goodfellow, Dimitris Metaxas, Augustus Odena

View PDF

Abstract:In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover, the discriminator can check that highly detailed features in distant portions of the image are consistent with each other. Furthermore, recent work has shown that generator conditioning affects GAN performance. Leveraging this insight, we apply spectral normalization to the GAN generator and find that this improves training dynamics. The proposed SAGAN achieves the state-of-the-art results, boosting the best published Inception score from 36.8 to 52.52 and reducing Frechet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset. Visualization of the attention layers shows that the generator leverages neighborhoods that correspond to object shapes rather than local regions of fixed shape.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1805.08318 [stat.ML]
	(or arXiv:1805.08318v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.08318

Submission history

From: Augustus Odena [view email]
[v1] Mon, 21 May 2018 23:10:35 UTC (7,292 KB)
[v2] Fri, 14 Jun 2019 18:20:10 UTC (7,439 KB)

Statistics > Machine Learning

Title:Self-Attention Generative Adversarial Networks

Submission history

Access Paper:

References & Citations

3 blog links

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Self-Attention Generative Adversarial Networks

Submission history

Access Paper:

References & Citations

3 blog links

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators