Finding the global semantic representation in GAN through Frechet Mean

Choi, Jaewoong; Hwang, Geonho; Cho, Hyunsoo; Kang, Myungjoo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.05509 (cs)

[Submitted on 11 Oct 2022 (v1), last revised 23 Apr 2023 (this version, v2)]

Title:Finding the global semantic representation in GAN through Frechet Mean

Authors:Jaewoong Choi, Geonho Hwang, Hyunsoo Cho, Myungjoo Kang

View PDF

Abstract:The ideally disentangled latent space in GAN involves the global representation of latent space with semantic attribute coordinates. In other words, considering that this disentangled latent space is a vector space, there exists the global semantic basis where each basis component describes one attribute of generated images. In this paper, we propose an unsupervised method for finding this global semantic basis in the intermediate latent space in GANs. This semantic basis represents sample-independent meaningful perturbations that change the same semantic attribute of an image on the entire latent space. The proposed global basis, called Fréchet basis, is derived by introducing Fréchet mean to the local semantic perturbations in a latent space. Fréchet basis is discovered in two stages. First, the global semantic subspace is discovered by the Fréchet mean in the Grassmannian manifold of the local semantic subspaces. Second, Fréchet basis is found by optimizing a basis of the semantic subspace via the Fréchet mean in the Special Orthogonal Group. Experimental results demonstrate that Fréchet basis provides better semantic factorization and robustness compared to the previous methods. Moreover, we suggest the basis refinement scheme for the previous methods. The quantitative experiments show that the refined basis achieves better semantic factorization while constrained on the same semantic subspace given by the previous method.

Comments:	25 pages, 21 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2210.05509 [cs.CV]
	(or arXiv:2210.05509v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.05509

Submission history

From: Jaewoong Choi [view email]
[v1] Tue, 11 Oct 2022 15:01:25 UTC (12,694 KB)
[v2] Sun, 23 Apr 2023 09:30:56 UTC (47,075 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Finding the global semantic representation in GAN through Frechet Mean

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Finding the global semantic representation in GAN through Frechet Mean

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators