What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

Humayun, Ahmed Imtiaz; Amara, Ibtihel; Vasconcelos, Cristina; Ramachandran, Deepak; Schumann, Candice; He, Junfeng; Heller, Katherine; Farnadi, Golnoosh; Rostamzadeh, Negar; Havaei, Mohammad

arXiv:2408.08307 (cs)

[Submitted on 15 Aug 2024 (v1), last revised 6 Feb 2025 (this version, v2)]

Abstract: Deep generative models are frequently used to learn continuous representations of complex data distributions from a finite number of samples. For any generative model, including pre-trained foundation models with Diffusion or Transformer architectures, generation performance can vary significantly across the learned data manifold. In this paper, we study the local geometry of the learned manifold and its relationship to generation outcomes for a wide range of generative models, including DDPM, Diffusion Transformer (DiT), and Stable Diffusion 1.4. Building on the theory of continuous piecewise-linear (CPWL) generators, we characterize the local geometry in terms of three geometric descriptors: scaling ($\psi$), rank ($\nu$), and complexity/un-smoothness ($\delta$). We provide quantitative and qualitative evidence showing that, for a given latent-image pair, the local descriptors are indicative of generation aesthetics, diversity, and memorization by the generative model. Finally, we demonstrate that by training a reward model on the local scaling for Stable Diffusion, we can self-improve both generation aesthetics and diversity using 'geometry reward'-based guidance during denoising.
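
To make the idea of local descriptors for a CPWL generator concrete, here is a minimal sketch on a toy one-hidden-layer ReLU network. The abstract gives no formulas, so everything here is an assumption for illustration: the toy generator and all function names are hypothetical, local scaling is taken to be the sum of log non-zero singular values of the input-output Jacobian at a latent point, and local rank is taken to be the number of non-zero singular values. Local complexity ($\delta$) is not covered.

```python
import numpy as np

def make_generator(latent_dim=4, hidden_dim=16, out_dim=8, seed=0):
    """Hypothetical toy generator: a one-hidden-layer ReLU MLP, which is CPWL."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(size=(hidden_dim, latent_dim))
    b1 = rng.normal(size=hidden_dim)
    W2 = rng.normal(size=(out_dim, hidden_dim))
    b2 = rng.normal(size=out_dim)
    return W1, b1, W2, b2

def generate(params, z):
    """Map a latent vector z to an output vector."""
    W1, b1, W2, b2 = params
    return W2 @ np.maximum(W1 @ z + b1, 0.0) + b2

def local_jacobian(params, z):
    """Jacobian of the generator on the linear region that contains z.

    Inside one region the ReLU on/off pattern is fixed, so the map is
    affine and its slope is W2 @ diag(active) @ W1.
    """
    W1, b1, W2, _ = params
    active = (W1 @ z + b1 > 0).astype(float)
    return W2 @ (active[:, None] * W1)

def local_descriptors(params, z, rel_tol=1e-8):
    """Return (scaling, rank) at z under the assumed definitions above."""
    s = np.linalg.svd(local_jacobian(params, z), compute_uv=False)
    nonzero = s[s > rel_tol * s.max()]
    scaling = float(np.sum(np.log(nonzero)))  # assumed: sum of log singular values
    rank = int(nonzero.size)                  # assumed: count of non-zero singular values
    return scaling, rank

params = make_generator()
z = np.random.default_rng(1).normal(size=4)
psi, nu = local_descriptors(params, z)
print(f"local scaling (assumed definition): {psi:.3f}, local rank: {nu}")
```

For a large diffusion model the Jacobian would not be available in closed form like this; it would have to be obtained or approximated by automatic differentiation, which this sketch does not attempt.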

Comments:	Accepted for publication at ICLR 2025
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.08307 [cs.LG]
	(or arXiv:2408.08307v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.08307

Submission history

From: Ahmed Imtiaz Humayun
[v1] Thu, 15 Aug 2024 17:59:06 UTC (39,756 KB)
[v2] Thu, 6 Feb 2025 09:30:06 UTC (13,207 KB)
