Score-Based Generative Models Detect Manifolds

Pidstrigach, Jakiw


arXiv:2206.01018v1 (stat)

[Submitted on 2 Jun 2022 (this version), latest version 15 Oct 2022 (v3)]

Title: Score-Based Generative Models Detect Manifolds

Authors: Jakiw Pidstrigach


Abstract: Score-based generative models (SGMs) need to approximate the scores $\nabla \log p_t$ of the intermediate distributions as well as the final distribution $p_T$ of the forward process. The theoretical underpinnings of the effects of these approximations are still lacking. We find precise conditions under which SGMs are able to produce samples from an underlying (low-dimensional) data manifold $\mathcal{M}$. This assures us that SGMs are able to generate the "right kind of samples". For example, taking $\mathcal{M}$ to be the subset of images of faces, we find conditions under which the SGM robustly produces an image of a face, even though the relative frequencies of these images might not accurately represent the true data generating distribution. Moreover, this analysis is a first step towards understanding the generalization properties of SGMs: Taking $\mathcal{M}$ to be the set of all training samples, our results provide a precise description of when the SGM memorizes its training data.
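
The memorization case mentioned at the end of the abstract can be illustrated with a toy sketch. It is not taken from the paper: it assumes an Ornstein-Uhlenbeck forward process $dX_t = -\tfrac{1}{2} X_t\,dt + dW_t$, uses the exact score of the noised empirical distribution instead of a learned network, replaces $p_T$ with a standard normal, and takes $\mathcal{M}$ to be eight hypothetical training points on the unit circle. The function names `empirical_score` and `reverse_sample` are illustrative only. Under these assumptions, an Euler-Maruyama discretization of the reverse process should end close to one of the training points.

```python
import numpy as np

def empirical_score(x, data, t):
    """Exact score of p_t when p_0 is the empirical distribution of `data`.

    Assumes the OU forward process dX = -X/2 dt + dW, for which
    X_t | X_0 = x_i is Gaussian with mean exp(-t/2) * x_i and
    variance (1 - exp(-t)) * I.
    """
    alpha = np.exp(-0.5 * t)
    sigma2 = 1.0 - np.exp(-t)
    diffs = alpha * data[None, :, :] - x[:, None, :]      # shape (M, N, d)
    logw = -np.sum(diffs**2, axis=-1) / (2.0 * sigma2)    # shape (M, N)
    logw -= logw.max(axis=1, keepdims=True)               # numerical stability
    w = np.exp(logw)
    w /= w.sum(axis=1, keepdims=True)                     # posterior weights
    return np.einsum("mn,mnd->md", w, diffs) / sigma2

def reverse_sample(data, n_samples=200, T=5.0, eps=1e-4, n_steps=500, seed=0):
    """Euler-Maruyama for the reverse SDE, run from t = T down to t = eps."""
    rng = np.random.default_rng(seed)
    # Geometric grid: steps shrink near t = 0, where the score grows like 1/t.
    times = np.geomspace(T, eps, n_steps + 1)
    # A standard normal stands in for p_T (an approximation, as in the abstract).
    y = rng.standard_normal((n_samples, data.shape[1]))
    for t_hi, t_lo in zip(times[:-1], times[1:]):
        dt = t_hi - t_lo
        drift = 0.5 * y + empirical_score(y, data, t_hi)
        y = y + dt * drift + np.sqrt(dt) * rng.standard_normal(y.shape)
    return y

# Hypothetical training set: eight points on the unit circle.
angles = np.linspace(0.0, 2.0 * np.pi, 8, endpoint=False)
train = np.stack([np.cos(angles), np.sin(angles)], axis=1)

samples = reverse_sample(train)
# Distance from each generated sample to its nearest training point.
nearest = np.linalg.norm(samples[:, None, :] - train[None, :, :], axis=-1).min(axis=1)
print("largest distance to a training point:", nearest.max())
```

Because the exact empirical score is used here, the sketch only shows the limiting behaviour; the conditions under which an approximate score still lands on $\mathcal{M}$ are the subject of the paper itself.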

Comments:	19 pages, 4 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA); Probability (math.PR)
MSC classes:	68T99
ACM classes:	I.2.0
Cite as:	arXiv:2206.01018 [stat.ML]
	(or arXiv:2206.01018v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2206.01018

Submission history

From: Jakiw Pidstrigach
[v1] Thu, 2 Jun 2022 12:29:10 UTC (4,308 KB)
[v2] Wed, 17 Aug 2022 09:18:51 UTC (12,119 KB)
[v3] Sat, 15 Oct 2022 12:54:18 UTC (12,119 KB)
