Fast Diffusion Model

Wu, Zike; Zhou, Pan; Kawaguchi, Kenji; Zhang, Hanwang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.06991 (cs)

[Submitted on 12 Jun 2023 (v1), last revised 4 Oct 2023 (this version, v2)]

Title:Fast Diffusion Model

Authors:Zike Wu, Pan Zhou, Kenji Kawaguchi, Hanwang Zhang

View PDF

Abstract:Diffusion models (DMs) have been adopted across diverse fields with its remarkable abilities in capturing intricate data distributions. In this paper, we propose a Fast Diffusion Model (FDM) to significantly speed up DMs from a stochastic optimization perspective for both faster training and sampling. We first find that the diffusion process of DMs accords with the stochastic optimization process of stochastic gradient descent (SGD) on a stochastic time-variant problem. Then, inspired by momentum SGD that uses both gradient and an extra momentum to achieve faster and more stable convergence than SGD, we integrate momentum into the diffusion process of DMs. This comes with a unique challenge of deriving the noise perturbation kernel from the momentum-based diffusion process. To this end, we frame the process as a Damped Oscillation system whose critically damped state -- the kernel solution -- avoids oscillation and yields a faster convergence speed of the diffusion process. Empirical results show that our FDM can be applied to several popular DM frameworks, e.g., VP, VE, and EDM, and reduces their training cost by about 50% with comparable image synthesis performance on CIFAR-10, FFHQ, and AFHQv2 datasets. Moreover, FDM decreases their sampling steps by about 3x to achieve similar performance under the same samplers. The code is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2306.06991 [cs.CV]
	(or arXiv:2306.06991v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06991

Submission history

From: Zike Wu [view email]
[v1] Mon, 12 Jun 2023 09:38:04 UTC (4,194 KB)
[v2] Wed, 4 Oct 2023 09:10:03 UTC (4,202 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Fast Diffusion Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Fast Diffusion Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators