Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

Li, Muyang; Lin, Ji; Meng, Chenlin; Ermon, Stefano; Han, Song; Zhu, Jun-Yan

arXiv:2211.02048v1 (cs)

[Submitted on 3 Nov 2022 (this version), latest version 13 Sep 2023 (v4)]

Abstract: During image editing, existing deep generative models tend to re-synthesize the entire output from scratch, including the unedited regions. This wastes a significant amount of computation, especially for minor edits. In this work, we present Spatially Sparse Inference (SSI), a general-purpose technique that selectively performs computation for edited regions and accelerates various generative models, including both conditional GANs and diffusion models. Our key observation is that users tend to make gradual changes to the input image. This motivates us to cache and reuse the feature maps of the original image. Given an edited image, we sparsely apply the convolutional filters to the edited regions while reusing the cached features for the unedited regions. Based on our algorithm, we further propose the Sparse Incremental Generative Engine (SIGE) to convert the computation reduction into latency reduction on off-the-shelf hardware. When the edited regions cover 1.2% of the image area, our method reduces the computation of DDIM by 7.5$\times$ and GauGAN by 18$\times$ while preserving visual fidelity. With SIGE, we speed up DDIM by 3.0$\times$ on an RTX 3090 and 6.6$\times$ on an Apple M1 Pro CPU, and GauGAN by 4.2$\times$ on an RTX 3090 and 14$\times$ on an Apple M1 Pro CPU.
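
The caching idea in the abstract can be illustrated with a toy sketch. The NumPy code below is not the SIGE implementation; it assumes a single-channel image, a single 3x3 convolution with zero padding, and a fixed block size, and all function names (`dense_conv3x3`, `sparse_conv3x3`, `conv3x3_region`) are hypothetical. It detects edited pixels by comparing the edited input with the original, recomputes only the output blocks that those pixels can influence, and copies every other block from the cached output.

```python
import numpy as np


def conv3x3_region(padded, k, y0, y1, x0, x1):
    """3x3 cross-correlation for output rows y0:y1 and columns x0:x1.

    `padded` is the input with a 1-pixel zero border, so output pixel
    (i, j) reads padded[i:i+3, j:j+3].
    """
    acc = np.zeros((y1 - y0, x1 - x0))
    for dy in range(3):
        for dx in range(3):
            acc += k[dy, dx] * padded[y0 + dy:y1 + dy, x0 + dx:x1 + dx]
    return acc


def dense_conv3x3(x, k):
    """Reference: recompute every output pixel."""
    h, w = x.shape
    return conv3x3_region(np.pad(x, 1), k, 0, h, 0, w)


def sparse_conv3x3(x_new, x_old, cached_out, k, block=4):
    """Recompute only the blocks touched by the edit; reuse the cache elsewhere.

    Returns the updated output and the number of blocks recomputed.
    """
    h, w = x_new.shape
    changed = x_new != x_old

    # An output pixel depends on a 3x3 input window, so dilate the
    # changed mask by one pixel to find every affected output pixel.
    pc = np.pad(changed, 1)
    affected = np.zeros_like(changed)
    for dy in range(3):
        for dx in range(3):
            affected |= pc[dy:dy + h, dx:dx + w]

    out = cached_out.copy()
    padded = np.pad(x_new, 1)
    n_blocks = 0
    for by in range(0, h, block):
        for bx in range(0, w, block):
            y1, x1 = min(by + block, h), min(bx + block, w)
            if affected[by:y1, bx:x1].any():
                out[by:y1, bx:x1] = conv3x3_region(padded, k, by, y1, bx, x1)
                n_blocks += 1
    return out, n_blocks


# Small demonstration: edit a 2x2 patch of a 16x16 image.
rng = np.random.default_rng(0)
kernel = rng.standard_normal((3, 3))
original = rng.standard_normal((16, 16))
cached = dense_conv3x3(original, kernel)

edited = original.copy()
edited[5:7, 5:7] += 1.0

sparse_out, n_blocks = sparse_conv3x3(edited, original, cached, kernel)
dense_out = dense_conv3x3(edited, kernel)
total_blocks = (16 // 4) ** 2
print(f"recomputed {n_blocks} of {total_blocks} blocks")
print("matches dense result:", np.allclose(sparse_out, dense_out))
```

In a multi-layer network the affected region grows with each layer's receptive field, and turning the saved arithmetic into lower latency depends on how the active blocks are gathered and scattered on the target hardware, which is the problem the abstract attributes to SIGE.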

Comments:	NeurIPS 2022. Website: this https URL. Code: this https URL.
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2211.02048 [cs.CV]
	(or arXiv:2211.02048v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.02048

Submission history

From: Muyang Li
[v1] Thu, 3 Nov 2022 17:59:55 UTC (7,014 KB)
[v2] Tue, 15 Nov 2022 23:05:18 UTC (45,667 KB)
[v3] Tue, 11 Apr 2023 19:08:09 UTC (37,577 KB)
[v4] Wed, 13 Sep 2023 20:32:50 UTC (24,905 KB)
