Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Ryu, Hyogon; Lim, Seohyun; Shim, Hyunjung

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.04339 (cs)

[Submitted on 9 Jan 2024 (v1), last revised 18 Jul 2024 (this version, v2)]

Title:Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Authors:Hyogon Ryu, Seohyun Lim, Hyunjung Shim

View PDF HTML (experimental)

Abstract:The emergence of billion-parameter diffusion models such as Stable Diffusion XL, Imagen, and DALL-E 3 has significantly propelled the domain of generative AI. However, their large-scale architecture presents challenges in fine-tuning and deployment due to high resource demands and slow inference speed. This paper explores the relatively unexplored yet promising realm of fine-tuning quantized diffusion models. Our analysis revealed that the baseline neglects the distinct patterns in model weights and the different roles throughout time steps when finetuning the diffusion model. To address these limitations, we introduce a novel memory-efficient fine-tuning method specifically designed for quantized diffusion models, dubbed TuneQDM. Our approach introduces quantization scales as separable functions to consider inter-channel weight patterns. Then, it optimizes these scales in a timestep-specific manner for effective reflection of the role of each time step. TuneQDM achieves performance on par with its full-precision counterpart while simultaneously offering significant memory efficiency. Experimental results demonstrate that our method consistently outperforms the baseline in both single-/multi-subject generations, exhibiting high subject fidelity and prompt fidelity comparable to the full precision model.

Comments:	Accepted by ECCV2024. Code will be released at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.04339 [cs.CV]
	(or arXiv:2401.04339v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.04339

Submission history

From: Hyogon Ryu [view email]
[v1] Tue, 9 Jan 2024 03:42:08 UTC (8,640 KB)
[v2] Thu, 18 Jul 2024 11:38:17 UTC (4,866 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators