A Closer Look at Parameter-Efficient Tuning in Diffusion Models

Xiang, Chendong; Bao, Fan; Li, Chongxuan; Su, Hang; Zhu, Jun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.18181 (cs)

[Submitted on 31 Mar 2023 (v1), last revised 12 Apr 2023 (this version, v2)]

Title:A Closer Look at Parameter-Efficient Tuning in Diffusion Models

Authors:Chendong Xiang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu

View PDF

Abstract:Large-scale diffusion models like Stable Diffusion are powerful and find various real-world applications while customizing such models by fine-tuning is both memory and time inefficient. Motivated by the recent progress in natural language processing, we investigate parameter-efficient tuning in large diffusion models by inserting small learnable modules (termed adapters). In particular, we decompose the design space of adapters into orthogonal factors -- the input position, the output position as well as the function form, and perform Analysis of Variance (ANOVA), a classical statistical approach for analyzing the correlation between discrete (design options) and continuous variables (evaluation metrics). Our analysis suggests that the input position of adapters is the critical factor influencing the performance of downstream tasks. Then, we carefully study the choice of the input position, and we find that putting the input position after the cross-attention block can lead to the best performance, validated by additional visualization analyses. Finally, we provide a recipe for parameter-efficient tuning in diffusion models, which is comparable if not superior to the fully fine-tuned baseline (e.g., DreamBooth) with only 0.75 \% extra parameters, across various customized tasks.

Comments:	8pages, now our code is available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2303.18181 [cs.CV]
	(or arXiv:2303.18181v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.18181

Submission history

From: Chendong Xiang [view email]
[v1] Fri, 31 Mar 2023 16:23:29 UTC (16,885 KB)
[v2] Wed, 12 Apr 2023 14:41:12 UTC (16,885 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Closer Look at Parameter-Efficient Tuning in Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Closer Look at Parameter-Efficient Tuning in Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators