Classifier-free Guidance with Adaptive Scaling

Malarz, Dawid; Kasymov, Artur; Zięba, Maciej; Tabor, Jacek; Spurek, Przemysław

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.10574 (cs)

[Submitted on 14 Feb 2025]

Title:Classifier-free Guidance with Adaptive Scaling

Authors:Dawid Malarz, Artur Kasymov, Maciej Zięba, Jacek Tabor, Przemysław Spurek

View PDF HTML (experimental)

Abstract:Classifier-free guidance (CFG) is an essential mechanism in contemporary text-driven diffusion models. In practice, in controlling the impact of guidance we can see the trade-off between the quality of the generated images and correspondence to the prompt. When we use strong guidance, generated images fit the conditioned text perfectly but at the cost of their quality. Dually, we can use small guidance to generate high-quality results, but the generated images do not suit our prompt. In this paper, we present $\beta$-CFG ($\beta$-adaptive scaling in Classifier-Free Guidance), which controls the impact of guidance during generation to solve the above trade-off. First, $\beta$-CFG stabilizes the effects of guiding by gradient-based adaptive normalization. Second, $\beta$-CFG uses the family of single-modal ($\beta$-distribution), time-dependent curves to dynamically adapt the trade-off between prompt matching and the quality of samples during the diffusion denoising process. Our model obtained better FID scores, maintaining the text-to-image CLIP similarity scores at a level similar to that of the reference CFG.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.10574 [cs.CV]
	(or arXiv:2502.10574v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.10574

Submission history

From: Przemysław Spurek [view email]
[v1] Fri, 14 Feb 2025 22:04:53 UTC (42,782 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Classifier-free Guidance with Adaptive Scaling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Classifier-free Guidance with Adaptive Scaling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators