Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Chen, Jian; Zhang, Ruiyi; Zhou, Yufan; Jain, Rajiv; Xu, Zhiqiang; Rossi, Ryan; Chen, Changyou

Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.04754 (cs)

[Submitted on 7 Feb 2024 (v1), last revised 15 May 2024 (this version, v2)]

Title:Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Authors:Jian Chen, Ruiyi Zhang, Yufan Zhou, Rajiv Jain, Zhiqiang Xu, Ryan Rossi, Changyou Chen

View PDF HTML (experimental)

Abstract:Controllable layout generation refers to the process of creating a plausible visual arrangement of elements within a graphic design (e.g., document and web designs) with constraints representing design intentions. Although recent diffusion-based models have achieved state-of-the-art FID scores, they tend to exhibit more pronounced misalignment compared to earlier transformer-based models. In this work, we propose the $\textbf{LA}$yout $\textbf{C}$onstraint diffusion mod$\textbf{E}$l (LACE), a unified model to handle a broad range of layout generation tasks, such as arranging elements with specified attributes and refining or completing a coarse layout design. The model is based on continuous diffusion models. Compared with existing methods that use discrete diffusion models, continuous state-space design can enable the incorporation of differentiable aesthetic constraint functions in training. For conditional generation, we introduce conditions via masked input. Extensive experiment results show that LACE produces high-quality layouts and outperforms existing state-of-the-art baselines.

Comments:	Accepted by ICLR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2402.04754 [cs.CV]
	(or arXiv:2402.04754v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.04754

Submission history

From: Jian Chen [view email]
[v1] Wed, 7 Feb 2024 11:12:41 UTC (2,994 KB)
[v2] Wed, 15 May 2024 19:32:58 UTC (2,994 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators