PosterLlama: Bridging Design Ability of Language Model to Contents-Aware Layout Generation

Seol, Jaejung; Kim, Seojun; Yoo, Jaejun

arXiv:2404.00995 (cs)

[Submitted on 1 Apr 2024 (v1), last revised 28 Jul 2024 (this version, v3)]

Abstract: Visual layout plays a critical role in graphic design fields such as advertising, posters, and web UI design. The recent trend towards content-aware layout generation through generative models has shown promise, yet it often overlooks the semantic intricacies of layout design by treating it as a simple numerical optimization. To bridge this gap, we introduce PosterLlama, a network designed for generating visually and textually coherent layouts by reformatting layout elements into HTML code and leveraging the rich design knowledge embedded within language models. Furthermore, we enhance the robustness of our model with a unique depth-based poster augmentation strategy. This ensures our generated layouts remain not only semantically rich but also visually appealing, even with limited data. Our extensive evaluations across several benchmarks demonstrate that PosterLlama outperforms existing methods in producing authentic and content-aware layouts. It supports an unparalleled range of conditions, including unconditional layout generation, element-conditional layout generation, and layout completion, among others, serving as a highly versatile user manipulation tool.
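
As an illustration of what "reformatting layout elements into HTML code" could look like, here is a minimal Python sketch. The abstract does not specify the serialization format, so everything below is an assumption made for illustration: the `LayoutElement` fields, the `layout_to_html` helper, the use of `<div>` tags with inline pixel styles, and the category labels are all hypothetical and are not taken from the PosterLlama paper or code.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class LayoutElement:
    """One layout box. Field names are hypothetical, chosen for this sketch."""
    category: str  # illustrative label such as "text" or "logo"
    left: int      # x of the top-left corner, in pixels
    top: int       # y of the top-left corner, in pixels
    width: int     # box width, in pixels
    height: int    # box height, in pixels


def layout_to_html(elements, canvas_width, canvas_height):
    """Serialize layout boxes into a small HTML string.

    The tag and attribute choices here are assumptions, not the format
    used by the authors.
    """
    lines = [f'<body style="width:{canvas_width}px;height:{canvas_height}px">']
    for el in elements:
        style = (
            f"left:{el.left}px;top:{el.top}px;"
            f"width:{el.width}px;height:{el.height}px"
        )
        # Escape the category so arbitrary labels cannot break the markup.
        lines.append(f'  <div class="{escape(el.category)}" style="{style}"></div>')
    lines.append("</body>")
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [
        LayoutElement("text", 10, 20, 100, 30),
        LayoutElement("logo", 200, 40, 64, 64),
    ]
    print(layout_to_html(demo, canvas_width=513, canvas_height=750))
```

A text form like this turns layout generation into a sequence prediction problem over markup, which is the kind of input a language model can read and complete; masking some attribute values would correspond to conditional tasks such as layout completion.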

Comments:	ECCV 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.00995 [cs.CV]
	(or arXiv:2404.00995v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.00995

Submission history

From: Jaejung Seol
[v1] Mon, 1 Apr 2024 08:46:35 UTC (4,618 KB)
[v2] Tue, 2 Apr 2024 05:16:55 UTC (4,618 KB)
[v3] Sun, 28 Jul 2024 08:27:46 UTC (11,347 KB)