LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Kang, Kyoungkook; Sim, Gyujin; Kim, Geonung; Kim, Donguk; Nam, Seungho; Cho, Sunghyun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.01197 (cs)

[Submitted on 2 Jan 2025]

Title:LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Authors:Kyoungkook Kang, Gyujin Sim, Geonung Kim, Donguk Kim, Seungho Nam, Sunghyun Cho

View PDF HTML (experimental)

Abstract:Layers have become indispensable tools for professional artists, allowing them to build a hierarchical structure that enables independent control over individual visual elements. In this paper, we propose LayeringDiff, a novel pipeline for the synthesis of layered images, which begins by generating a composite image using an off-the-shelf image generative model, followed by disassembling the image into its constituent foreground and background layers. By extracting layers from a composite image, rather than generating them from scratch, LayeringDiff bypasses the need for large-scale training to develop generative capabilities for individual layers. Furthermore, by utilizing a pretrained off-the-shelf generative model, our method can produce diverse contents and object scales in synthesized layers. For effective layer decomposition, we adapt a large-scale pretrained generative prior to estimate foreground and background layers. We also propose high-frequency alignment modules to refine the fine-details of the estimated layers. Our comprehensive experiments demonstrate that our approach effectively synthesizes layered images and supports various practical applications.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.01197 [cs.CV]
	(or arXiv:2501.01197v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.01197

Submission history

From: Kyoungkook Kang [view email]
[v1] Thu, 2 Jan 2025 11:18:25 UTC (2,996 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators