RAD: Region-Aware Diffusion Models for Image Inpainting

Kim, Sora; Suh, Sungho; Lee, Minsik

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.09191 (cs)

[Submitted on 12 Dec 2024 (v1), last revised 19 Dec 2024 (this version, v3)]

Title:RAD: Region-Aware Diffusion Models for Image Inpainting

Authors:Sora Kim, Sungho Suh, Minsik Lee

View PDF HTML (experimental)

Abstract:Diffusion models have achieved remarkable success in image generation, with applications broadening across various domains. Inpainting is one such application that can benefit significantly from diffusion models. Existing methods either hijack the reverse process of a pretrained diffusion model or cast the problem into a larger framework, \ie, conditioned generation. However, these approaches often require nested loops in the generation process or additional components for conditioning. In this paper, we present region-aware diffusion models (RAD) for inpainting with a simple yet effective reformulation of the vanilla diffusion models. RAD utilizes a different noise schedule for each pixel, which allows local regions to be generated asynchronously while considering the global image context. A plain reverse process requires no additional components, enabling RAD to achieve inference time up to 100 times faster than the state-of-the-art approaches. Moreover, we employ low-rank adaptation (LoRA) to fine-tune RAD based on other pretrained diffusion models, reducing computational burdens in training as well. Experiments demonstrated that RAD provides state-of-the-art results both qualitatively and quantitatively, on the FFHQ, LSUN Bedroom, and ImageNet datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.09191 [cs.CV]
	(or arXiv:2412.09191v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.09191

Submission history

From: Sora Kim [view email]
[v1] Thu, 12 Dec 2024 11:38:46 UTC (45,489 KB)
[v2] Tue, 17 Dec 2024 05:21:52 UTC (45,490 KB)
[v3] Thu, 19 Dec 2024 02:44:14 UTC (35,465 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RAD: Region-Aware Diffusion Models for Image Inpainting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RAD: Region-Aware Diffusion Models for Image Inpainting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators