DDP: Diffusion Model for Dense Visual Prediction

Ji, Yuanfeng; Chen, Zhe; Xie, Enze; Hong, Lanqing; Liu, Xihui; Liu, Zhaoqiang; Lu, Tong; Li, Zhenguo; Luo, Ping

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.17559 (cs)

[Submitted on 30 Mar 2023 (v1), last revised 13 May 2023 (this version, v2)]

Title:DDP: Diffusion Model for Dense Visual Prediction

Authors:Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo

View PDF

Abstract:We propose a simple, efficient, yet powerful framework for dense visual predictions based on the conditional diffusion pipeline. Our approach follows a "noise-to-map" generative paradigm for prediction by progressively removing noise from a random Gaussian distribution, guided by the image. The method, called DDP, efficiently extends the denoising diffusion process into the modern perception pipeline. Without task-specific design and architecture customization, DDP is easy to generalize to most dense prediction tasks, e.g., semantic segmentation and depth estimation. In addition, DDP shows attractive properties such as dynamic inference and uncertainty awareness, in contrast to previous single-step discriminative methods. We show top results on three representative tasks with six diverse benchmarks, without tricks, DDP achieves state-of-the-art or competitive performance on each task compared to the specialist counterparts. For example, semantic segmentation (83.9 mIoU on Cityscapes), BEV map segmentation (70.6 mIoU on nuScenes), and depth estimation (0.05 REL on KITTI). We hope that our approach will serve as a solid baseline and facilitate future research

Comments:	Added controlnet exp
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.17559 [cs.CV]
	(or arXiv:2303.17559v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.17559

Submission history

From: Ji Yuanfeng [view email]
[v1] Thu, 30 Mar 2023 17:26:50 UTC (12,313 KB)
[v2] Sat, 13 May 2023 11:38:59 UTC (14,523 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DDP: Diffusion Model for Dense Visual Prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DDP: Diffusion Model for Dense Visual Prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators