Optimized View and Geometry Distillation from Multi-view Diffuser

Zhang, Youjia; Song, Zikai; Yu, Junqing; Luo, Yawei; Yang, Wei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.06198 (cs)

[Submitted on 11 Dec 2023 (v1), last revised 8 Mar 2024 (this version, v3)]

Abstract: Generating multi-view images from a single input view with image-conditioned diffusion models is a recent advance that has shown considerable potential. However, two issues persist: the synthesized views lack consistency, and the extracted geometry is over-smoothed. Previous methods integrate multi-view consistency modules or impose additional supervision to enhance view consistency, but in doing so they compromise the flexibility of camera positioning and limit the versatility of view synthesis. In this study, we treat the radiance field optimized during geometry extraction as a more rigid consistency prior than the volume and ray aggregation used in previous works. We further identify and rectify a critical bias in the traditional process of optimizing a radiance field through score distillation from a multi-view diffuser. We introduce Unbiased Score Distillation (USD), which utilizes unconditioned noises from a 2D diffusion model and greatly refines the fidelity of the radiance field. Using the views rendered from the optimized radiance field as a basis, we develop a two-step specialization process for a 2D diffusion model, making it adept at object-specific denoising and at generating high-quality multi-view images. Finally, we recover faithful geometry and texture directly from the refined multi-view images. Empirical evaluations demonstrate that our optimized geometry and view distillation technique produces results comparable to those of state-of-the-art models trained on extensive datasets, while maintaining freedom in camera positioning. Please see our project page at this https URL.
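
The abstract does not give the USD formula, so the following is only an illustrative sketch of one plausible reading: conventional score distillation subtracts the injected Gaussian noise from the diffuser's noise prediction, and the assumed variant subtracts an unconditioned noise prediction instead. All function names, the toy predictor, and the placeholder for the unconditioned prediction are hypothetical and are not taken from the paper; the actual formulation may differ.

```python
import numpy as np

def add_noise(x0, eps, alpha_bar):
    """Forward diffusion: blend a clean image with Gaussian noise."""
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * eps

def sds_direction(eps_cond, eps, weight=1.0):
    """Conventional score distillation: predicted noise minus injected noise."""
    return weight * (eps_cond - eps)

def unconditioned_baseline_direction(eps_cond, eps_uncond, weight=1.0):
    """Assumed variant: subtract an unconditioned noise prediction instead."""
    return weight * (eps_cond - eps_uncond)

def toy_conditioned_predictor(x_t, target, alpha_bar):
    """Hypothetical stand-in for a diffuser that assumes the clean image is `target`."""
    return (x_t - np.sqrt(alpha_bar) * target) / np.sqrt(1.0 - alpha_bar)

rng = np.random.default_rng(0)
rendered = rng.normal(size=(4, 4))   # stand-in for a view rendered from the radiance field
target = rng.normal(size=(4, 4))     # image the toy diffuser prefers
eps = rng.normal(size=(4, 4))        # injected Gaussian noise
alpha_bar = 0.5                      # cumulative noise-schedule value at the sampled step

x_t = add_noise(rendered, eps, alpha_bar)
eps_cond = toy_conditioned_predictor(x_t, target, alpha_bar)

# Conventional direction; gradient descent along it moves `rendered` toward `target`.
g_sds = sds_direction(eps_cond, eps)

# Placeholder for an unconditioned prediction from a 2D diffusion model (assumption).
eps_uncond = eps + 0.1 * rng.normal(size=(4, 4))
g_alt = unconditioned_baseline_direction(eps_cond, eps_uncond)
```

In this toy setup the conventional direction reduces to a scaled difference between the rendered view and the preferred image, which makes the role of the subtracted term easy to inspect; a real pipeline would replace both predictors with trained networks and back-propagate the direction through the renderer.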

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.06198 [cs.CV]
	(or arXiv:2312.06198v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.06198

Submission history

From: Youjia Zhang
[v1] Mon, 11 Dec 2023 08:22:24 UTC (3,023 KB)
[v2] Sun, 17 Dec 2023 14:50:10 UTC (3,023 KB)
[v3] Fri, 8 Mar 2024 07:36:58 UTC (4,519 KB)
