Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data

Szymanowicz, Stanislaw; Rupprecht, Christian; Vedaldi, Andrea


arXiv:2306.07881 (cs)

[Submitted on 13 Jun 2023 (v1), last revised 1 Sep 2023 (this version, v2)]


Abstract: We present Viewset Diffusion, a diffusion-based generator that outputs 3D objects while using only multi-view 2D data for supervision. We note that there exists a one-to-one mapping between viewsets, i.e., collections of several 2D views of an object, and 3D models. Hence, we train a diffusion model to generate viewsets, but design the neural network generator to internally reconstruct the corresponding 3D models, thus generating those too. We fit a diffusion model to a large number of viewsets for a given category of objects. The resulting generator can be conditioned on zero, one or more input views. Conditioned on a single view, it performs 3D reconstruction while accounting for the ambiguity of the task, so that multiple solutions compatible with the input can be sampled. The model performs reconstruction efficiently, in a feed-forward manner, and is trained with rendering losses only, using as few as three views per viewset. Project page: this http URL.
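The data flow described in the abstract (noise a viewset, reconstruct a 3D model inside the denoiser, render it back to views, and supervise with a rendering loss) can be illustrated with a toy sketch. This is not the authors' implementation. All names here (`render_views`, `backproject`, `add_noise`, `denoise_viewset`, `rendering_loss`) are hypothetical, the learned network is replaced by a fixed back-projection, rendering is a simple orthographic average along each axis, and the idea of keeping a conditioning view clean by giving it a noise level of zero is an assumption made for illustration.

```python
import numpy as np


def render_views(volume):
    """Toy orthographic renderer: average the volume along each of its 3 axes."""
    return np.stack([volume.mean(axis=a) for a in range(3)])


def backproject(views):
    """Stand-in for the learned reconstructor (assumption, not a neural network):
    smear each view back along its viewing axis and average the three volumes."""
    n = views.shape[-1]
    vols = [np.repeat(np.expand_dims(views[a], axis=a), n, axis=a) for a in range(3)]
    return np.mean(vols, axis=0)


def add_noise(views, alpha_bar, rng):
    """Forward diffusion with one noise level per view.
    alpha_bar[i] == 1.0 leaves view i clean (used here as a conditioning view)."""
    eps = rng.standard_normal(views.shape)
    a = np.asarray(alpha_bar, dtype=float).reshape(-1, 1, 1)
    return np.sqrt(a) * views + np.sqrt(1.0 - a) * eps


def denoise_viewset(noisy_views):
    """Denoiser that goes through 3D: views -> volume -> rendered views."""
    volume = backproject(noisy_views)
    return volume, render_views(volume)


def rendering_loss(pred_views, clean_views):
    """Mean squared error between rendered and ground-truth views."""
    return float(np.mean((pred_views - clean_views) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    gt_volume = rng.random((8, 8, 8))          # only used to create 2D training views
    clean = render_views(gt_volume)            # a viewset with three views
    noisy = add_noise(clean, [1.0, 0.3, 0.3], rng)  # first view kept clean
    volume, pred = denoise_viewset(noisy)
    print("volume shape:", volume.shape)
    print("rendering loss:", rendering_loss(pred, clean))
```

In this sketch the 3D volume is never supervised directly; the loss only compares rendered views with the clean 2D views, and the volume is obtained as a by-product of denoising.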

Comments:	International Conference on Computer Vision 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.07881 [cs.CV]
	(or arXiv:2306.07881v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.07881

Submission history

From: Stanislaw Szymanowicz
[v1] Tue, 13 Jun 2023 16:18:51 UTC (7,364 KB)
[v2] Fri, 1 Sep 2023 11:09:36 UTC (10,850 KB)
