Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Lee, Inhee; Kim, Byungjun; Joo, Hanbyul

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.14410 (cs)

[Submitted on 22 Apr 2024]

Title:Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Authors:Inhee Lee, Byungjun Kim, Hanbyul Joo

View PDF HTML (experimental)

Abstract:In this paper, we present a method to reconstruct the world and multiple dynamic humans in 3D from a monocular video input. As a key idea, we represent both the world and multiple humans via the recently emerging 3D Gaussian Splatting (3D-GS) representation, enabling to conveniently and efficiently compose and render them together. In particular, we address the scenarios with severely limited and sparse observations in 3D human reconstruction, a common challenge encountered in the real world. To tackle this challenge, we introduce a novel approach to optimize the 3D-GS representation in a canonical space by fusing the sparse cues in the common space, where we leverage a pre-trained 2D diffusion model to synthesize unseen views while keeping the consistency with the observed 2D appearances. We demonstrate our method can reconstruct high-quality animatable 3D humans in various challenging examples, in the presence of occlusion, image crops, few-shot, and extremely sparse observations. After reconstruction, our method is capable of not only rendering the scene in any novel views at arbitrary time instances, but also editing the 3D scene by removing individual humans or applying different motions for each human. Through various experiments, we demonstrate the quality and efficiency of our methods over alternative existing approaches.

Comments:	The project page is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.14410 [cs.CV]
	(or arXiv:2404.14410v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.14410

Submission history

From: Inhee Lee [view email]
[v1] Mon, 22 Apr 2024 17:59:50 UTC (13,114 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators