Holistic 3D Human and Scene Mesh Estimation from Single View Images

Weng, Zhenzhen; Yeung, Serena

Computer Science > Computer Vision and Pattern Recognition

arXiv:2012.01591 (cs)

[Submitted on 2 Dec 2020 (v1), last revised 16 Apr 2021 (this version, v2)]

Title:Holistic 3D Human and Scene Mesh Estimation from Single View Images

Authors:Zhenzhen Weng, Serena Yeung

View PDF

Abstract:The 3D world limits the human body pose and the human body pose conveys information about the surrounding objects. Indeed, from a single image of a person placed in an indoor scene, we as humans are adept at resolving ambiguities of the human pose and room layout through our knowledge of the physical laws and prior perception of the plausible object and human poses. However, few computer vision models fully leverage this fact. In this work, we propose an end-to-end trainable model that perceives the 3D scene from a single RGB image, estimates the camera pose and the room layout, and reconstructs both human body and object meshes. By imposing a set of comprehensive and sophisticated losses on all aspects of the estimations, we show that our model outperforms existing human body mesh methods and indoor scene reconstruction methods. To the best of our knowledge, this is the first model that outputs both object and human predictions at the mesh level, and performs joint optimization on the scene and human poses.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2012.01591 [cs.CV]
	(or arXiv:2012.01591v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2012.01591

Submission history

From: Zhenzhen Weng [view email]
[v1] Wed, 2 Dec 2020 23:22:03 UTC (12,024 KB)
[v2] Fri, 16 Apr 2021 17:30:41 UTC (13,066 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2020-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Serena Yeung

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Holistic 3D Human and Scene Mesh Estimation from Single View Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Holistic 3D Human and Scene Mesh Estimation from Single View Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators