TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition

Chen, Yongwei; Chen, Rui; Lei, Jiabao; Zhang, Yabin; Jia, Kui

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.11277v1 (cs)

[Submitted on 20 Oct 2022 (this version), latest version 3 Nov 2022 (v2)]

Title:TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition

Authors:Yongwei Chen, Rui Chen, Jiabao Lei, Yabin Zhang, Kui Jia

View PDF

Abstract:Creation of 3D content by stylization is a promising yet challenging problem in computer vision and graphics research. In this work, we focus on stylizing photorealistic appearance renderings of a given surface mesh of arbitrary topology. Motivated by the recent surge of cross-modal supervision of the Contrastive Language-Image Pre-training (CLIP) model, we propose TANGO, which transfers the appearance style of a given 3D shape according to a text prompt in a photorealistic manner. Technically, we propose to disentangle the appearance style as the spatially varying bidirectional reflectance distribution function, the local geometric variation, and the lighting condition, which are jointly optimized, via supervision of the CLIP loss, by a spherical Gaussians based differentiable renderer. As such, TANGO enables photorealistic 3D style transfer by automatically predicting reflectance effects even for bare, low-quality meshes, without training on a task-specific dataset. Extensive experiments show that TANGO outperforms existing methods of text-driven 3D style transfer in terms of photorealistic quality, consistency of 3D geometry, and robustness when stylizing low-quality meshes. Our codes and results are available at our project webpage this https URL.

Comments:	Accepted by NeurIPS 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.11277 [cs.CV]
	(or arXiv:2210.11277v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.11277

Submission history

From: Yongwei Chen [view email]
[v1] Thu, 20 Oct 2022 13:52:18 UTC (6,641 KB)
[v2] Thu, 3 Nov 2022 05:09:34 UTC (6,481 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators