TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction

Liu, Yunfei; Zhu, Lei; Lin, Lijian; Zhu, Ye; Zhang, Ailing; Li, Yu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.10982 (cs)

[Submitted on 16 Feb 2025 (v1), last revised 18 Feb 2025 (this version, v2)]

Title:TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction

Authors:Yunfei Liu, Lei Zhu, Lijian Lin, Ye Zhu, Ailing Zhang, Yu Li

View PDF

Abstract:3D facial reconstruction from a single in-the-wild image is a crucial task in human-centered computer vision tasks. While existing methods can recover accurate facial shapes, there remains significant space for improvement in fine-grained expression capture. Current approaches struggle with irregular mouth shapes, exaggerated expressions, and asymmetrical facial movements. We present TEASER (Token EnhAnced Spatial modeling for Expressions Reconstruction), which addresses these challenges and enhances 3D facial geometry performance. TEASER tackles two main limitations of existing methods: insufficient photometric loss for self-reconstruction and inaccurate localization of subtle expressions. We introduce a multi-scale tokenizer to extract facial appearance information. Combined with a neural renderer, these tokens provide precise geometric guidance for expression reconstruction. Furthermore, TEASER incorporates a pose-dependent landmark loss to further improve geometric performances. Our approach not only significantly enhances expression reconstruction quality but also offers interpretable tokens suitable for various downstream applications, such as photorealistic facial video driving, expression transfer, and identity swapping. Quantitative and qualitative experimental results across multiple datasets demonstrate that TEASER achieves state-of-the-art performance in precise expression reconstruction.

Comments:	Accepted by ICLR 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2502.10982 [cs.CV]
	(or arXiv:2502.10982v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.10982

Submission history

From: Yunfei Liu [view email]
[v1] Sun, 16 Feb 2025 04:00:06 UTC (4,392 KB)
[v2] Tue, 18 Feb 2025 03:43:41 UTC (4,392 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators