Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Tiwari, Hitika; Chen, Min-Hung; Tsai, Yi-Min; Kuo, Hsien-Kai; Chen, Hung-Jen; Jou, Kevin; Venkatesh, K. S.; Chen, Yong-Sheng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2112.14382 (cs)

[Submitted on 29 Dec 2021 (v1), last revised 21 Oct 2022 (this version, v3)]

Title:Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Authors:Hitika Tiwari, Min-Hung Chen, Yi-Min Tsai, Hsien-Kai Kuo, Hung-Jen Chen, Kevin Jou, K. S. Venkatesh, Yong-Sheng Chen

View PDF

Abstract:Despite the recent developments in 3D Face Reconstruction from occluded and noisy face images, the performance is still unsatisfactory. Moreover, most existing methods rely on additional dependencies, posing numerous constraints over the training procedure. Therefore, we propose a Self-Supervised RObustifying GUidancE (ROGUE) framework to obtain robustness against occlusions and noise in the face images. The proposed network contains 1) the Guidance Pipeline to obtain the 3D face coefficients for the clean faces and 2) the Robustification Pipeline to acquire the consistency between the estimated coefficients for occluded or noisy images and the clean counterpart. The proposed image- and feature-level loss functions aid the ROGUE learning process without posing additional dependencies. To facilitate model evaluation, we propose two challenging occlusion face datasets, ReaChOcc and SynChOcc, containing real-world and synthetic occlusion-based face images for robustness evaluation. Also, a noisy variant of the test dataset of CelebA is produced for evaluation. Our method outperforms the current state-of-the-art method by large margins (e.g., for the perceptual errors, a reduction of 23.8% for real-world occlusions, 26.4% for synthetic occlusions, and 22.7% for noisy images), demonstrating the effectiveness of the proposed approach. The occlusion datasets and the corresponding evaluation code are released publicly at this https URL.

Comments:	Accepted by The 33rd British Machine Vision Conference (BMVC) 2022. Evaluation code and datasets: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.14382 [cs.CV]
	(or arXiv:2112.14382v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.14382

Submission history

From: Hitika Tiwari [view email]
[v1] Wed, 29 Dec 2021 03:30:50 UTC (28,519 KB)
[v2] Fri, 14 Oct 2022 07:49:29 UTC (35,614 KB)
[v3] Fri, 21 Oct 2022 04:38:14 UTC (35,614 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators