RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild

Berral-Soler, Rafael; Madrid-Cuevas, Francisco J.; Muñoz-Salinas, Rafael; Marín-Jiménez, Manuel J.

arXiv:2011.01890 (cs)

[Submitted on 3 Nov 2020]

Abstract: Human head pose estimation in images has applications in many fields, such as human-computer interaction and video surveillance. In this work, we address this problem, defined here as the estimation of both the vertical (tilt/pitch) and horizontal (pan/yaw) angles, using a single Convolutional Neural Network (ConvNet) model, aiming to balance precision and inference speed in order to maximize its usability in real-world applications. Our model is trained on the combination of two datasets: 'Pointing'04' (to cover a wide range of poses) and 'Annotated Facial Landmarks in the Wild' (to improve the robustness of the model on real-world images). Three different partitions of the combined dataset are defined and used for training, validation and testing. As a result of this work, we have obtained a trained ConvNet model, named RealHePoNet, that, given a low-resolution grayscale input image and without the need for facial landmarks, estimates both tilt and pan angles with low error (~4.4° average error on the test partition). In addition, given its low inference time (~6 ms per head), we consider our model usable even when paired with medium-spec hardware (e.g., a GTX 1060 GPU). * Code available at: this https URL * Demo video at: this https URL
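The abstract reports a single average error figure for the tilt and pan estimates. As a rough illustration only, the sketch below shows one common way such a figure could be computed: the mean absolute error per angle, then the mean of the two. The abstract does not state the exact metric definition, so this averaging scheme, the function name `mean_absolute_errors`, and the sample values are all assumptions, not the authors' evaluation code.

```python
from typing import Sequence, Tuple

# A pose is a (tilt, pan) pair in degrees. This layout is an assumption
# made for illustration; the abstract does not specify an output format.
Pose = Tuple[float, float]


def mean_absolute_errors(
    predicted: Sequence[Pose], actual: Sequence[Pose]
) -> Tuple[float, float, float]:
    """Return (tilt MAE, pan MAE, mean of the two) in degrees."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(predicted)
    # Absolute difference per sample, averaged separately for each angle.
    tilt_mae = sum(abs(p[0] - a[0]) for p, a in zip(predicted, actual)) / n
    pan_mae = sum(abs(p[1] - a[1]) for p, a in zip(predicted, actual)) / n
    # Assumed aggregation: unweighted mean over the two angles.
    return tilt_mae, pan_mae, (tilt_mae + pan_mae) / 2.0


# Hypothetical predictions and ground truth, not data from the paper.
predicted = [(10.0, -30.0), (0.0, 15.0)]
actual = [(15.0, -30.0), (-3.0, 5.0)]
tilt_mae, pan_mae, overall = mean_absolute_errors(predicted, actual)
print(tilt_mae, pan_mae, overall)
```

Reporting the per-angle errors alongside the combined value makes it visible when one angle is estimated much less accurately than the other, which a single averaged figure would hide.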

Comments:	Accepted for publication at Neural Computing and Applications
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2011.01890 [cs.CV]
	(or arXiv:2011.01890v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.01890

Submission history

From: Manuel Marin-Jimenez
[v1] Tue, 3 Nov 2020 18:09:05 UTC (23,837 KB)
