Landmark Guidance Independent Spatio-channel Attention and Complementary Context Information based Facial Expression Recognition

Gera, Darshan; Balasubramanian, S

doi:10.1016/j.patrec.2021.01.029

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.10298 (cs)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 20 Jul 2020 (v1), last revised 25 Jul 2020 (this version, v2)]

Title:Landmark Guidance Independent Spatio-channel Attention and Complementary Context Information based Facial Expression Recognition

Authors:Darshan Gera, S Balasubramanian

View PDF

Abstract:A recent trend to recognize facial expressions in the real-world scenario is to deploy attention based convolutional neural networks (CNNs) locally to signify the importance of facial regions and, combine it with global facial features and/or other complementary context information for performance gain. However, in the presence of occlusions and pose variations, different channels respond differently, and further that the response intensity of a channel differ across spatial locations. Also, modern facial expression recognition(FER) architectures rely on external sources like landmark detectors for defining attention. Failure of landmark detector will have a cascading effect on FER. Additionally, there is no emphasis laid on the relevance of features that are input to compute complementary context information. Leveraging on the aforementioned observations, an end-to-end architecture for FER is proposed in this work that obtains both local and global attention per channel per spatial location through a novel spatio-channel attention net (SCAN), without seeking any information from the landmark detectors. SCAN is complemented by a complementary context information (CCI) branch. Further, using efficient channel attention (ECA), the relevance of features input to CCI is also attended to. The representation learnt by the proposed architecture is robust to occlusions and pose variations. Robustness and superior performance of the proposed model is demonstrated on both in-lab and in-the-wild datasets (AffectNet, FERPlus, RAF-DB, FED-RO, SFEW, CK+, Oulu-CASIA and JAFFE) along with a couple of constructed face mask datasets resembling masked faces in COVID-19 scenario. Codes are publicly available at this https URL

Comments:	A couple of reference citations corrected, few details added and code link provided
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.10298 [cs.CV]
	(or arXiv:2007.10298v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.10298
Journal reference:	Pattern Recognition Letters 145 (2021)
Related DOI:	https://doi.org/10.1016/j.patrec.2021.01.029

Submission history

From: Darshan Gera [view email]
[v1] Mon, 20 Jul 2020 17:33:32 UTC (3,592 KB)
[v2] Sat, 25 Jul 2020 14:50:25 UTC (4,564 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Landmark Guidance Independent Spatio-channel Attention and Complementary Context Information based Facial Expression Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Landmark Guidance Independent Spatio-channel Attention and Complementary Context Information based Facial Expression Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators