Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion

Yu, Jun; Zheng, Yang; Wang, Lei; Wang, Yongqi; Xu, Shengfan

arXiv:2503.11935 (cs)

[Submitted on 15 Mar 2025 (v1), last revised 21 Mar 2025 (this version, v3)]

Abstract: Facial expression recognition is a challenging classification task with broad application prospects in human-computer interaction. This paper introduces the method we will adopt in the 8th Affective and Behavioral Analysis in the Wild (ABAW) Competition, which will be held during the Conference on Computer Vision and Pattern Recognition (CVPR). First of all, we apply frequency masking and equal-time-interval data extraction to preprocess the original videos. Then, we design feature extraction models for image and audio sequences, based on a residual hybrid convolutional neural network and a multi-branch convolutional neural network, respectively. In particular, we propose a global channel-spatial attention mechanism to enhance the features initially extracted from both the audio and image modalities. We then adopt a decision fusion strategy based on the proportional criterion to fuse the classification results of the two single modalities, obtain an emotion probability vector, and output the final emotion classification. We also design a coarse-fine granularity loss function to optimize the performance of the entire network, which improves the accuracy of facial expression recognition. In the facial expression recognition task of the 8th ABAW Competition, our method ranked third on the official validation set. This result supports the effectiveness and competitiveness of the proposed method.
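
The abstract names three steps without giving their details: equal-interval extraction, frequency masking, and proportional decision fusion. The following is a minimal Python sketch of one plausible reading of each, not the authors' implementation. All function names and parameters are hypothetical. In particular, it assumes that equal-interval extraction means picking evenly spaced frame indices, that frequency masking zeroes one random band of spectrogram bins, and that the "proportional criterion" is a weighted sum of the two modality probability vectors with weights normalised to sum to 1; the abstract does not specify how those weights are chosen.

```python
import random


def equal_interval_indices(num_frames, num_samples):
    """Pick num_samples frame indices spaced evenly across a video (assumed reading)."""
    if num_samples <= 0 or num_frames <= 0:
        return []
    if num_samples >= num_frames:
        return list(range(num_frames))
    step = num_frames / num_samples
    # Take the frame at the start of each equal-length interval.
    return [int(i * step) for i in range(num_samples)]


def frequency_mask(spectrogram, max_width, rng=None):
    """Zero out one random band of consecutive frequency bins.

    spectrogram is a list of rows, one row per frequency bin.
    Returns a masked copy and the (start, width) of the masked band.
    """
    rng = rng or random.Random()
    num_bins = len(spectrogram)
    width = rng.randint(0, min(max_width, num_bins))
    start = rng.randint(0, num_bins - width)
    masked = [
        [0.0] * len(row) if start <= f < start + width else list(row)
        for f, row in enumerate(spectrogram)
    ]
    return masked, (start, width)


def proportional_fusion(p_image, p_audio, w_image, w_audio):
    """Fuse two class-probability vectors by a normalised weighted sum.

    The weights are an assumption of this sketch; the abstract does not
    say how the proportions are set.
    """
    total = w_image + w_audio
    a, b = w_image / total, w_audio / total
    fused = [a * pi + b * pa for pi, pa in zip(p_image, p_audio)]
    label = max(range(len(fused)), key=fused.__getitem__)
    return fused, label
```

For example, `equal_interval_indices(100, 5)` returns `[0, 20, 40, 60, 80]`, and `proportional_fusion([0.7, 0.3], [0.2, 0.8], 3, 1)` weights the image branch at 0.75 and the audio branch at 0.25, so class 0 is selected.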

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.11935 [cs.CV]
	(or arXiv:2503.11935v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.11935

Submission history

From: Yang Zheng
[v1] Sat, 15 Mar 2025 00:59:34 UTC (323 KB)
[v2] Tue, 18 Mar 2025 05:50:24 UTC (610 KB)
[v3] Fri, 21 Mar 2025 09:31:13 UTC (321 KB)
