
Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition

About

We present a novel facial expression recognition network, called Distract your Attention Network (DAN). Our method is based on two key observations. Firstly, multiple classes share inherently similar underlying facial appearance, and their differences could be subtle. Secondly, facial expressions exhibit themselves through multiple facial regions simultaneously, and the recognition requires a holistic approach by encoding high-order interactions among local features. To address these issues, we propose our DAN with three key components: Feature Clustering Network (FCN), Multi-head cross Attention Network (MAN), and Attention Fusion Network (AFN). The FCN extracts robust features by adopting a large-margin learning objective to maximize class separability. In addition, the MAN instantiates a number of attention heads to simultaneously attend to multiple facial areas and build attention maps on these regions. Further, the AFN distracts these attentions to multiple locations before fusing the attention maps to a comprehensive one. Extensive experiments on three public datasets (including AffectNet, RAF-DB, and SFEW 2.0) verified that the proposed method consistently achieves state-of-the-art facial expression recognition performance. Code will be made available at https://github.com/yaoing/DAN.

Zhengyao Wen, Wenzhong Lin, Tao Wang, Ge Xu • 2021
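
The idea of several attention heads looking at different facial regions and then being fused into one map can be illustrated with a small toy example. This is only a sketch under stated assumptions, not the authors' implementation: the function names (`head_attention`, `fuse_attention`, `overlap_penalty`), the dot-product scoring, the max-based fusion, and the pairwise overlap measure are all hypothetical choices made for illustration. The abstract does not specify how DAN computes or fuses its attention maps.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def head_attention(features, weights):
    """One hypothetical attention head.

    Scores each spatial location by a dot product with the head's weights,
    then normalizes the scores over locations.
    """
    scores = [sum(f * w for f, w in zip(loc, weights)) for loc in features]
    return softmax(scores)


def fuse_attention(maps):
    """Fuse per-head maps with an elementwise max, then renormalize to sum to 1.

    Max fusion is an assumption made for this sketch.
    """
    fused = [max(vals) for vals in zip(*maps)]
    total = sum(fused)
    return [v / total for v in fused]


def overlap_penalty(maps):
    """Mean pairwise dot product between head maps.

    A lower value means the heads attend to different locations.
    """
    pairs = [(i, j) for i in range(len(maps)) for j in range(i + 1, len(maps))]
    return sum(
        sum(a * b for a, b in zip(maps[i], maps[j])) for i, j in pairs
    ) / len(pairs)


# Toy input: 4 spatial locations, 2 feature channels each.
features = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [0.0, 0.0]]

head_a = head_attention(features, [5.0, 0.0])  # peaks at location 0
head_b = head_attention(features, [0.0, 5.0])  # peaks at location 1

fused = fuse_attention([head_a, head_b])
diverse = overlap_penalty([head_a, head_b])
redundant = overlap_penalty([head_a, head_a])
```

In this toy setup the fused map keeps the peaks of both heads, and the overlap measure is lower for the two distinct heads than for two copies of the same head, which is the kind of signal a training objective could use to push heads toward different regions.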

Related benchmarks

Task                           Dataset                         Result          Rank
Facial Expression Recognition  RAF-DB (test)                   Accuracy 89.7   180
Facial Expression Recognition  AffectNet 7-way (test)          Accuracy 65.69  91
Facial Expression Recognition  AffectNet 8-way (test)          Accuracy 62.09  65
Facial Expression Recognition  AffWild2 (test)                 Accuracy 65.82  33
Facial Expression Recognition  SFEW 2.0 (val)                  Accuracy 57.88  31
Facial Expression Recognition  AffectNet (test)                Accuracy 62.09  28
Facial Expression Recognition  SFEW 2.0                        Accuracy 53.18  27
Facial Expression Recognition  ExpW                            Accuracy 55.78  27
Emotion Recognition            AffectNet 7 classes (test val)  Accuracy 65.69  25
Facial Expression Recognition  AffectNet 7 classes             Accuracy 65.69  23
Showing 10 of 15 rows
