POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition
About
Facial expression recognition (FER) is an important task in computer vision, having practical applications in areas such as human-computer interaction, education, healthcare, and online monitoring. In this challenging FER task, there are three key issues especially prevalent: inter-class similarity, intra-class discrepancy, and scale sensitivity. While existing works typically address some of these issues, none have fully addressed all three challenges in a unified framework. In this paper, we propose a two-stream Pyramid crOss-fuSion TransformER network (POSTER), that aims to holistically solve all three issues. Specifically, we design a transformer-based cross-fusion method that enables effective collaboration of facial landmark features and image features to maximize proper attention to salient facial regions. Furthermore, POSTER employs a pyramid structure to promote scale invariance. Extensive experimental results demonstrate that our POSTER achieves new state-of-the-art results on RAF-DB (92.05%), FERPlus (91.62%), as well as AffectNet 7 class (67.31%) and 8 class (63.34%). The code is available at https://github.com/zczcwh/POSTER.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Facial Expression Recognition | RAF-DB (test) | Accuracy92.05 | 180 | |
| Facial Expression Recognition | FERPlus (test) | Accuracy0.9162 | 100 | |
| Facial Expression Recognition | AffectNet 7-way (test) | Accuracy67.31 | 91 | |
| Facial Expression Recognition | AffectNet 8-way (test) | Accuracy63.34 | 65 | |
| Facial Expression Recognition | RAF-DB | Accuracy92.05 | 45 | |
| Facial Expression Recognition | JAFFE | Accuracy96.67 | 36 | |
| Facial Expression Recognition | AffWild2 (test) | Accuracy67.74 | 33 | |
| Facial Expression Recognition | FERPlus | Accuracy91.62 | 29 | |
| Facial Expression Recognition | AffectNet (test) | Accuracy63.34 | 28 | |
| Facial Expression Recognition | FERG (test) | Accuracy96.87 | 18 |