Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection

About

Due to its importance in facial behaviour analysis, facial action unit (AU) detection has attracted increasing attention from the research community. Leveraging the online knowledge distillation framework, we propose the ``FANTrans" method for AU detection. Our model consists of a hybrid network of convolution and transformer blocks to learn per-AU features and to model AU co-occurrences. The model uses a pre-trained face alignment network as the feature extractor. After further transformation by a small learnable add-on convolutional subnet, the per-AU features are fed into transformer blocks to enhance their representation. As multiple AUs often appear together, we propose a learnable attention drop mechanism in the transformer block to learn the correlation between the features for different AUs. We also design a classifier that predicts AU presence by considering all AUs' features, to explicitly capture label dependencies. Finally, we make the attempt of adapting online knowledge distillation in the training stage for this task, further improving the model's performance. Experiments on the BP4D and DISFA datasets demonstrating the effectiveness of proposed method.

Jing Yang, Jie Shen, Yiming Lin, Yordan Hristov, Maja Pantic• 2022

Related benchmarks

TaskDatasetResultRank
Facial Action Unit DetectionDISFA
F1 (AU 1)56.4
47
Facial Action Unit DetectionDISFA (test)
Avg AU Score63.8
39
Facial Action Unit RecognitionBP4D
AU 1 F155.4
26
Showing 3 of 3 rows

Other info

Follow for update