Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HSEmotion Team at ABAW-10 Competition: Facial Expression Recognition, Valence-Arousal Estimation, Action Unit Detection and Fine-Grained Violence Classification

About

This article presents our results for the 10th Affective Behavior Analysis in-the-Wild (ABAW) competition. For frame-wise facial emotion understanding tasks (frame-wise facial expression recognition, valence-arousal estimation, action unit detection), we propose a fast approach based on facial embedding extraction with pre-trained EfficientNet-based emotion recognition models. If the latter model's confidence exceeds a threshold, its prediction is used. Otherwise, we feed embeddings into a simple multi-layered perceptron trained on the AffWild2 dataset. Estimated class-level scores are smoothed in a sliding window of fixed size to mitigate noise in frame-wise predictions. For the fine-grained violence detection task, we examine several pre-trained architectures for frame embeddings and their aggregation for video classification. Experimental results on four tasks from the ABAW challenge demonstrate that our approach significantly improves validation metrics over existing baselines.

Andrey V. Savchenko, Kseniia Tsypliakova• 2026

Related benchmarks

TaskDatasetResultRank
Action Unit DetectionAff-wild2 (val)
F1-score PAU54.7
46
Expression RecognitionAff-wild2 (val)
F1 Score (P_EXPR)47.4
22
Valence-Arousal EstimationAff-wild2 (val)
PCC (VA)0.562
13
Showing 3 of 3 rows

Other info

Follow for update