Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAVDESS

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-driven Talking Head GenerationRAVDESS (cross-identity)
FAD1.885
48
Talking Head GenerationRAVDESS intra-identity 1.0
FAD0.833
48
Speech Emotion RecognitionRAVDESS
Unweighted Accuracy92.58
43
Emotion RecognitionRAVDESS (test)
Accuracy0.9735
29
Emotion RecognitionRAVDESS
Accuracy72
28
Audio-Driven Facial AnimationRAVDESS 42 (test)
PSNR30.772
24
Emotion RecognitionRAVDESS (val)
Accuracy97.46
20
Emotion RecognitionRAVDESS 7-class
WAR83.61
19
Discrete Emotion RecognitionRavdess 19 (test)
Accuracy44.04
19
Audio ClassificationRAVDESS
Base Accuracy63.55
13
Avatar FingerprintingRAVDESS (Evaluation)
AUC (%)83
12
Speech Emotion RecognitionRAVDESS 8 classes (test)
Weighted Accuracy84.72
12
Speech Emotion RecognitionRAVDESS In-Domain v1 (test)
Accuracy85.74
12
Song Emotion RecognitionRAVDESS Song
Weighted Accuracy85.8
11
Audiovisual Emotion RecognitionRAVDESS
Accuracy (AV)81.58
11
Response AppropriatenessRAVDESS
Response Appropriateness48
9
Emotion RecognitionRAVDESS (6-fold cross-val)
Accuracy74.86
9
Talking-head generationRAVDESS
IQA Score4.602
8
Speech Emotion RecognitionRAVDESS (6-fold subject-independent cross-validation)
Weighted Accuracy (WA)93.4
8
Facial Emotion RecognitionRAVDESS
WAR87.99
8
Dynamic Facial Expression RecognitionRAVDESS 7-class
WAR83.69
8
Audio ClassificationRAVDESS (test)
Accuracy0.4596
7
Self ReenactmentRAVDESS
PSNR26.5507
6
Emotion RecognitionRAVDESS (speaker-independent)
Accuracy51.7
6
Portrait Image AnimationRAVDESS
Sync-C5.223
6
Showing 25 of 29 rows