ESD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Speech Emotion Recognition | ESD In-Domain v1 (test) | ACC 93.86 | 13 |
| Object Detection | ESD | AP 46.5 | 13 |
| Text-to-Speech | ESD (test) | MOS 4.47 | 11 |
| Emotional Text-to-Speech | ESD (English) | WER 1.411 | 9 |
| Empathetic Response Generation | ESD | Emotional Reaction 1.851 | 8 |
| Emotion Style Transfer | ESD (test) | UTMOS 3.93 | 7 |
| Emotional Speech Synthesis | ESD English (test) | Score (Neutral) 78.39 | 5 |
| Text-to-Speech | ESD English (test) | WER 6.8 | 5 |
| Speech Emotion Recognition | ESD | UA 98.9 | 5 |
| Instance Segmentation | ESD-1 (test) | Accuracy (2 Objects) 95 | 5 |
| Voice Conversion | ESD | WER 0.149 | 4 |
| Chain Generation | ESD-CoT (test) | B-1 44.87 | 3 |
Showing 12 of 12 rows