Speech Emotion Captioning on EMOSpeech (test)
[Chart: SIM over time. Top result: SECap + STMIL + SCCL, 71.95 SIM, Jan 14, 2026.]
Updated 4d ago
Evaluation Results
Method                 Configuration              Date      SIM
SECap + STMIL + SCCL   Encoder=HuBERT, Decode...  2026.01   71.95
SLAM-LLM               Encoder=emotion2vec, D...  2026.01   71.1
SECap + STMIL          Encoder=HuBERT, Decode...  2026.01   68.75
SECap                  Encoder=HuBERT, Decode...  2026.01   67.29
Baseline               Encoder=HTSAT, Decoder...  2026.01   63.95