MMSU

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Audio Understanding | MMSU | Perception Score 55.7 | 37 |
| Audio Question-Answering | MMSU | Score 77.7 | 23 |
| Multi-task Language Understanding | MMSU | Accuracy 71.6 | 23 |
| Multimodal Speech Understanding | MMSU All (test) | Accuracy 75.2 | 20 |
| Multimodal Speech Understanding | MMSU Paralinguistic subset (test) | Accuracy 65.18 | 20 |
| Speech Understanding | MMSU | Accuracy 81.3 | 16 |
| General Audio Understanding | MMSU 1.0 (test) | Perception Semantics 72.13 | 16 |
| Audio Understanding | MMSU (test) | Overall Score 66.64 | 15 |
| Multimodal Understanding | MMSU | MMSU Score 79.36 | 14 |
| Paralinguistic Perception | MMSU Paralinguistic | Para. Score 54.51 | 12 |
| Multi-task Knowledge | MMSU | Accuracy 67.1 | 11 |
| Knowledge | MMSU (test) | Performance 77 | 11 |
| Audio Understanding & Reasoning | MMSU | Score 83.7 | 9 |
| Speech Reasoning | MMSU S→T only | Accuracy 43.2 | 9 |
| Audio-conditioned reasoning | MMSU | Acc 57.63 | 8 |
| Audio Reasoning | MMSU | Accuracy (Audio Reasoning) 70.7 | 7 |