Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MLVU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video UnderstandingMLVU
Score78.19
221
Long Video UnderstandingMLVU
Accuracy83.8
205
Video Question-AnsweringMLVU
Accuracy76.2
194
Video UnderstandingMLVU
Accuracy81.5
114
Video UnderstandingMLVU
Accuracy87.34
80
Long Video UnderstandingMLVU (dev)
Score78.1
63
Long Video UnderstandingMLVU (test)
Average Score81
60
Multi-discipline Long Video UnderstandingMLVU
Score68.9
55
Video UnderstandingMLVU 3-120min (test)
Accuracy47.7
49
Video UnderstandingMLVU 3-120min (dev)
Accuracy63
49
Video Question AnsweringMLVU 78 (test)
Accuracy76.66
45
Video Question AnsweringMLVU
M-Avg Score72.4
40
Long-video Question AnsweringMLVU
M-Avg79.5
39
Long Video UnderstandingMLVU 3-120 min
Accuracy82.1
36
Video Question AnsweringMLVU (dev)
Accuracy78.1
34
Video UnderstandingMLVU (test)
Average100.3
34
Long Video UnderstandingMLVU v1.0 (test)
MLVU Score67.77
28
Offline Video UnderstandingMLVU v1 (test)
Accuracy71.5
26
Video Question AnsweringMLVU 1.0 (test)
Accuracy68.5
26
Video UnderstandingMLVU
Score78.1
24
Long Video UnderstandingMLVU MCQ (test)
Accuracy81.5
22
Long Video UnderstandingMLVU multiple-choice task
Overall Accuracy73.4
21
Video UnderstandingMLVU MCQ (test)
Accuracy81.5
21
Video Understanding ReasoningMLVU
Accuracy73.46
21
Video UnderstandingMLVU (dev)
MLVU Dev Score68.4
21
Showing 25 of 56 rows