Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MLVU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video UnderstandingMLVU
Score78.19
221
Long Video UnderstandingMLVU
Score79.8
154
Video Question-AnsweringMLVU
Accuracy76.2
143
Video UnderstandingMLVU
Accuracy87.34
80
Long Video UnderstandingMLVU (dev)
Score78.1
63
Long Video UnderstandingMLVU (test)
Average Score81
60
Video UnderstandingMLVU 3-120min (test)
Accuracy47.7
49
Video UnderstandingMLVU 3-120min (dev)
Accuracy63
49
Video Question AnsweringMLVU 78 (test)
Accuracy76.66
45
Multi-discipline Long Video UnderstandingMLVU
Score68.9
44
Video Question AnsweringMLVU
M-Avg Score72.4
40
Long-video Question AnsweringMLVU
M-Avg79.5
39
Video UnderstandingMLVU (test)
Average100.3
34
Long Video UnderstandingMLVU 3-120 min
Accuracy82.1
23
Long Video UnderstandingMLVU MCQ (test)
Accuracy81.5
22
Long Video UnderstandingMLVU multiple-choice task
Overall Accuracy73.4
21
Video UnderstandingMLVU MCQ (test)
Accuracy81.5
21
Video Understanding ReasoningMLVU
Accuracy73.46
21
Video UnderstandingMLVU
Accuracy64.7
20
Video Question AnsweringMLVU (dev)
Accuracy74.5
19
Long Video UnderstandingMLVU (651s)
Accuracy78.1
18
Video UnderstandingMLVU
Base Accuracy68.4
18
Video UnderstandingMLVU
Accuracy71.4
17
Video Question AnsweringMLVU MCQ
Accuracy82.1
17
Video UnderstandingMLVU (dev)
MLVU Dev Score68.4
17
Showing 25 of 46 rows