Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Video-MMMU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal Fact-Level AttributionVideo-MMMU 1.0 (sampled examples)
Accuracy86.8
24
Video UnderstandingVideo-MMMU (test)
Accuracy63.6
9
Visual Question AnsweringVideo-MMMU 1.0 (sampled examples)
Acc86
4
Showing 3 of 3 rows