
Vision-Language Benchmark Suite

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Vision-Language Understanding | Vision-Language Benchmark Suite (Aggregate) | Aggregate Performance (%): 100 | 34 |
| Multimodal Understanding | Vision-Language Benchmark Suite (MMMU, MathVista, MMBEn, MMBCn, MMStar, HallBench, AI2D, OCRBench) | MMMU Score: 63.9 | 10 |