Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Image benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image UnderstandingImage benchmarks Aggregate
Overall Score64.82
21
Multimodal Understanding and ReasoningImage Benchmarks HallBench, MME, TextVQA, ChartQA, AI2D, RealWorldQA, CCBench, OCRVQA, SQA-IMG, POPE
HallBench Score46.5
13
Showing 2 of 2 rows