Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ZeroBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image ReasoningZeroBench sub
Accuracy24.4
14
Multimodal reasoningZeroBench
Accuracy26.35
14
Multimodal reasoningZeroBench main
Pass@111
13
MathZeroBench
Score17.66
8
General Reasoning & UnderstandingZeroBench
Accuracy18.9
8
Multimodal reasoningZeroBench sub
Pass@130.8
7
Showing 6 of 6 rows