Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-benchmark Score

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Search and Perception-intensive ReasoningMulti-benchmark Score Overall
Overall Score64.59
14
Showing 1 of 1 rows