Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Aggregate Math Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal Mathematical ReasoningAggregate Math Benchmarks
Overall Macro Score87.47
6
Showing 1 of 1 rows