Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematics, Multilingual, Coding, and Instruction Following

Benchmarks

Task NameDataset NameSOTA ResultTrend
ReasoningMathematics, Multilingual, Coding, and Instruction Following Aggregate
Average Normalized Score95.2
9
Showing 1 of 1 rows