Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OlymMATH

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningOlymMATH
Accuracy9.3
16
Mathematical ReasoningOlymMATH Hard
Pass@17.5
3
Mathematical ReasoningOlymMATH Easy
Pass@141.62
3
Showing 3 of 3 rows