Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OCW

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningOCW
Accuracy20.2
16
Mathematical ReasoningOCW (test)
Accuracy17.6
8
Mathematical Problem SolvingOCW
Accuracy17.6
7
Mathematical Problem SolvingOCW (test)
maj@k0.308
5
Showing 4 of 4 rows