Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OlyBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Symbolic ReasoningOlyBench
Accuracy36.5
25
Mathematical ReasoningOlyBench
Accuracy44.6
11
Showing 2 of 2 rows