Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

We-Math

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-step mathematical reasoningWe-Math (test)
S1 Score72.8
20
Math ReasoningWe-Math
Pass@176.4
19
Showing 2 of 2 rows