Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BrUMO

Benchmarks

Task NameDataset NameSOTA ResultTrend
ReasoningBRUMO 2025
Accuracy60.83
21
Mathematical ReasoningBRUMO
Trace Count826
20
ReasoningBrumo 25
Trace Count613
20
ReasoningBrUMO25
Pass@194.58
14
Mathematical ReasoningBRUMO
Accuracy67.5
7
Mathematical ReasoningBRUMO 2025 (test)
Pass@1 Accuracy56.66
4
Mathematical ReasoningBRUMO 2025
Pass@451.42
2
Showing 7 of 7 rows