Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Arithmetic

Benchmarks

Task NameDataset NameSOTA ResultTrend
Causal Variable IdentificationArithmetic
F1 (X)88.2
7
Outcome ReasoningArithmetic
M' F1 Mean87.8
7
Mathematical ReasoningArithmetic (val)
Accuracy83
3
Showing 3 of 3 rows