Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GSM-Plus

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningGSM-Plus
Acc (Original)93.4
28
Mathematical ReasoningGSM-Plus (test)
Accuracy68.8
20
Mathematical ReasoningGSM-Plus (mini)
Accuracy52.8
8
Math Word Problem SolvingGSM+ v1 (test)
Accuracy65.7
6
Showing 4 of 4 rows