Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on GSM-SYS
Loading...
80.9
Accuracy
SATLM
18.604
34.777
50.95
67.123
May 16, 2023
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
SATLM
Language Model=code-da...
2023.05
80.9
SATLM
Language Model=code-da...
2023.05
69.4
COT
Language Model=code-da...
2023.05
56.1
PROGLM
Language Model=code-da...
2023.05
53.4
COT
Language Model=code-da...
2023.05
46.5
PROGLM
Language Model=code-da...
2023.05
43.4
STANDARD
Language Model=code-da...
2023.05
21
Feedback
Search any
task
Search any
task