Mathematical Reasoning on GSM8K (Accuracy, Output Tokens, Expense)
[Chart: leading result by Accuracy is TALE-EP, 84.46 (Dec 24, 2024). Metrics shown: Accuracy, Output Tokens, Expense.]
Updated 4d ago
Evaluation Results
Method              Configuration              Date     Accuracy  Output Tokens  Expense
TALE-EP             Prompting Strategy=TAL...  2024.12  84.46     77.26          279.84
Vanilla CoT         Prompting Strategy=Van...  2024.12  81.35     318.1          541.09
Directly Answering  Prompting Strategy=Dir...  2024.12  28.29     12.46          39.43
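
The trade-off between accuracy and cost in these results can be made explicit with a little arithmetic. The sketch below is illustrative only: it copies the three rows listed here and compares each method against Vanilla CoT. The names `results`, `relative_reduction`, and `compare` are hypothetical helpers, not part of the benchmark, and the units of Expense are not stated on this page, so only unitless ratios are computed.

```python
# Values copied from the Evaluation Results rows (GSM8K, 2024.12).
# The dictionary layout and helper names are assumptions for illustration.
results = {
    "TALE-EP": {"accuracy": 84.46, "output_tokens": 77.26, "expense": 279.84},
    "Vanilla CoT": {"accuracy": 81.35, "output_tokens": 318.1, "expense": 541.09},
    "Directly Answering": {"accuracy": 28.29, "output_tokens": 12.46, "expense": 39.43},
}


def relative_reduction(baseline: float, value: float) -> float:
    """Fraction by which `value` is smaller than `baseline` (negative if larger)."""
    return 1.0 - value / baseline


def compare(method: str, baseline: str = "Vanilla CoT") -> dict:
    """Compare one method against a baseline on all three reported metrics."""
    m, b = results[method], results[baseline]
    return {
        # Difference in accuracy, in the same points as the reported column.
        "accuracy_delta": m["accuracy"] - b["accuracy"],
        # Share of output tokens saved relative to the baseline.
        "token_reduction": relative_reduction(b["output_tokens"], m["output_tokens"]),
        # Share of expense saved relative to the baseline.
        "expense_reduction": relative_reduction(b["expense"], m["expense"]),
    }


if __name__ == "__main__":
    for name in results:
        if name == "Vanilla CoT":
            continue
        c = compare(name)
        print(
            f"{name}: accuracy {c['accuracy_delta']:+.2f}, "
            f"tokens -{c['token_reduction']:.1%}, "
            f"expense -{c['expense_reduction']:.1%}"
        )
```

On these numbers, TALE-EP uses roughly a quarter of the output tokens of Vanilla CoT while reporting slightly higher accuracy, whereas Directly Answering is cheapest but far less accurate.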