Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on GSM8K (Acc)
Loading...
55
Accuracy (GSM8K)
SEA
31.08
37.29
43.5
49.71
May 26, 2025
Accuracy (GSM8K)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy (GSM8K)
SEA
Base Model=LLaMA2-13B-...
2025.05
55
BoN64
Base Model=LLaMA2-13B-...
2025.05
46
SFT
Base Model=LLaMA2-13B-...
2025.05
32
Feedback
Search any
task
Search any
task