Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on ASDiv-Aug (accuracy)
Loading...
92.14
Accuracy
SoftCoT
36.7912
51.1606
65.53
79.8994
Feb 17, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
SoftCoT
N (Number of reasoning...
2025.02
92.14
Zero-Shot CoT
N (Number of reasoning...
2025.02
91.97
Zero-Shot Assist-CoT
N (Number of reasoning...
2025.02
91.91
SoftCoT
N (Number of reasoning...
2025.02
91.83
Zero-Shot CoT
N (Number of reasoning...
2025.02
91.7
Zero-Shot Assist-CoT
N (Number of reasoning...
2025.02
91.64
Coconut
N (Number of reasoning...
2025.02
90.37
Coconut
N (Number of reasoning...
2025.02
89.4
SoftCoT
Backbone=LLaMA-3.1-8B-...
2025.02
87.19
Zero-Shot Assist-CoT
Backbone=LLaMA-3.1-8B-...
2025.02
86.96
Zero-Shot CoT-Unk
Backbone=LLaMA-3.1-8B-...
2025.02
86.9
Coconut
Backbone=LLaMA-3.1-8B-...
2025.02
86.8
Zero-Shot CoT
Backbone=LLaMA-3.1-8B-...
2025.02
86.78
LoRA Fine-Tuning
Backbone=LLaMA-3.1-8B-...
2025.02
86.67
Coconut
Backbone=GPT-2
2025.02
38.92
Feedback
Search any
task
Search any
task