Mathematical Reasoning on CP
[Chart: Accuracy over time. Best result: StrategyLLM-SC, Accuracy 54, Nov 15, 2023]
Updated 3d ago
Evaluation Results
| Method | Backbone | Date | Accuracy |
| StrategyLLM-SC | Meta-Llama-3-... | 2023.11 | 54 |
| SolutionLLM | Meta-Llama-3-... | 2023.11 | 51.5 |
| StrategyLLM | Meta-Llama-3-... | 2023.11 | 51.5 |
| CoT | Meta-Llama-3-... | 2023.11 | 48.5 |
| StrategyLLM-SC | Mixtral-8x22B... | 2023.11 | 47.5 |
| CoT-SC | Meta-Llama-3-... | 2023.11 | 47 |
| SolutionLLM | Mixtral-8x22B... | 2023.11 | 44.5 |
| StrategyLLM | Mixtral-8x22B... | 2023.11 | 44.5 |
| CoT | Mixtral-8x22B... | 2023.11 | 41 |
| CoT-SC | Mixtral-8x22B... | 2023.11 | 40.5 |
| StrategyLLM-SC | Mixtral-8x7B-... | 2023.11 | 32 |
| StrategyLLM | Mixtral-8x7B-... | 2023.11 | 28.5 |
| CoT-SC | Mixtral-8x7B-... | 2023.11 | 26.5 |
| StrategyLLM-SC | Meta-Llama-3-... | 2023.11 | 25 |
| StrategyLLM | Meta-Llama-3-... | 2023.11 | 24.5 |
| CoT | Mixtral-8x7B-... | 2023.11 | 24.5 |
| SolutionLLM | Mixtral-8x7B-... | 2023.11 | 22.5 |
| SolutionLLM | Meta-Llama-3-... | 2023.11 | 20.5 |
| CoT-SC | Meta-Llama-3-... | 2023.11 | 19.5 |
| CoT | Meta-Llama-3-... | 2023.11 | 16 |