Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on MA
Loading...
98.7
Accuracy
StrategyLLM
68.124
76.062
84
91.938
Nov 15, 2023
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
StrategyLLM
Model=GPT-4
2023.11
98.7
StrategyLLM-SC
Model=GPT-4
2023.11
98.7
SolutionLLM
Model=GPT-4
2023.11
96.7
CoT-SC
Model=GPT-4
2023.11
94.7
CoT
Model=GPT-4
2023.11
92.7
StrategyLLM-SC
Model=Claude-3-Sonnet
2023.11
88
StrategyLLM
Model=Claude-3-Sonnet
2023.11
83.3
CoT-SC
Model=Claude-3-Sonnet
2023.11
76.7
CoT
Model=Claude-3-Sonnet
2023.11
72.7
SolutionLLM
Model=Claude-3-Sonnet
2023.11
69.3
Feedback
Search any
task
Search any
task