Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reasoning on CorrectBench
Loading...
0.8256
Accuracy
MP
0.51724
0.597295
0.67735
0.757405
Feb 21, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MP
Model=Qwen-3-8B, Promp...
2026.02
0.8256
Ann Brown
Model=Qwen-3-8B, Promp...
2026.02
0.8256
CoT
Model=Qwen-3-8B, Promp...
2026.02
0.8215
Std
Model=Qwen-3-8B, Promp...
2026.02
0.7903
CoT
Model=Llama-3-8B, Prom...
2026.02
0.7502
MP
Model=Llama-3-8B, Prom...
2026.02
0.7486
Ann Brown
Model=Llama-3-8B, Prom...
2026.02
0.6814
Std
Model=Llama-3-8B, Prom...
2026.02
0.5291
Feedback
Search any
task
Search any
task