Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding on HumanEval (Solve Rate)
Loading...
0.9618
Solve Rate
CapFlow
0.898256
0.914753
0.93125
0.947747
Feb 11, 2026
Solve Rate
Executability
Updated 4d ago
Evaluation Results
Method
Method
Links
Solve Rate
Executability
CapFlow
Type=Learning, Setting...
2026.02
0.9618
-
ScoreFlow
Type=Learning, Setting...
2026.02
0.9541
-
AFlow
Type=Refinement, Setti...
2026.02
0.9389
-
ScoreFlow
Type=Learning, Setting...
2026.02
0.9389
-
CapFlow
Type=Learning, Setting...
2026.02
0.9389
-
CoT-SC
Type=Manual, Setting=M...
2026.02
0.9236
-
SPP
Type=Manual, Setting=M...
2026.02
0.9236
-
CoT
Type=Manual, Setting=M...
2026.02
0.916
-
Self-Refine
Type=Manual, Setting=M...
2026.02
0.9083
-
ADAS
Type=Refinement, Setti...
2026.02
0.9083
-
GPT-4o-mini
Type=Manual, Setting=M...
2026.02
0.9007
-
Feedback
Search any
task
Search any
task