Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Science on GPQA (Solve Rate/Executability)
Loading...
42.41
Solve Rate
CapFlow
29.7636
33.0468
36.33
39.6132
Feb 11, 2026
Solve Rate
Executability
Updated 4d ago
Evaluation Results
Method
Method
Links
Solve Rate
Executability
CapFlow
Type=Learning, Setting...
2026.02
42.41
-
CapFlow
Type=Learning, Setting...
2026.02
42.41
-
AFlow
Type=Refinement, Setti...
2026.02
42.18
-
ScoreFlow
Type=Learning, Setting...
2026.02
38.69
-
ScoreFlow
Type=Learning, Setting...
2026.02
38.69
-
ADAS
Type=Refinement, Setti...
2026.02
36.12
-
CoT-SC
Type=Manual, Setting=M...
2026.02
35.95
-
CoT
Type=Manual, Setting=M...
2026.02
35.73
-
GPT-4o-mini
Type=Manual, Setting=M...
2026.02
34.16
-
SPP
Type=Manual, Setting=M...
2026.02
33.72
-
Self-Refine
Type=Manual, Setting=M...
2026.02
30.25
-
Feedback
Search any
task
Search any
task