Open-ended Computer Science Problem Solving on Frontier-CS
Top result: AdaEvolve, Mean Score 61.33
[Chart: Mean Score and Median Score by date; Feb 23, 2026]
Evaluation Results
Method               Details                    Date     Mean Score  Median Score
AdaEvolve            Backbone=GPT-5, Number...  2026.02  61.33       75.15
OpenEvolve           Backbone=GPT-5, Number...  2026.02  50.75       56.37
ShinkaEvolve         Backbone=GPT-5, Number...  2026.02  47.79       46.22
GEPA                 Backbone=GPT-5, Number...  2026.02  43.04       33.68
GPT-5 (single call)  Backbone=GPT-5, Number...  2026.02  20.64       0