Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
BlackBox Optimization
Loading...
3.9372
Best Score
DeltaEvolve
0.275256
1.225953
2.17665
3.127347
Feb 2, 2026
Best Score
Token Consumption
Updated 1mo ago
Evaluation Results
Method
Method
Links
Best Score
Token Consumption
DeltaEvolve
LLM Ensemble Family=Ge...
2026.02
3.9372
1,227,388
DeltaEvolve
LLM Ensemble Family=GP...
2026.02
2.7297
1,390,709
AlphaEvolve (Full Code)
LLM Ensemble Family=GP...
2026.02
2.6415
1,852,841
AlphaEvolve (Full Code)
LLM Ensemble Family=Ge...
2026.02
2.5221
1,894,890
Greedy Refine
LLM Ensemble Family=Ge...
2026.02
2.3403
1,054,566
Greedy Refine
LLM Ensemble Family=GP...
2026.02
2.2618
555,430
Parallel Sampling
LLM Ensemble Family=GP...
2026.02
0.4161
390,872
Parallel Sampling
LLM Ensemble Family=Ge...
2026.02
0.4161
390,872
Feedback
Search any
task
Search any
task