PDE Solver on PDE Solver
Best Score: 0.9931 (DeltaEvolve)
[Chart: Best Score / Token Consumption; Feb 2, 2026]
Updated 1mo ago
Evaluation Results
Method                    Setting                     Date      Best Score   Token Consumption
DeltaEvolve               LLM Ensemble Family=Ge...   2026.02   0.9931       253,719
AlphaEvolve (Full Code)   LLM Ensemble Family=Ge...   2026.02   0.9901       595,094
DeltaEvolve               LLM Ensemble Family=GP...   2026.02   0.8915       562,848
AlphaEvolve (Full Code)   LLM Ensemble Family=GP...   2026.02   0.885        711,298
Parallel Sampling         LLM Ensemble Family=GP...   2026.02   0.7506       154,016
Parallel Sampling         LLM Ensemble Family=Ge...   2026.02   0.7506       154,016
Greedy Refine             LLM Ensemble Family=GP...   2026.02   0.7506       375,186
Greedy Refine             LLM Ensemble Family=Ge...   2026.02   0.7506       255,332