Transaction Scheduling on ADRS TXN
[Chart: Best and Mean score over time. Top result: 4,348 (AdaEvolve), Feb 23, 2026.]
Updated 4d ago
Evaluation Results
Method        Backbone      Date     Best   Mean
AdaEvolve     GPT-5         2026.02  4,348  4,317
OE            GPT-5         2026.02  4,329  4,239
Shinka        GPT-5         2026.02  4,329  4,090
AdaEvolve     Gemini-3-Pro  2026.02  4,310  4,221
OE            Gemini-3-Pro  2026.02  4,274  4,109
Shinka        Gemini-3-Pro  2026.02  4,255  3,932
GEPA          Gemini-3-Pro  2026.02  4,167  3,616
GEPA          GPT-5         2026.02  3,984  3,753
Human / SOTA  -             2026.02  2,725  -