Transaction Scheduling on ADRS TXN
[Chart: Best and Mean scores over time, Feb 23, 2026 to Mar 22, 2026. Best: 4,348 (AdaEvolve).]
Updated 26d ago
Evaluation Results
Method        Configuration              Date     Best     Mean
AdaEvolve     Backbone=GPT-5             2026.02  4,348    4,317
OE            Backbone=GPT-5             2026.02  4,329    4,239
Shinka        Backbone=GPT-5             2026.02  4,329    4,090
AdaEvolve     Backbone=Gemini-3-Pro      2026.02  4,310    4,221
OE            Backbone=Gemini-3-Pro      2026.02  4,274    4,109
Shinka        Backbone=Gemini-3-Pro      2026.02  4,255    3,932
GEPA          Backbone=Gemini-3-Pro      2026.02  4,167    3,616
GEPA          Backbone=GPT-5             2026.02  3,984    3,753
Engram        Number of runs=10, Con...  2026.03  3,918.6  -
OE            Number of runs=10, Con...  2026.03  3,713.7  -
Human / SOTA  -                          2026.02  2,725    -
Human SOTA    -                          2026.03  2,724.8  -