Generalized Planning on PDDL trading domain
[Chart: Solution Rate; cot4 at 100 on Mar 10, 2026. Other selectable metrics: Runtime, Score, Cost, Generation Time.]
Evaluation Results
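| Method   | Configuration           | Links   | Solution Rate | Runtime | Score | Cost | Generation Time |
|----------|-------------------------|---------|---------------|---------|-------|------|-----------------|
| cot4     | LLM=GPT-4               | 2026.03 | 100           | 0.24    | 0.21  | -    | -               |
| cot4o    | LLM=GPT-4o              | 2026.03 | 100           | 0.15    | 0.88  | -    | -               |
| evo_nev  | Variant=No Evolution    | 2026.03 | 100           | 0.25    | 0.19  | 1.9  | 901.03          |
| evo_mini | LLM=GPT-4o-mini         | 2026.03 | 100           | 0.14    | 0.92  | 0.15 | 2,061.29        |
| evo_ds   | Variant=DS              | 2026.03 | 100           | 0.15    | 0.91  | 1.87 | 659.57          |
| evo      | Name=GenePlan (Full)    | 2026.03 | 100           | 0.15    | 0.92  | 1.83 | 653.72          |
| fd_1800  | Search Limit=1800s      | 2026.03 | 93.33         | 394.48  | 0.93  | -    | -               |
| fd_300   | Search Limit=300s       | 2026.03 | 43.33         | 219.94  | 0.43  | -    | -               |
| fd_opt   | Search Strategy=Optimal | 2026.03 | 0             | -       | 0     | -    | -               |
| evo_abl  | Variant=Ablation        | 2026.03 | 0             | -       | 0     | -    | -               |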