Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Generalized Planning on PDDL
Loading...
100
Solution Percentage
fd_1800
-4
23
50
77
Mar 10, 2026
Solution Percentage
Runtime
Score
Cost
Generation Time
Updated 1mo ago
Evaluation Results
Method
Method
Links
Solution Percentage
Runtime
Score
Cost
Generation Time
fd_1800
Search Limit=1800s
2026.03
100
31.87
100
-
-
cot4
LLM=GPT-4
2026.03
100
5.95
6
-
-
cot4o
LLM=GPT-4o
2026.03
100
0.93
11
-
-
evo_nev
Variant=No Evolution
2026.03
100
5.88
6
2.32
853.06
evo_mini
LLM=GPT-4o-mini
2026.03
100
5.9
6
0.15
1,589.72
evo_ds
Variant=DS
2026.03
100
1.49
29
2.52
738.03
evo
Name=GenePlan (Full)
2026.03
100
0.75
71
2.81
933.97
fd_300
Search Limit=300s
2026.03
90
83.48
90
-
-
fd_opt
Search Strategy=Optimal
2026.03
0
-
0
-
-
evo_abl
Variant=Ablation
2026.03
0
-
0
-
-
Feedback
Search any
task
Search any
task