Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Generalized Planning on PDDL hiking domain
Loading...
100
Solution Rate
fd_300
-4
23
50
77
Mar 10, 2026
Solution Rate
Runtime
Score
Cost
Generation Time
Updated 1mo ago
Evaluation Results
Method
Method
Links
Solution Rate
Runtime
Score
Cost
Generation Time
fd_300
Search Limit=300s
2026.03
100
5.45
89
-
-
fd_1800
Search Limit=1800s
2026.03
100
2.73
89
-
-
fd_opt
Search Strategy=Optimal
2026.03
100
12.68
100
-
-
cot4
LLM=GPT-4
2026.03
100
0.91
55
-
-
cot4o
LLM=GPT-4o
2026.03
100
0.9
55
-
-
evo_nev
Variant=No Evolution
2026.03
100
0.9
55
1.45
602.82
evo_mini
LLM=GPT-4o-mini
2026.03
100
1.03
55
0.09
16,105.96
evo_ds
Variant=DS
2026.03
100
0.93
89
1.55
573.33
evo
Name=GenePlan (Full)
2026.03
100
0.88
89
1.67
657
evo_abl
Variant=Ablation
2026.03
0
-
0
-
-
Feedback
Search any
task
Search any
task