Generalized Planning on PDDL heavypack
[Chart: Percent Solved by date. Data point shown: fd_300, 100 percent solved, Mar 10, 2026. Other selectable metrics: Runtime, Score, Cost, Generation Time.]
Updated 1mo ago
Evaluation Results
| Method | Configuration | Links | Percent Solved | Runtime | Score | Cost | Generation Time |
|---|---|---|---|---|---|---|---|
| fd_300 | Search Limit=300s | 2026.03 | 100 | 21.19 | 1 | - | - |
| fd_1800 | Search Limit=1800s | 2026.03 | 100 | 10.04 | 1 | - | - |
| fd_opt | Search Strategy=Optimal | 2026.03 | 100 | 11.49 | 1 | - | - |
| cot4 | LLM=GPT-4 | 2026.03 | 100 | 1.15 | 1 | - | - |
| cot4o | LLM=GPT-4o | 2026.03 | 100 | 1.25 | 1 | - | - |
| evo_nev | Variant=No Evolution | 2026.03 | 100 | 1.24 | 1 | 1.26 | 460.61 |
| evo_mini | LLM=GPT-4o-mini | 2026.03 | 100 | 1.14 | 1 | 0.07 | 895.7 |
| evo_ds | Variant=DS | 2026.03 | 100 | 1.12 | 1 | 1.3 | 437.16 |
| evo | Name=GenePlan (Full) | 2026.03 | 100 | 1.15 | 1 | 1.26 | 461.24 |
| evo_abl | Variant=Ablation | 2026.03 | 0 | - | 0 | - | - |