Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Stack on PDDLLM v1 (test)
Loading...
98.5
Planning Success Rate
Expert
39.428
54.764
70.1
85.436
May 23, 2025
Planning Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Planning Success Rate
Expert
Time limit=50 s
2025.05
98.5
PDDLLM
Time limit=50 s
2025.05
97.5
RuleAsMem
Time limit=50 s, ablat...
2025.05
85.5
LLMTAMP-FF
Time limit=50 s
2025.05
70.8
LLMTAMP-FR
Time limit=50 s
2025.05
64.2
LLMTAMP
Time limit=50 s
2025.05
41.7
Feedback
Search any
task
Search any
task