Overall on PDDLLM v1 (test)
[Chart: Planning Success Rate. Best result: Expert, 95.7 (May 23, 2025). Updated 4d ago.]
Evaluation Results
Method        Setting                     Date      Planning Success Rate
Expert        Time limit=50 s             2025.05   95.7
PDDLLM        Time limit=50 s             2025.05   93.3
RuleAsMem     Time limit=50 s, ablat...   2025.05   69.9
LLMTAMP-FF    Time limit=50 s             2025.05   52.5
LLMTAMP-FR    Time limit=50 s             2025.05   48.6
LLMTAMP       Time limit=50 s             2025.05   35.7