| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Multilingual Benchmark | IA Score7.37 | 17 | 2mo ago | ||
| PDDLLM v1 (test) | Planning Success Rate73.3 | 6 | 3mo ago | ||
| Habitat Rearrange Easy challenge 2.0 (val) | Galactic | Success Rate26.4 | 2 | 3mo ago | |
| Habitat 2.0 (train) | Galactic | Success Rate36.7 | 2 | 3mo ago | |
| Galactic (Eval) | Galactic | Success Rate86.7 | 1 | 3mo ago | |
| Galactic (train) | Galactic | Success Rate95.3 | 1 | 3mo ago |