| Dataset Name | SOTA Method | Metric | Value | Count | Updated |
|---|---|---|---|---|---|
| Multilingual Benchmark | | IA Score | 7.37 | 17 | 1mo ago |
| PDDLLM v1 (test) | | Planning Success Rate | 73.3 | 6 | 1mo ago |
| Habitat Rearrange Easy challenge 2.0 (val) | Galactic | Success Rate | 26.4 | 2 | 1mo ago |
| Habitat 2.0 (train) | Galactic | Success Rate | 36.7 | 2 | 1mo ago |
| Galactic (Eval) | Galactic | Success Rate | 86.7 | 1 | 1mo ago |
| Galactic (train) | Galactic | Success Rate | 95.3 | 1 | 1mo ago |