| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| REVERIE Unseen (test) | | SR 81.51 | 43 | 1mo ago | |
| REVERIE (val unseen) | | Success Rate (SR) 61 | 34 | 1mo ago | |
| D4RL antmaze-medium-diverse | OFQL | Normalized Score 90.2 | 22 | 1mo ago | |
| D4RL antmaze-medium-play | OFQL | Normalized Score 88.1 | 22 | 1mo ago | |
| Obstacle World (test) | Plaintext w/ Camera Views | Accuracy 99.5 | 21 | 1mo ago | |
| PointMaze | DHRL | Success Rate 9,980 | 21 | 1mo ago | |
| D4RL antmaze-large-diverse (antmaze-l-d) | FQL | Normalized Score 83 | 17 | 1mo ago | |
| D4RL antmaze-large-play (antmaze-l-p) | FQL | Normalized Score 84 | 17 | 1mo ago | |
| Complex | DHRL | Success Rate 4,010 | 16 | 1mo ago | |
| Bottleneck | DHRL | Success Rate 38.7 | 16 | 1mo ago | |
| AntMaze | DHRL | Success Rate 9,110 | 16 | 1mo ago | |
| AntMaze Small | DHRL | Success Rate 9,510 | 16 | 1mo ago | |
| MiniWorld FourRooms | LaP3 | Success Rate 89.1 | 15 | 1mo ago | |
| CityNav seen (val) | | Navigation Error (NE) 9.1 | 14 | 1mo ago | |
| CityNav unseen (val) | | Navigation Error (NE) 9.4 | 14 | 1mo ago | |
| CityNav (test unseen) | | Navigation Error (NE) 9.8 | 14 | 1mo ago | |
| REVERIE (val seen) | FAST-MATTN | Success Rate (SR) 50.53 | 14 | 1mo ago | |
| MiniWorld MazeS3 | A2C | Success Rate 98.7 | 14 | 1mo ago | |
| CityNav Sao Paulo 1.0 | AgentNav (GPT 5) | Success 29 | 12 | 1mo ago | |
| CityNav Vienna 1.0 | AgentNav (GPT 5) | Success 56 | 12 | 1mo ago | |
| CityNav Tokyo 1.0 | AgentNav (GPT 5) | Success Rate 0.3 | 12 | 1mo ago | |
| CityNav New York 1.0 | AgentNav (O3) | Success Rate 95 | 12 | 1mo ago | |
| D4RL AntMaze umaze v2 | IQL | Initial D4RL Score 137.4 | 12 | 1mo ago | |
| D4RL antmaze-umaze-diverse | C-LAP | Normalized Score 7,500 | 12 | 1mo ago | |
| HIL platform Scenario S5 | E-Navi | CPU Utilization (%) 28.9 | 10 | 1mo ago | |