| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Rover(r, w, s, o) domain (test) | GPA + Soft-FLARES | Planning Time | 0.01 | 104 | 1mo ago |
| IPC Domains | | Flexibility | 93 | 99 | 17d ago |
| nuScenes (val) | GuideFlow | Collision Rate (Avg) | 0.07 | 80 | 1mo ago |
| Sudoku | ESPO | Accuracy | 92.7 | 76 | 8d ago |
| Countdown | ESPO | Accuracy | 82 | 68 | 1mo ago |
| Room Domain | Our Proposed Framework | Time (s) | 0.15 | 56 | 1mo ago |
| NAVSIM (navtest) | DiffusionDrive | NC | 99.6 | 53 | 1mo ago |
| Traffic Norms Domain | | T (s) | 0.592 | 52 | 1mo ago |
| nuPlan 14 Random (test) | | CLS-NR | 0.948 | 49 | 1mo ago |
| Delicate Can (c) (test) | | Time (s) | 0 | 48 | 1mo ago |
| NAVSIM (test) | | PDMS | 90.8 | 44 | 1mo ago |
| nuScenes v1.0-trainval (val) | Senna | ST-P3 L2 Error (1s) | 0.11 | 39 | 22d ago |
| LIBERO Object Suite | V-VLAPS (both) | Average MCTS Simulations | 31.79 | 33 | 1mo ago |
| LIBERO Spatial Suite | V-VLAPS (both) | Average MCTS Simulation | 28.8 | 33 | 1mo ago |
| Openscene (val) | MOSAIC | EPDMS | 85.02 | 30 | 8d ago |
| PushT | LeWM | Success Rate | 96 | 27 | 26d ago |
| NAVSIM | MTDrive | Path Deviation Metric Score | 96.2 | 25 | 29d ago |
| nuScenes | SAMoE-VLA | L2 Error (Avg) | 0.29 | 24 | 11d ago |
| UltraTool (test) | GraphSAGE | n-F1 | 72.81 | 24 | 1mo ago |
| nuPlan 14 Hard (test) | | CLS-NR | 86 | 23 | 1mo ago |
| TravelPlanner #180 (val) | HiMAP-Travel | CS-Micro | 95.64 | 22 | 1mo ago |
| BlocksWorld | | Success Rate | 100 | 20 | 18d ago |
| HM3D | SERP | SPL | 78.4 | 18 | 1mo ago |
| NavSim (Navhard) | DriveSuprim | NC | 0.989 | 18 | 1mo ago |
| Wall | Adversarial WM | Success Rate | 0.94 | 18 | 1mo ago |