| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Sudoku | ESPO | Accuracy92.7 | 129 | 2d ago | |
| Rover(r, w, s, o) domain (test) | GPA + Soft-FLARES | Planning Time0.01 | 104 | 3mo ago | |
| IPC Domains | Flexibility93 | 99 | 2mo ago | ||
| nuScenes (val) | GuideFlow | Collision Rate (Avg)0.07 | 97 | 9d ago | |
| Countdown | BGPO | Accuracy87.5 | 89 | 2d ago | |
| NAVSIM (test) | DriveSuprim | PDMS93.5 | 59 | 23h ago | |
| Room Domain | Our Proposed Framework | Time (s)0.15 | 56 | 3mo ago | |
| NAVSIM (navtest) | DiffusionDrive | NC99.6 | 53 | 3mo ago | |
| Traffic Norms Domain | T (s)0.592 | 52 | 3mo ago | ||
| nuPlan 14 Random (test) | CLS-NR0.948 | 49 | 2mo ago | ||
| Delicate Can (c) (test) | Time (s)0 | 48 | 3mo ago | ||
| nuScenes v1.0-trainval (val) | Senna | ST-P3 L2 Error (1s)0.11 | 39 | 21d ago | |
| PushT | SD-JEPA | Success Rate97.3 | 35 | 2d ago | |
| LIBERO Object Suite | V-VLAPS (both) | Average MCTS Simulations31.79 | 33 | 3mo ago | |
| LIBERO Spatial Suite | V-VLAPS (both) | Average MCTS Simulation28.8 | 33 | 3mo ago | |
| 3D-Continuous Light-Dark | VOMCPOW | Mean Return4.8 | 30 | 15d ago | |
| Openscene (val) | MOSAIC | EPDMS85.02 | 30 | 1mo ago | |
| Countdown | GDSD w/ TLC | Accuracy85.6 | 27 | 5d ago | |
| NAVSIM | MTDrive | Path Deviation Metric Score96.2 | 25 | 2mo ago | |
| Navsim v2 (navtest) | DriveFuture | NC98.8 | 24 | 19d ago | |
| NavSim (Navhard) | Hydra-MDP | NC97.6 | 24 | 2d ago | |
| nuScenes | SAMoE-VLA | L2 Error (Avg)0.29 | 24 | 1mo ago | |
| UltraTool (test) | GraphSAGE | n-F172.81 | 24 | 3mo ago | |
| NAVSIM v1 | Human Driver | PDMS94.8 | 23 | 9d ago | |
| nuPlan 14 Hard (test) | CLS-NR86 | 23 | 3mo ago |