| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Closed-loop Planning | Bench2Drive | Driving Score89.07 | 90 | |
| End-to-End Autonomous Driving | Bench2Drive base set | Driving Score81.9 | 46 | |
| End-to-End Autonomous Driving | Bench2Drive | Driving Score73.86 | 27 | |
| Closed-loop Autonomous Driving | Bench2Drive closed-loop | DS97 | 24 | |
| Closed-loop Autonomous Driving | Bench2Drive | Driving Score (DS)88.42 | 21 | |
| Closed-loop Planning | Bench2Drive (test) | Driving Score86.77 | 21 | |
| Autonomous Driving | Bench2Drive 220 routes across CARLA towns | Efficiency161.31 | 20 | |
| Autonomous Driving | Bench2Drive base (train) | Driving Score78.04 | 19 | |
| Autonomous Driving | Bench2Drive 220 short segment scenarios | Driving Score82.37 | 18 | |
| Autonomous Driving | Bench2Drive base set closed-loop | Driving Score (DS)0.7922 | 15 | |
| Open-loop Planning | Bench2Drive base set open-loop | Average L2 Error0.57 | 15 | |
| Autonomous Driving | Bench2Drive local benchmark | DS85.94 | 14 | |
| Autonomous Driving | Bench2Drive base set open-loop | Avg L2 Error0.62 | 13 | |
| End-to-end Autonomous Driving | Bench2Drive (test) | Driving Score74.22 | 13 | |
| Autonomous Driving | Bench2Drive base subset (50 clips for open-loop) | Avg L2 Error0.62 | 11 | |
| Planning | Bench2Drive open-loop (val) | Avg L2 Error0.62 | 11 | |
| Autonomous Driving | Bench2Drive (dev10) | Merging Success Rate17.57 | 11 | |
| Planning | Bench2Drive CARLA Leaderboard 2.0 (official 220 routes) | Driving Score64.22 | 11 | |
| End-to-End Autonomous Driving | Bench2Drive Closed-loop | Driving Score96.25 | 10 | |
| Autonomous Driving | Bench2Drive Multi-Ability Benchmark (test) | Merging Score34.67 | 10 | |
| Planning | Bench2Drive (test) | Avg. L2 (m)0.62 | 9 | |
| End-to-End Autonomous Driving | Bench2Drive Open-loop | Avg L2 Error (m)0.62 | 9 | |
| Autonomous Driving | Bench2Drive Multi-Ability | Merging Score0.8127 | 9 | |
| Autonomous Driving | Bench2Drive CARLA v2 (test) | Driving Score (DS)64.22 | 9 | |
| Open-loop Planning | Bench2Drive | Average L2 Error0.61 | 9 |