| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Autonomous Driving | HUGSIM Overall | Reachability Coverage (RC)47.5 | 7 | |
| Autonomous Driving | HUGSIM Extreme | Reachability Coverage (RC)39.1 | 7 | |
| Autonomous Driving | HUGSIM Hard | Reachability Coverage (RC)40.4 | 7 | |
| Autonomous Driving | HUGSIM Medium | Reachability Coverage (RC)50.9 | 7 | |
| Autonomous Driving | HUGSIM Easy | Reachability Coverage (RC)76.9 | 7 | |
| Closed-loop evaluation | HUGSIM pre-challenge v1 (test) | RC (Easy)84.2 | 5 | |
| Closed-loop autonomous driving | HUGSIM zero-shot | RC (Easy)80.9 | 4 |