| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Autonomous driving reasoning (cross-view risk object perception, action prediction, and planning) | DriveLM | Accuracy 77.5 | 10 |
| Visual Question Answering | DriveLM | BLEU-4 54.56 | 8 |
| Graph Visual Question Answering | DriveLM GVQA | Accuracy 74 | 7 |
| Graph VQA | DriveLM (test) | BLEU-4 59.01 | 6 |
| Language Understanding | DriveLM | BLEU-4 53.09 | 6 |
| Driving Scene Understanding | DriveLM (test) | SPICE 45.45 | 5 |
| Generative Question Answering | DriveLM (test) | BLEU-4 53.09 | 5 |
| Drive VQA | DriveLM | GPT Score 67.3 | 5 |
| Spatial Perception | DriveLM | Match Score 13.43 | 3 |