| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Embodied Question Answering | RoboVQA | BLEU-1 77.1 | 13 |
| Temporal Task Planning | RoboVQA | Score 74.5 | 11 |
| Long-horizon reasoning for robotic manipulation | RoboVQA | B-1 Score 70.1 | 10 |
| Robotic Video Question Answering | RoboVQA (test) | BLEU-1 72.7 | 6 |
| Embodied QA | RoboVQA (test) | BLEU-1 77.1 | 5 |
| Robotic Reasoning | RoboVQA (val) | BLEU-4 42.8 | 4 |