| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Embodied Question Answering | BridgeEQA 1,100 QA pairs (test) | Answer Correctness | 64.8 | 15 |
| Embodied Question Answering | BridgeEQA (test) | Image Citation Relevance | 88.9 | 15 |
| Condition Rating | BridgeEQA (instances with < 30 images) | Exact Match Accuracy | 40.9 | 9 |
| Condition Rating | BridgeEQA fewer than 30 images | Condition Rating Accuracy (±1) | 81.8 | 9 |