| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Event-based Video Question Answering | EvQA-Sparse 200 Questions | Accuracy66 | 28 | |
| Event-based Video Question Answering | EvQA 1000 Questions (full) | Accuracy76.1 | 28 | |
| End-to-end Retrieval | EVQA | R@10089.5 | 6 | |
| Visual Entity Recognition | EVQA | Recall@147.4 | 6 | |
| Event-based Visual Question Answering | EvQA | Total Accuracy0.673 | 3 |