| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Video Question Answering | Molmo QA Benchmarks Long Video 19 | Long Video Average: 80.4 | 22 |
| Multi-Image Question Answering | Molmo QA Benchmarks Multi-Image 19 | Average Score (Multi-Image): 81.9 | 20 |
| Image Question Answering | Molmo QA Benchmarks Image 19 | Image Average Accuracy: 86.2 | 20 |