| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long Video Understanding | LongVideoBench | Score58.9 | 269 | |
| Long Video Understanding | LongVideoBench (val) | Accuracy80 | 225 | |
| Video Question-Answering | LongVideoBench | Accuracy77.6 | 210 | |
| Long-form Video Understanding | LongVideoBench | Accuracy82.3 | 135 | |
| Video Understanding | LongVideoBench | LongVideoBench Score59.2 | 123 | |
| Long Video Understanding | LongVideoBench | Accuracy67 | 97 | |
| Video Question Answering | LongVideoBench (val) | Accuracy83 | 87 | |
| Video Understanding | LongVideoBench | Accuracy66.7 | 56 | |
| Video Understanding | LongVideoBench 1-60min | Accuracy56.8 | 49 | |
| Video Question Answering | LongVideoBench (LVB) 58 (test) | Accuracy66.42 | 45 | |
| Video Question Answering | LongVideoBench (test) | Accuracy (Long)63.7 | 42 | |
| Long-context video understanding | LongVideoBench (test) | Accuracy66.7 | 28 | |
| Video Understanding | LongVideoBench (test) | Accuracy (Overall)82.3 | 25 | |
| Long Video Understanding | LongVideoBench | LongVideoBench Score57.1 | 24 | |
| Video Question Answering | LONGVIDEOBENCH Medium | Accuracy56.1 | 24 | |
| Video Reasoning | LongVideoBench | LongVideoBench Score59 | 24 | |
| Long-video understanding | LongVideoBench (LVB) (test) | LVB Score67.1 | 22 | |
| Long-video understanding | LongVideoBench (LVB) | Accuracy76.8 | 21 | |
| Video Question Answering | LongVideoBench KFS-Bench (full set) | Accuracy65.5 | 20 | |
| Long-form Video Understanding | LongVideoBench | Overall Score60.4 | 19 | |
| Long Video Understanding | LongVideoBench 23sec-60 min | Accuracy74.4 | 19 | |
| Long Video Understanding | LongVideoBench (test) | Accuracy66.7 | 19 | |
| Long-Video Understanding | LongVideoBench 8s~60m | Score66.7 | 17 | |
| Long Video Understanding | LongVideoBench (LongVB) | Accuracy69.2 | 17 | |
| Long Video Understanding | LongVideoBench 473s | Accuracy67.5 | 16 |