| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| LongVideoBench | VideoChat-M1 | Accuracy 82.3 | 135 | 16d ago | |
| EgoSchema | | Accuracy 72.2 | 67 | 12d ago | |
| LVU | SVT SCALE | Relation Attribute Accuracy 76.47 | 44 | 3mo ago | |
| LVBench | LLaVA-Video-7B† + Ours (Query-Conditioned Evidential Keyframe Sampling) | Overall Score 49.4 | 35 | 1mo ago | |
| LongVideoBench | Qwen2.5-VL-128 | Overall Score 60.4 | 19 | 12d ago | |
| LVU (test) | S5 + LSMCL | Relation Top-1 Acc 67.11 | 16 | 3mo ago | |
| LVU 1.0 (test) | HierarQ | Director Accuracy 78.4 | 14 | 3mo ago | |
| Video-MME w/o sub. | DynFrame-8B | Score (w/o sub) 72.3 | 13 | 7d ago | |
| VideoMME | ST-SimDiff | Overall Score 61.7 | 13 | 12d ago | |
| Video-MME long-form duration | ColorTrigger | Overall Performance 66.1 | 12 | 2mo ago | |
| LVBench w/o sub (test) | DVD | Accuracy 74.2 | 11 | 2mo ago | |
| LV-Bench | MovieChat | ER 21.3 | 10 | 3mo ago | |
| LongVideoBench long (val) | VideoSeek | Accuracy 73.5 | 7 | 2mo ago | |
| VideoMME long subset w/ sub | VideoSeek | Accuracy 81.2 | 6 | 2mo ago | |
| LongVideoBench | LLaVA-Video | Original Score 58.4 | 6 | 2mo ago | |
| LVBench w/ sub (test) | VideoSeek | Accuracy 76.7 | 3 | 2mo ago | |