| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| ActivityNet | | Accuracy 62.3 | 29 | 3d ago | |
| Vad-Reasoning-Plus | Qwen3VL-Thinking | BLEU-3 0.106 | 27 | 3d ago | |
| MSVD | MiniGPT4-Video | Accuracy 73.92 | 22 | 3d ago | |
| TruthfulQA | MoLaCE | Neutral Accuracy 74.24 | 15 | 3d ago | |
| SAGE Web Search | | Weighted Recall (Com. Sci.) 35.1 | 12 | 3d ago | |
| MMAD (test) | MAU-GPT | ROUGE-1 0.7026 | 12 | 3d ago | |
| TREC-DL-NF (S5) | MinosEval | Kendall's Tau (K) 68.61 | 11 | 3d ago | |
| ANTIQUE (S5) | MinosEval | Kendall's Tau (K) 65.97 | 11 | 3d ago | |
| Proposed LLM-based evaluation benchmark OEQ | | Completeness 96.9 | 9 | 3d ago | |
| QAEGO4D (test) | GroundVQAB | ROUGE 30.4 | 9 | 3d ago | |
| TGIF | MiniGPT4-Video | Accuracy 0.7222 | 6 | 3d ago | |