| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video-Quiz Evaluation | SciVidEval | VLM-as-Judge Score99.5 | 10 | |
| Visual Quality Evaluation | SciVidEval | VLM-as-Judge Score10 | 9 | |
| Narration Quality Evaluation | SciVidEval | Perplexity (PPL)16.67 | 8 | |
| Synchronization Evaluation | SciVidEval | CLIP Score0.643 | 7 |