| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Malicious Pickle Detection | Curated Dataset standard (train-test) | TPR 100 | 11 |
| Jailbreak evaluation | curated dataset (test) | BAD BOT Rate 0 | 11 |
| Action-conditioned 4D scene generation | Curated dataset of 10 scenes (test) | Camera Control 93.26 | 8 |
| Action-conditioned 4D scene generation | Curated dataset of 10 scenes 1.0 (test) | Physics Plausibility 93.5 | 7 |
| Zero-shot Text-guided Video Editing | Curated dataset 90-frames | CLIP-F 95.99 | 7 |
| Zero-shot Text-guided Video Editing | Curated dataset 8-frames | CLIP-F 95.95 | 6 |
| Zero-shot Text-guided Video Editing | Curated dataset 36-frames | CLIP-F 9,318 | 5 |