| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Retrieval | SVD | mAP0.902 | 23 | |
| Image-to-Video Generation | SVD-XT Generated Videos | PSNR22.43 | 5 | |
| Dysphonic voice detection | SVD (val) | Mean Accuracy72.89 | 5 | |
| Dysphonic voice detection | SVD (train) | Mean Accuracy89.1 | 5 | |
| Watermark Robustness | SVD-XT | Robustness (Gauss Noise) - Cat I0.972 | 4 | |
| Image-to-Video | SVD | Latency (s)68 | 4 |