| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Grounding | MAD (test) | Recall@1 (IoU=0.1)17.3 | 35 | |
| Video Grounding | MAD 1.0 (test) | R@1 (IoU=0.3)11.26 | 26 | |
| Synthetic in-context reasoning | MAD synthetic (test) | Compression Score55.5 | 24 | |
| Movie Audio Description generation | MAD-eval-Named v2 (test) | C Score28.2 | 17 | |
| Human motion segmentation | MAD | Precision (Pr)0.7043 | 14 | |
| Tabular Classification | MAD M (test) | Macro F1 Score0.861 | 13 | |
| Classification | mad (test) | Median UAR84 | 12 | |
| Anomaly Segmentation | MAD (test) | AUROC98.4 | 11 | |
| Anomaly Localization | MAD | Gorilla99.5 | 11 | |
| Long Video Moment Retrieval | MAD (test) | Recall@1 (Tol 0.1)15 | 10 | |
| Anomaly Detection | MAD | Gorilla93.6 | 9 | |
| Single-image Morphing Attack Detection | MAD22 | EER (OpenCV)0.3 | 8 | |
| Audio Description Generation | MAD-eval-Named (test) | Originality1.49 | 8 | |
| Anomaly Detection | MAD-sys (test) | AUROC (Image-level)95.6 | 6 | |
| Image Classification | MAD-C | CCA94.82 | 5 | |
| Image Classification | MAD-M | CCA98.84 | 5 | |
| Video Temporal Grounding | MAD v2 (test) | Recall@1 (IoU=0.3)14.72 | 4 | |
| Adversarial Defense | MAD-C (Learned) | Defense Success Rate (DSR)90.7 | 4 | |
| Adversarial Defense | MAD-M (Learned) | DSR101.4 | 4 | |
| Anomaly Detection | MAD-Real (test) | AUROC95.6 | 4 | |
| Audio Description Generation | MAD-eval Named | R-L12.1 | 4 | |
| Synthetic token manipulation | MAD benchmark | Compression Score43.8 | 2 | |
| Synthetic in-context reasoning | MAD | Compress- | 0 |