Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MAD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video GroundingMAD (test)
Recall@1 (IoU=0.1)17.3
35
Synthetic in-context reasoningMAD synthetic (test)
Compression Score55.5
24
Video GroundingMAD 1.0 (test)
R@1 (IoU=0.1)12.43
17
Movie Audio Description generationMAD-eval-Named v2 (test)
C Score28.2
17
Human motion segmentationMAD
Precision (Pr)0.7043
14
Anomaly SegmentationMAD (test)
AUROC98.4
11
Anomaly LocalizationMAD
Gorilla99.5
11
Long Video Moment RetrievalMAD (test)
Recall@1 (Tol 0.1)15
10
Tabular ClassificationMAD M (test)
Macro F1 Score0.861
9
Anomaly DetectionMAD
Gorilla93.6
9
Single-image Morphing Attack DetectionMAD22
EER (OpenCV)0.3
8
Audio Description GenerationMAD-eval-Named (test)
Originality1.49
8
Anomaly DetectionMAD-sys (test)
AUROC (Image-level)95.6
6
Audio Description GenerationMAD-eval Named
R-L12.1
4
Synthetic token manipulationMAD benchmark
Compression Score43.8
2
Synthetic in-context reasoningMAD
Compress-
0
Showing 16 of 16 rows