Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MAD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video GroundingMAD (test)
Recall@1 (IoU=0.1)17.3
35
Video GroundingMAD 1.0 (test)
R@1 (IoU=0.3)11.26
26
Synthetic in-context reasoningMAD synthetic (test)
Compression Score55.5
24
Movie Audio Description generationMAD-eval-Named v2 (test)
C Score28.2
17
Human motion segmentationMAD
Precision (Pr)0.7043
14
Tabular ClassificationMAD M (test)
Macro F1 Score0.861
13
Classificationmad (test)
Median UAR84
12
Anomaly SegmentationMAD (test)
AUROC98.4
11
Anomaly LocalizationMAD
Gorilla99.5
11
Long Video Moment RetrievalMAD (test)
Recall@1 (Tol 0.1)15
10
Anomaly DetectionMAD
Gorilla93.6
9
Single-image Morphing Attack DetectionMAD22
EER (OpenCV)0.3
8
Audio Description GenerationMAD-eval-Named (test)
Originality1.49
8
Anomaly DetectionMAD-sys (test)
AUROC (Image-level)95.6
6
Image ClassificationMAD-C
CCA94.82
5
Image ClassificationMAD-M
CCA98.84
5
Video Temporal GroundingMAD v2 (test)
Recall@1 (IoU=0.3)14.72
4
Adversarial DefenseMAD-C (Learned)
Defense Success Rate (DSR)90.7
4
Adversarial DefenseMAD-M (Learned)
DSR101.4
4
Anomaly DetectionMAD-Real (test)
AUROC95.6
4
Audio Description GenerationMAD-eval Named
R-L12.1
4
Synthetic token manipulationMAD benchmark
Compression Score43.8
2
Synthetic in-context reasoningMAD
Compress-
0
Showing 23 of 23 rows