Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Clear

Benchmarks

Task NameDataset NameSOTA ResultTrend
Out-of-Distribution DetectionCLEAR10 ID
AUROC (COCO)99.66
40
Out-of-Distribution DetectionCLEAR100 ID
AUROC (COCO)97.37
40
Visual Question AnsweringCLEAR 1.0 (Retain)
Accuracy70.9
32
ClassificationCLEAR
Error Rate4
24
Machine UnlearningCLEAR (test 2)
Forget Accuracy44
16
Machine UnlearningCLEAR (test 1)
Forget Accuracy42
16
Question AnsweringCLEAR Real-world 1.0
Acc94.7
16
Question AnsweringCLEAR 1.0 (Retain)
R-L Score0.352
16
Question AnsweringCLEAR Forget 1.0
R-L Score0.367
16
Visual Question AnsweringCLEAR Forget 1.0
Accuracy34.2
16
Online Continual Self-Supervised LearningCLEAR100 11 experiences (streaming online)
Final Accuracy51.5
9
Bias EvaluationCLEAR Bias
Age Performance82.9
5
Visual Question AnsweringCLEAR Real QA
Accuracy (Aut)76.6
4
Identity RecognitionCLEAR (Retain)
Recall4.21
4
Identity RecognitionCLEAR (Forget)
Recall62
4
Depth CompletionClear-Real (test)
RMSE0.041
4
PredictionCLEAR Control Group
Time per 1000 Iterations287.85
3
PredictionCLEAR (Treatment Group)
Time per 1000 Iterations232.64
3
Causal ReasoningCLEAR
Accuracy60.5
3
Temporal OOD DetectionClear10 (ID) vs Visual Genome (OOD) (Late split t=8)
FPR9515.34
2
Temporal OOD DetectionClear10 (ID) vs COCO (OOD) (Late split t=8)
FPR@951.34
2
Temporal OOD DetectionClear100 ID vs Flickr30 OOD Early split (t=2)
FPR@95% TPR8.69
2
Temporal OOD DetectionClear100 ID vs ImageNet-1K OOD (Early split t=2)
FPR@95% TPR6.49
2
Temporal OOD DetectionClear100 (ID) vs COCO (OOD) t=2 (Early split)
FPR@9519.14
2
Showing 24 of 24 rows