| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Out-of-Distribution Detection | CLEAR10 ID | AUROC (COCO)99.66 | 40 | |
| Out-of-Distribution Detection | CLEAR100 ID | AUROC (COCO)97.37 | 40 | |
| Visual Question Answering | CLEAR 1.0 (Retain) | Accuracy70.9 | 32 | |
| Classification | CLEAR | Error Rate4 | 24 | |
| Machine Unlearning | CLEAR (test 2) | Forget Accuracy44 | 16 | |
| Machine Unlearning | CLEAR (test 1) | Forget Accuracy42 | 16 | |
| Question Answering | CLEAR Real-world 1.0 | Acc94.7 | 16 | |
| Question Answering | CLEAR 1.0 (Retain) | R-L Score0.352 | 16 | |
| Question Answering | CLEAR Forget 1.0 | R-L Score0.367 | 16 | |
| Visual Question Answering | CLEAR Forget 1.0 | Accuracy34.2 | 16 | |
| Online Continual Self-Supervised Learning | CLEAR100 11 experiences (streaming online) | Final Accuracy51.5 | 9 | |
| Bias Evaluation | CLEAR Bias | Age Performance82.9 | 5 | |
| Visual Question Answering | CLEAR Real QA | Accuracy (Aut)76.6 | 4 | |
| Identity Recognition | CLEAR (Retain) | Recall4.21 | 4 | |
| Identity Recognition | CLEAR (Forget) | Recall62 | 4 | |
| Depth Completion | Clear-Real (test) | RMSE0.041 | 4 | |
| Prediction | CLEAR Control Group | Time per 1000 Iterations287.85 | 3 | |
| Prediction | CLEAR (Treatment Group) | Time per 1000 Iterations232.64 | 3 | |
| Causal Reasoning | CLEAR | Accuracy60.5 | 3 | |
| Temporal OOD Detection | Clear10 (ID) vs Visual Genome (OOD) (Late split t=8) | FPR9515.34 | 2 | |
| Temporal OOD Detection | Clear10 (ID) vs COCO (OOD) (Late split t=8) | FPR@951.34 | 2 | |
| Temporal OOD Detection | Clear100 ID vs Flickr30 OOD Early split (t=2) | FPR@95% TPR8.69 | 2 | |
| Temporal OOD Detection | Clear100 ID vs ImageNet-1K OOD (Early split t=2) | FPR@95% TPR6.49 | 2 | |
| Temporal OOD Detection | Clear100 (ID) vs COCO (OOD) t=2 (Early split) | FPR@9519.14 | 2 |