| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CIFAR100 vs. SVHN | Gradnorm | AURC Score679.48 | 39 | 4d ago | |
| CIFAR100 (test) | Gradnorm | AURC369.86 | 39 | 4d ago | |
| ImageNet vs. Textures New setting | AURC431.12 | 11 | 4d ago | ||
| ImageNet Old setting | AURC238.18 | 11 | 4d ago | ||
| DSMF-CALVIN (test) | I-FailSense | Accuracy90.64 | 10 | 4d ago | |
| ImageNet vs. WILDS New setting | AURC409.89 | 10 | 4d ago | ||
| RoboFail (Out-Of-Domain) | Guardian-8B-Thinking | Execution Binary Acc86 | 8 | 4d ago | |
| UR5-Fail Out-Of-Domain | Execution Binary Acc79 | 7 | 4d ago | ||
| Mobile manipulation environment | PaLM-E-12B | F1 Score91 | 7 | 4d ago | |
| Medical Segmentation Decathlon (MSD) Pancreatic Tumor (test) | SynthCP + VAE alarm | MAE15.19 | 7 | 4d ago | |
| CIFAR100 New FD Setting | MSP | AURC269.42 | 5 | 4d ago | |
| CIFAR100 Old Setting | MSP | AURC27.72 | 5 | 4d ago |