| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CommonsenseQA | Power0.9999 | 207 | 5d ago | ||
| Stanford Cars | SYNC | Selective Prediction Error5.35 | 60 | 5d ago | |
| CIFAR-100 | SYNC | Selective Prediction Error0.4 | 60 | 5d ago | |
| ImageNet-100 | SN | Selective Prediction Error0.2 | 60 | 5d ago | |
| TriviaQA (test) | LEC | Power (α=0.1)100 | 24 | 5d ago | |
| Diabetic Retinopathy (DR) (test) | Sale_EU_crit | AUSC0.65 | 10 | 5d ago | |
| Diabetic Retinopathy (DR) grading patient-stratified (test) | Sale_EU_crit | AUSC (Critical FNR)0.65 | 10 | 5d ago | |
| Tabular Data Averaged across Wine, Heart, Diabetes, Cirrhosis, Yeast (test) | AURC30.68 | 5 | 5d ago | ||
| SQuAD 2.0 | UAT-LITE | Coverage@0.969.04 | 2 | 5d ago | |
| MNLI | UAT-LITE | Coverage @ 0.986.92 | 2 | 5d ago |