| Dataset Name | SOTA Method | Metric | Value | Results | Updated |
|---|---|---|---|---|---|
| MEPS (test) | | Oracle ECE | 0.81 | 150 | 22d ago |
| MultiCorrupt nuScenes v1.0-trainval (val) | DA-PS | D-ECE | 6.777 | 26 | 26d ago |
| AV1451 | Brier Loss | Classwise ECE | 13.44 | 14 | 1mo ago |
| ImageNet-C severity level 5 (test) | Δ-UQ | ECE (Mean) | 0.044 | 13 | 3mo ago |
| nuScenes Singapore semantic shift from Boston (test) | Density-aware Isotonic Regression | D-ECE | 3.645 | 7 | 26d ago |
| CIFAR-10 LT-100 (test) | UniMix+Bayias | ACE | 2.31 | 7 | 3mo ago |
| All 6 tabular | Clustered Calibration | ΔNLL (%) | 0.49 | 1 | 8d ago |
| WiDS | Clustered Calibration | ΔNLL (%) | 16 | 1 | 8d ago |
| Stroke | Clustered Calibration | ΔNLL (%) | 0.76 | 1 | 8d ago |
| LOS | Clustered Calibration | ΔNLL | 0.16 | 1 | 8d ago |
| Diabetes130 | Clustered Calibration | ΔNLL (%) | 17 | 1 | 8d ago |
| Credit | Clustered Calibration | ΔNLL (%) | 1.55 | 1 | 8d ago |
| Adult | Clustered Calibration | ΔNLL (%) | 12 | 1 | 8d ago |