| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| MACE | Prob Entropy | AUROC | 82.4 | 84 | 1mo ago |
| CIFAR-100 | GCE | ECE | 1.25 | 81 | 9d ago |
| CIFAR-10 | BalCAL | ECE | 0.76 | 68 | 9d ago |
| CIFAR10 (test) | BSCE-GRA | ECE | 0.74 | 61 | 1mo ago |
| ImageNet 1k (val) | MIR | ECE | 0.4509 | 56 | 1mo ago |
| SVHN | BalCAL | ECE | 0.24 | 40 | 1mo ago |
| Tiny ImageNet | Brier Loss | Expected Calibration Error | 0.82 | 32 | 9d ago |
| Tiny-ImageNet (test) | AdaFL53 | ECE | 1.35 | 25 | 1mo ago |
| MN | DNN | MAE | 5.413 | 20 | 1mo ago |
| BN | | MAE | 20.84 | 20 | 1mo ago |
| CIFAR-10, CIFAR-100, and SVHN | BalCAL | Average ECE | 1.77 | 13 | 1mo ago |
| MATH | Verbalized (steered) | ECE | 3.3 | 12 | 22d ago |
| C-R34-25 | MAMCC | ECE | 8.26 | 11 | 1mo ago |
| C-R34-16 | MAMCC | ECE | 8.77 | 11 | 1mo ago |
| C-R34-9 | MAMCC | ECE | 9.46 | 11 | 1mo ago |
| C-SIFT | MAMCN | ECE | 6.68 | 11 | 1mo ago |
| C-KMeans | DAMCN | ECE | 9.67 | 11 | 1mo ago |
| C-SBN | MAMCC | ECE | 6.23 | 11 | 1mo ago |
| C-Row | MAMCC | ECE | 6.01 | 11 | 1mo ago |
| MATH, GSM8K, SelfAware, and TruthfulQA combined | CARE-GRPO | ECE | 0.086 | 10 | 1mo ago |
| CIFAR100 (test) | BSCE-GRA | AdaECE | 1.52 | 10 | 1mo ago |
| MN 0.3 | HMC | MAE | 4.515 | 10 | 1mo ago |
| BN 10 | HMC | MAE | 54.63 | 10 | 1mo ago |
| BN 0.3 | HMC | MAE | 5.3 | 10 | 1mo ago |
| ImageNet C R S | Reference Masking Regularization | Smoothed ECE | 0.105 | 8 | 1mo ago |