| Dataset Name | SOTA Method | Metric | Trend | Updated | |
|---|---|---|---|---|---|
| MACE (test) | SCA | AUROC 81.2 | 84 | 4d ago | |
| CIFAR-100-LT (test) | Knowledge-Transferring-based Temperature Scaling | ECE 0.015 | 53 | 4d ago | |
| Pubmed | CaGCN | ECE 0.0308 | 36 | 4d ago | |
| Citeseer | GATS | ECE 3.86 | 36 | 4d ago | |
| Cora | CaGCN | ECE 0.0313 | 36 | 4d ago | |
| CoraFull | CaGCN | ECE 0.0701 | 28 | 4d ago | |
| SimpleQA | Probe (train on TriviaQA) | Brier Score 0.0386 | 27 | 4d ago | |
| Average of four domains Relational Inference Planning | first-second-distance-based (FSD) | Brier Score 0.114 | 18 | 4d ago | |
| MultiNLI Mismatch (test) | MIR | ECE 0.0071 | 16 | 4d ago | |
| BeyondAIME (test) | Qwen3-4B-Instruct-ppo-value | SNR Gain 1.202 | 15 | 4d ago | |
| iNaturalist 2021 | PTSK + PROCAL | ECE 0.65 | 12 | 4d ago | |
| FMNIST ID (test) | OTIS | ECE 3.26 | 9 | 4d ago | |
| MNIST ID (test) | CEDA | ECE 0.14 | 9 | 4d ago | |
| SVHN ID (test) | OE | ECE 1.28 | 9 | 4d ago | |
| CIFAR-100 ID (test) | | ECE 6.08 | 9 | 4d ago | |
| CIFAR-10 ID (test) | OTIS | ECE 1.88 | 9 | 4d ago | |
| HLE (test) | HTC | ECE 0.031 | 7 | 4d ago | |
| GPQA (test) | HTC | ECE 0.102 | 7 | 4d ago | |
| SimpleQA (test) | HTC | ECE 6.8 | 7 | 4d ago | |
| ImageNet ID (test) | | ECE 2.05 | 6 | 4d ago | |