| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| TriQA | CDKC | Expected Calibration Error8.58 | 10 | 4d ago | |
| SeaQA | GRPO | ECE9.42 | 10 | 4d ago | |
| Bamboo | CDKC | Expected Calibration Error34.01 | 10 | 4d ago | |
| BeerQA | CDKC | ECE23.28 | 10 | 4d ago | |
| 2Wiki | CDKC | ECE17.43 | 10 | 4d ago | |
| HotQA | CDKC | ECE22.11 | 10 | 4d ago | |
| MusQ | CDKC | ECE48.08 | 10 | 4d ago | |
| PopQA | CDKC | ECE19.85 | 10 | 4d ago | |
| SIVAL-MIPL r=3 (test) | MAMCC | Reduction in Error45.47 | 6 | 4d ago | |
| Birdsong-MIPL r=3 (test) | MAMCN | Reduction in Error52.68 | 6 | 4d ago | |
| FMNIST-MIPL r=3 (test) | SAMCN | Reduction in Error19.76 | 6 | 4d ago | |
| MNIST-MIPL r=3 (test) | MAMCN | Error Reduction43.18 | 6 | 4d ago | |
| SIVAL-MIPL r=2 (test) | MAMCC | Reduction in Error47.71 | 6 | 4d ago | |
| Birdsong-MIPL r=2 (test) | MAMCC | Reduction in Error54.49 | 6 | 4d ago | |
| FMNIST-MIPL r=2 (test) | DAMCN | Reduction in Error42.51 | 6 | 4d ago | |
| MNIST-MIPL r=2 (test) | DAMCN | Error Reduction58.76 | 6 | 4d ago | |
| SIVAL-MIPL r=1 (test) | MAMCC | Reduction in Error49.52 | 6 | 4d ago | |
| Birdsong-MIPL r=1 (test) | MAMCC | Reduction in Error56.36 | 6 | 4d ago | |
| FMNIST-MIPL r=1 (test) | MAMCN | Reduction in Error48.2 | 6 | 4d ago | |
| MNIST-MIPL r=1 (test) | DAMCC | Error Reduction58.32 | 6 | 4d ago |