SOTA Uncertainty Calibration benchmarks and papers with code | Wizwand
Uncertainty Calibration
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| CIFAR-10-C | AUGMIX | RMS Calibration Error | 5.2 | 35 | 1mo ago |
| QUBIQ 2021 | Tackle | Expected Calibration Error (ECE) | 0.81 | 28 | 3d ago |
| CIFAR-F | Knowledge-Transferring-based Temperature Scaling | ECE | 5.97 | 27 | 1mo ago |
| CIFAR-10.1 C | Knowledge-Transferring-based Temperature Scaling | ECE | 29.98 | 27 | 1mo ago |
| CIFAR-10.1 | Knowledge-Transferring-based Temperature Scaling | ECE | 0.0395 | 27 | 1mo ago |
| RewardBench | Verbalized | Kuiper | 0.009 | 24 | 1mo ago |
| JudgeBench | Probe | Kuiper | 0.037 | 24 | 1mo ago |
| GPQA diamond | ET-PE | AUROC | 0.696 | 18 | 1mo ago |
| GPQA main | PE | AUROC | 0.672 | 18 | 1mo ago |
| SciBench | PE | AUROC | 78.7 | 18 | 1mo ago |
| MATH 500 | ET-PE | AUROC | 0.861 | 18 | 1mo ago |
| OASIS | S+LC | AUSE | 0.0659 | 8 | 1mo ago |
| VTAB | CLAP (Ours) + CoOp | ECE | 13.6 | 8 | 1mo ago |
| CUB200 | CLAP (Ours) + MaPLe | ECE | 18.4 | 8 | 1mo ago |
| ImageNet-R | CLAP (Ours) + MaPLe | ECE | 0.146 | 8 | 1mo ago |
| ImageNet 100 | CLAP (Ours) + AttriCLIP | ECE | 0.205 | 8 | 1mo ago |
| CIFAR100 | MaPLe | ECE | 0.168 | 8 | 1mo ago |
| Sentinel-1 Ice Pack | Bayesian Transformer | Weighted ECE | 0.17 | 6 | 1mo ago |
| Sentinel-1 MIZ | Bayesian Transformer | Weighted ECE | 0.18 | 6 | 1mo ago |
| MEPS Panel 21 2017 (test) | NRC-Embed | Length | 242 | 6 | 1mo ago |
| MEPS Panel 20 2017 (test) | DeepEnsemble | Length | 19,050 | 6 | 1mo ago |
| MEPS Panel 19 2017 (test) | NRC-Embed | Length | 240.5 | 6 | 1mo ago |
| Boston | NRC | MACE | 5 | 6 | 1mo ago |
| Concrete | NRC | MACE | 0.05 | 6 | 1mo ago |
| Auto-MPG MPG3 | NRC | MACE | 0.08 | 6 | 1mo ago |
Showing 25 of 47 rows