Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Uncertainty Quantification benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Uncertainty Quantification
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Average of 6 datasets
Dissimilarity + beamsearch
PRR
65
120
4d ago
Musique 500 randomly sampled queries (test)
R2C
AUROC
0.8322
70
4d ago
HotpotQA 500 randomly sampled queries (test)
R2C
AUROC
83.25
70
4d ago
PopQA 500 randomly sampled queries (test)
R2C
AUROC
0.8709
70
4d ago
Vision Datasets averaged (test)
SGPU
AUROC
81.7
36
4d ago
MulFactTrap (test)
RUfact
ROC AUC
0.898
32
4d ago
Mixed Dataset (real and fake biographies)
RUgen
ROC AUC
0.9001
32
4d ago
MAQA ∆K−1
Structure-Aware Minimum Bayes Risk Decoding
KL Divergence AUC
0.757
28
4d ago
CNN/DailyMail
Structure-Aware Minimum Bayes Risk Decoding
Hamming AUC
0.745
28
4d ago
WMT 19
KLE
COMET AUC
0.608
28
4d ago
MAQA
Structure-Aware Minimum Bayes Risk Decoding
Hamming AUC
83.5
28
4d ago
SciQ (test)
SENTSAR
AUROC
74.5
28
4d ago
CIFAR-10 (test)
MAP
Accuracy
93.5
14
4d ago
PTB
LSTM
CU
417
12
4d ago
MIT-BIH
LSTM
CU
998
12
4d ago
MedQA (test)
SAR
AUROC
0.635
9
4d ago
JetBot Simulation 625 trials (val)
SS EKF + CP
Marginal Coverage
91.2
8
4d ago
MBot hardware 4511 trials (val)
SS EKF + CP
Marginal Coverage
91.8
8
4d ago
MedMCQA (test)
SAR
AUROC
71.7
6
4d ago
Location L4 Delayed
CatB-S1
Covariance
0.108
5
4d ago
Location L4 Overall
DL
Cov
82.2
5
4d ago
Location L3 Delayed
CatB-S1
Covariance
0.406
5
4d ago
Location L3 Overall
DL
Coverage
82.8
5
4d ago
Location L2 Delayed
XGB-S1
Covariance
0.426
5
4d ago
Location L2 Overall
DL
Coverage
85.4
5
4d ago
Showing 25 of 32 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Terms of Service
FAQs