Uncertainty Estimation on Shared (GQA, POPE, etc.) (test)
[Chart: ECE by method over time. Visible data point: PIK, ECE 0.001, May 11, 2026. Selectable metrics: ECE, Brier Score, AUCPR, AUROC.]
Updated 22d ago
Evaluation Results
Method  Setting                    Date     ECE    Brier Score  AUCPR  AUROC
PIK     Aggregation Protocol=p...  2026.05  0.001  0.001        -      -
SAPLMA  Aggregation Protocol=p...  2026.05  0.001  0.001        0.1    0.1
CCPS    Aggregation Protocol=p...  2026.05  0.001  0.001        0.1    0.1
II      Aggregation Protocol=p...  2026.05  0.007  0.001        0.1    0.1
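
For readers unfamiliar with the two calibration metrics reported above, the following is a minimal sketch of how ECE and Brier score are commonly computed from per-example confidences and binary correctness labels. It is not the benchmark's own evaluation code: the function names, the choice of 10 equal-width bins, and the toy inputs are all assumptions made for illustration, and the page's truncated "Aggregation Protocol" setting is not modeled here.

```python
from typing import Sequence


def brier_score(confidences: Sequence[float], labels: Sequence[int]) -> float:
    """Mean squared difference between confidence and the 0/1 correctness label."""
    n = len(confidences)
    return sum((c - y) ** 2 for c, y in zip(confidences, labels)) / n


def expected_calibration_error(
    confidences: Sequence[float],
    labels: Sequence[int],
    n_bins: int = 10,  # assumption: equal-width bins; the benchmark's binning is unknown
) -> float:
    """Weighted average gap between mean confidence and accuracy per bin."""
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for c, y in zip(confidences, labels):
        # Clamp so that a confidence of exactly 1.0 falls into the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx].append((c, y))

    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(y for _, y in members) / len(members)
        ece += (len(members) / n) * abs(avg_conf - accuracy)
    return ece


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the leaderboard.
    conf = [0.9, 0.9, 0.1, 0.1]
    correct = [1, 1, 0, 0]
    print("Brier:", brier_score(conf, correct))
    print("ECE:", expected_calibration_error(conf, correct))
```

Lower is better for both metrics, whereas AUCPR and AUROC measure how well the confidence ranks correct answers above incorrect ones, so higher is better there.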