Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Confidence Estimation on MSWML OOD
Loading...
61.2
AURC
TLA
31.872
39.486
47.1
54.714
Feb 16, 2024
AURC
Updated 1mo ago
Evaluation Results
Method
Method
Links
AURC
TLA
2024.02
61.2
PLA
2024.02
60.3
aMSP
2024.02
60
aNE
2024.02
59.5
PLA*
Tuning Proportion=50%
2024.02
58.2
PLA*
Tuning Proportion=10%
2024.02
56.8
MMMC
2024.02
54.4
AEF+SDC
Tuning Proportion=10%
2024.02
46.9
AEF
Tuning Proportion=10%
2024.02
46.5
AEF
Tuning Proportion=50%
2024.02
40.4
AEF+SDC
Tuning Proportion=50%
2024.02
37.4
SDC
2024.02
33.8
Oracle
2024.02
33
Feedback
Search any
task
Search any
task