Confidence Estimation on MSWML ID
[Chart: AURC by date. Plotted point: aNE, 41.8 (Feb 16, 2024).]
Evaluation Results
Method     Details                  Date      AURC
aNE        -                        2024.02   41.8
aMSP       -                        2024.02   41.4
TLA        -                        2024.02   41.2
PLA        -                        2024.02   40.8
PLA*       Tuning Proportion=50%    2024.02   39.5
MMMC       -                        2024.02   39.4
PLA*       Tuning Proportion=10%    2024.02   38.3
AEF+SDC    Tuning Proportion=10%    2024.02   36.8
AEF        Tuning Proportion=10%    2024.02   36.5
AEF        Tuning Proportion=50%    2024.02   33
AEF+SDC    Tuning Proportion=50%    2024.02   31.6
SDC        -                        2024.02   28.2
Oracle     -                        2024.02   26.2