Share your thoughts, 1 month free Claude Pro on usSee more

Confidence Estimation on TL;DR

0.421Rank Correlation (RK)

Verbalized Confidence

Updated 2mo ago

Evaluation Results

Method	Links
Verbalized Confidence 2026.05		0.421	58.05
Predictive Probability 2026.05		0.4094	58.93
Random Annotator 2026.05		0.4012	60.34
Predictive Probability 2026.05		0.4006	59.24
Simulated Annotators 2026.05		0.3975	60.58
Random Annotator 2026.05		0.3964	60.63
Simulated Annotators 2026.05		0.3931	60.68
Learning Confidence (Vanilla) 2026.05		0.3912	61.34
Learning Confidence (Vanilla) 2026.05		0.364	63.21
Learning Confidence (Ours) 2026.05		0.3461	65.39
Learning Confidence (Ours) 2026.05		0.3196	67.6