Share your thoughts, 1 month free Claude Pro on usSee more

Confidence Estimation on Chatbot Arena

0.3524Rank Correlation (RK)

Verbalized Confidence

Updated 16d ago

Evaluation Results

Method	Links
Verbalized Confidence 2026.05		0.3524	64.71
Random Annotator 2026.05		0.3428	65.47
Predictive Probability 2026.05		0.3407	66.15
Simulated Annotators 2026.05		0.3376	65.91
Learning Confidence (Vanilla) 2026.05		0.2956	69.13
Random Annotator 2026.05		0.2812	72.04
Predictive Probability 2026.05		0.2708	72.93
Simulated Annotators 2026.05		0.2697	73.09
Learning Confidence (Ours) 2026.05		0.2658	72.01
Learning Confidence (Vanilla) 2026.05		0.2574	74.33
Learning Confidence (Ours) 2026.05		0.2253	77.49