Social Science Measurement on Formality
[Chart: Expected Calibration Error (ECE). Highlighted point: BERT, 0.175, May 12, 2026. Alternate metric view: Brier Score.]
Updated 21d ago
Evaluation Results
Method                                     ECE      Brier Score
BERT (distillation=soft label, 2026.05)    0.175    0.26
GPT-5-nano (Verbal=true, 2026.05)          0.384    0.377
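
The page reports Expected Calibration Error (ECE) and Brier Score without stating how they are computed. The sketch below shows one common way to compute both, assuming binary labels and predicted probabilities for the positive class. The function names, the equal-width binning, the default of 10 bins, and the toy data are all assumptions for illustration; the benchmark's actual evaluation setup is not given here and may differ. Lower is better for both metrics.

```python
def brier_score(probs, labels):
    """Mean squared difference between predicted probability and the 0/1 label."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(probs)


def expected_calibration_error(probs, labels, n_bins=10):
    """Equal-width-bin ECE for binary predictions (assumed variant).

    Predictions are grouped by predicted probability; within each bin the
    gap between the mean predicted probability and the observed fraction of
    positives is taken, then averaged with weights proportional to bin size.
    """
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[idx].append((p, y))

    total = len(probs)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue  # empty bins contribute nothing
        mean_conf = sum(p for p, _ in bucket) / len(bucket)
        frac_pos = sum(y for _, y in bucket) / len(bucket)
        ece += (len(bucket) / total) * abs(frac_pos - mean_conf)
    return ece


# Hypothetical toy data, not taken from the benchmark.
toy_probs = [0.9, 0.8, 0.3, 0.6]
toy_labels = [1, 1, 0, 0]
print("Brier:", brier_score(toy_probs, toy_labels))
print("ECE:  ", expected_calibration_error(toy_probs, toy_labels))
```

Under this definition, ECE depends on the binning choice while the Brier Score does not, so the two metrics can rank methods differently.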