Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Social Science Measurement on Humicroedit
Loading...
0.128
Expected Calibration Error (ECE)
BERT
0.12232
0.16066
0.199
0.23734
May 12, 2026
Expected Calibration Error (ECE)
Brier Score
Updated 21d ago
Evaluation Results
Method
Method
Links
Expected Calibration Error (ECE)
Brier Score
BERT
distillation=soft label
2026.05
0.128
0.248
GPT-5-nano
Verbal=true
2026.05
0.27
0.32
Feedback
Search any
task
Search any
task