Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MACE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Confidence CalibrationMACE (test)
AUROC81.2
84
Model CalibrationMACE
AUROC82.4
84
LLM CalibrationMACE
ECE8.6
60
MACE PredictionMACE Prediction n=350 (test)
AUROC81.8
1
Showing 4 of 4 rows