Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Selective Prediction on Banking77 ncal=6,468, delta=0.10, simulated confidence scores (test)

90.6Accuracy (alpha=0.15)

Clopper-Pearson + LTT

20.71238.8565775.144Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
90.625.243.46481.896.8
2026.03
90.128.847.265.581.297.6
2026.03
89.7-23.459.280.296.5
2026.03
88.321.93959.279.295.7
2026.03
87.8--52.277.395.7
2026.03
86.6--43.473.794.2
2026.03
23.4----43.4