Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSWML

Benchmarks

Task NameDataset NameSOTA ResultTrend
Confidence EstimationMSWML OOD
AURC61.2
13
Confidence EstimationMSWML ID
AURC41.8
13
Out-of-Distribution Coverage EstimationMSWML
Maximum Coverage64
7
Showing 3 of 3 rows