Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

15 datasets

Benchmarks

Task NameDataset NameSOTA ResultTrend
Conformal Prediction15 datasets (average)
Coverage95.7
39
Image Classification15 datasets Average (test)
Average Robust Accuracy45.39
12
Zero-shot Image Classification15 datasets Zero-shot (test)
Robust Accuracy (PGD-20, eps=1/255)44.34
5
Showing 3 of 3 rows