| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Classification | Average across datasets | Base Score89.05 | 15 | |
| Disease diagnosis | Average across datasets | AUC89.11 | 15 | |
| Conformal Image Classification | Average across 11 datasets CLIP ViT-B/16 features (test) | Accuracy85.6 | 9 | |
| Image Classification | Average across 11 datasets (Aircraft, CIFAR10, CIFAR100, CUB200, DTD, Flower102, Food101, HAM10000, ImageNet, Resisc45, UCF101) | Avg ACC87.4 | 9 | |
| Average performance across 10 task types | Average across 13 datasets (test) | Avg. Accuracy75.8 | 8 |