| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| 20-task Model Merging Benchmark (14-task + EMNIST, CIFAR10, Food101, FashionMNIST, RenderedSST2, KMNIST) | Fine-tuned | Avg Absolute Accuracy | 94.7 | 30 | 4d ago |
| MNIST multi-task (test) | MuDSC_Zip | Accuracy | 94.62 | 9 | 4d ago |
| TALL-20 (test) | | Accuracy | 93.5 | 8 | 4d ago |
| TALL-14 (test) | | Accuracy | 93.4 | 8 | 4d ago |
| TA-8 (test) | | Accuracy | 94.3 | 8 | 4d ago |
| Multi-Fashion MNIST (test) | DSelect-k | Accuracy 1 | 83.78 | 7 | 4d ago |
| Multi-MNIST (test) | | Task 1 Accuracy | 92.61 | 7 | 4d ago |