| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Classification | Fashion (test) | Accuracy99.35 | 51 | |
| Next-Item Recommendation | Fashion Amazon (test) | HR@100.661 | 29 | |
| Density Estimation | Fashion (test) | NLL (bits/dim)2.803 | 27 | |
| Multi-view Clustering | Fashion | ACC99.2 | 25 | |
| Clustering | Fashion (full) | ACC67.2 | 24 | |
| Image Generation | Fashion (test) | FID10.3 | 16 | |
| Sequential Recommendation | Fashion | Recall17.67 | 14 | |
| Classification | fashion | F1 Macro89.91 | 12 | |
| Classification | fashion | Accuracy91.35 | 12 | |
| Multi-view Clustering | Fashion V=3 N=10000 | ACC99.74 | 11 | |
| kNN | Fashion | Throughput (QPS)40,982 | 10 | |
| Predicting Generalization | Fashion PGDL (train test) | CMI2.35 | 10 | |
| Unsupervised Online Label Shift | Fashion Square shift | Classification Error4.1 | 9 | |
| Unsupervised Online Label Shift | Fashion Monotone shift | Classification Error5.1 | 9 | |
| Unsupervised Online Label Shift | Fashion (holdout) | Classification Error (Ber)3.5 | 9 | |
| Unsupervised Online Label Shift | Fashion | Classification Error (Ber)3.7 | 9 | |
| Text-to-Image Retrieval | Fashion200K | Recall@1026.1 | 8 | |
| Recommendation | Fashion (test) | R@100.661 | 8 | |
| Online Label Shift | Fashion Bernoulli shift | Average Error7.69 | 7 | |
| Online Label Shift | Fashion Linear shift | Average Error7.84 | 7 | |
| Online Label Shift Adaptation | Fashion Square shift (test) | Average Error7.73 | 7 | |
| Online Label Shift Adaptation | Fashion Sine shift (test) | Average Error (%)9.32 | 7 | |
| Classification | fashion | Balanced Acc91.95 | 7 | |
| Active Learning Classification | Fashion | Total Regret118 | 6 | |
| Image-based recommendation | Fashion dataset (test) | NDCG@100.184 | 6 |