| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text Classification | 20 Newsgroups (test) | Accuracy 85.8 | 71 |
| Document Classification | 20 Newsgroups (test) | Accuracy 96.93 | 43 |
| Topic Modeling | 20 Newsgroups (test) | Perplexity 688 | 39 |
| News topic classification | 20 Newsgroups 40% Instance-Dependent Noise | Accuracy 85.02 | 24 |
| News topic classification | 20 Newsgroups 20% Instance-Dependent Noise | Accuracy 85.02 | 24 |
| News topic classification | 20 Newsgroups 40% Asymmetric Noise | Accuracy 85.02 | 24 |
| News topic classification | 20 Newsgroups 20% Asymmetric Noise | Accuracy 85.02 | 24 |
| News topic classification | 20 Newsgroups 40% Symmetric Noise | Accuracy 85.02 | 24 |
| News topic classification | 20 Newsgroups 20% Symmetric Noise | Accuracy 85.15 | 24 |
| 5-way few-shot text classification | 20 Newsgroups (test) | Accuracy 83.2 | 20 |
| Text Classification | 20 Newsgroups Dir(0.01) (test) | Accuracy 0.6441 | 17 |
| Text Classification | 20 Newsgroups Dir(0.5) (test) | Accuracy 70.93 | 17 |
| Deep Clustering | 20 Newsgroups (20NG) | SC 0.287 | 16 |
| Text Classification | 20 Newsgroups | ECE 2.09 | 10 |
| k-medoids clustering | 20 Newsgroups | Wall-clock Speedup 9.07 | 9 |
| Topic Classification | 20 Newsgroups (20n) original (test) | Accuracy 75.3 | 8 |
| Multi-label Classification | 20 Newsgroups | Rare F1 @ 30% 76.1 | 7 |
| Clustering | 20 Newsgroups | ARI 0.49 | 7 |
| Clustering | 20 Newsgroups | Hard Clustering Purity 43.48 | 5 |
| Topic Modeling | 20 Newsgroups | NPMI 0.413 | 4 |
| Data Valuation (Top-10 Identification) | 20 Newsgroups | Speedup vs STC 2.8 | 4 |
| Runtime Evaluation | 20 Newsgroups | Throughput (img/s) 2,600,000 | 4 |
| Out-of-distribution detection | 20 Newsgroups (test) | FPR@90 0.63 | 4 |