| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text Classification | AG-NEWS | Accuracy 94.1 | 248 |
| Text Classification | AG News (test) | Accuracy 94.7 | 228 |
| Topic Classification | AG-News | Accuracy 95.58 | 225 |
| Topic Classification | AG News (test) | Accuracy 94.91 | 98 |
| Text Classification | AG NEWS RoBERTa-large (test) | CACC 95.6 | 44 |
| Membership Inference Attack | AG News (test) | AUC 0.909 | 43 |
| Counterfactual Generation | AG News | LFR 0.915 | 37 |
| Language Modeling | AG News | PPL 4.76 | 36 |
| News Classification | AG News (test) | Accuracy 91.7 | 34 |
| Multi-class text classification | AG News | Micro-F1 0.917 | 33 |
| Counterfactual Generation | AG News (test) | SLFR 98 | 29 |
| Text Adversarial Example Detection | Ag-News | TPR@10 100 | 28 |
| Language Modeling | AG News (val) | Perplexity 18.19 | 28 |
| Text Anomaly Detection | NLPAD-AGNews | AUROC 94.84 | 25 |
| Faithfulness Evaluation | AG-news (test) | Rate of Label Changes 2 | 24 |
| Adversarial Text Detection | AG News | F1 Score 96.7 | 24 |
| Concept Learning | AG News | Training Time 234 | 21 |
| Text embedding | AG News | t-value 54.63 | 20 |
| Text Classification | AG News 40 labels | Top-1 Error Rate 0.1067 | 19 |
| Adversarial Detection | AG News | F1 Score 95.7 | 18 |
| Short Text Clustering | AG News (test) | Accuracy 86.53 | 18 |
| Class-Conditional Language Generation | AG News | MAUVE (World) 0.963 | 16 |
| Real-time latency evaluation | AG-News | Latency (s) 7 | 15 |
| Topic Classification | AG News (test) | Badnets CACC 93.79 | 15 |
| Language Modeling | AG News (test) | Perplexity 52.09 | 14 |