| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Text Classification | AgNews | Accuracy | 95 | 119 |
| Text Classification | AGNEWS | Clean Accuracy | 95.2 | 118 |
| Backdoor Defense | AgNews | Attack Success Rate | 2.35 | 81 |
| Text Classification | AGNews synthetic noise (test) | Accuracy | 94.05 | 50 |
| Short Text Clustering | AgNews | ACC | 88.2 | 38 |
| Text Classification | AGNews (val) | Top-1 Acc | 93.8 | 30 |
| Text Classification | AGNews | Accuracy | 95.68 | 28 |
| Text Anomaly Detection | AGNews | AUPRC | 93.52 | 25 |
| Text Classification | AGNews 4 classes symmetric noise e=0.4 (test) | Accuracy | 91.63 | 24 |
| Attribution Faithfulness | AGNews | Faithfulness Score | 41.7 | 18 |
| Backdoor Sample Detection | Agnews | AU-ROC | 99.93 | 16 |
| Multiple Choice Classification | AGNews | Accuracy | 84.5 | 16 |
| Topic classification | AGNEWS | Clean Acc | 94.4 | 16 |
| Topic Classification and Text Generation | AGNews (test) | PPL (Output) | 16.94 | 16 |
| Topic Classification | AGNews | Clean Acc Change (Abs %) | -4.1 | 16 |
| Text Classification | AGNews (test) | Accuracy (Clean) | 95.5 | 15 |
| Topic Classification | AGNEWS (test) | Hit Score (HS) | 56.9 | 14 |
| Topic Modeling | AgNews | Diversity | 100 | 14 |
| Backdoor Defense | AGNews (test) | Delta CACC | 5.72 | 12 |
| Embedding Inversion | AGNEWS (test) | RougeL | 12.71 | 12 |
| Controllable Text Generation | AGNews (test) | Output Perplexity (O-PPL) | 16.94 | 12 |
| OOD Detection | AGNews | AUROC | 0.988 | 12 |
| Open-set selective classification | AGNews (test) | AUAC | 94.8 | 12 |
| Topic Control | AGNews (test) | Avg Topic Accuracy | 97.8 | 11 |
| Text Classification | AGNews (4 classes) symmetric noise, e=0.3 (test) | Accuracy | 87.26 | 11 |