| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text Classification | AgNews | Accuracy95 | 119 | |
| Text Classification | AGNEWS | Clean Accuracy95.2 | 118 | |
| Backdoor Defense | AgNews | Attack Success Rate2.03 | 105 | |
| Text Classification | AGNews | Accuracy94 | 61 | |
| Text Classification | AGNews synthetic noise (test) | Accuracy94.05 | 50 | |
| Topic Classification | AGNEWS | FA Score87.3 | 48 | |
| Text Classification | AGNews | Accuracy95.68 | 43 | |
| Short Text Clustering | AgNews | ACC88.2 | 38 | |
| Topic Classification | AGNews | Macro-F188.07 | 30 | |
| Text Classification | AGNews (val) | Top-1 Acc93.8 | 30 | |
| Text Anomaly Detection | AGNews | AUPRC93.52 | 25 | |
| Text Classification | AGNews | Accuracy93.6 | 24 | |
| Text Classification | AGNews 4 classes symmetric noise e=0.4 (test) | Accuracy91.63 | 24 | |
| Active Testing | AGNews | Estimation Error AUC0.0015 | 18 | |
| Topic Classification | AGNEWS | Accuracy (Acc)91.3 | 18 | |
| Attribution Faithfulness | AGNews | Faithfulness Score41.7 | 18 | |
| Backdoor Sample Detection | Agnews | AU-ROC99.93 | 16 | |
| Multiple Choice Classification | AGNews | Accuracy84.5 | 16 | |
| Topic classification | AGNEWS | Clean Acc94.4 | 16 | |
| Topic Classification and Text Generation | AGNews (test) | PPL (Output)16.94 | 16 | |
| Topic Classification | AGNews | Clean Acc Change (Abs %)-4.1 | 16 | |
| Text Classification | AGNews | Macro-F188.68 | 15 | |
| Text Classification | AGNews (test) | Accuracy (Clean)95.5 | 15 | |
| Multi-class Classification | AGNews IID | Accuracy94.24 | 14 | |
| Topic Classification | AGNEWS (test) | Hit Score (HS)56.9 | 14 |