| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Sentiment Classification | SST2 (test) | Accuracy 96 | 233 |
| Sentiment Classification | SST-2 | Accuracy 95.99 | 190 |
| Sentiment Analysis | SST-5 (test) | Accuracy 62.27 | 177 |
| Sentiment Analysis | SST-2 | Accuracy 97.48 | 165 |
| Text Classification | SST-2 | Accuracy 96.32 | 136 |
| Text Classification | SST-2 | Accuracy 97.09 | 133 |
| Sentiment Analysis | SST-5 | Accuracy 94.84 | 123 |
| Text Classification | SST-5 | Accuracy 62.31 | 119 |
| Classification | SST2 | Accuracy 96.3 | 102 |
| Sentiment Analysis | SST | Accuracy 100 | 75 |
| Text Classification | SST2 | Accuracy 97.36 | 71 |
| Text Classification | SST-5 (test) | Accuracy 56.21 | 60 |
| Sentiment Classification | SST-2 | Attack Generation Rate 99.67 | 58 |
| Sentiment Classification | SST-5 | Accuracy 70.67 | 52 |
| Text Classification | SST-1 | Accuracy 52.4 | 45 |
| Explaining LLMs | SST | CRR 17.13 | 42 |
| Sentiment Classification | SST (test) | Accuracy 93.8 | 37 |
| Text Reconstruction Attack | SST-2 | Total Runtime (hours) 0.1 | 36 |
| Text Classification | SST-2 | Harmful Score 55.7 | 35 |
| Sentiment Analysis | SST-2 | Accuracy 96.71 | 33 |
| Membership Inference Attack | SST | AUC 1 | 32 |
| Training Data Reconstruction | SST | ROUGE-1 1 | 32 |
| Sentiment Analysis | SST2 | Macro-F1 95.27 | 30 |
| Sentiment Analysis | SST5 | Macro-F1 45.92 | 30 |
| Sentiment Analysis | SST-2 | ACC 96 | 30 |