| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offensive Language Identification | OLID | Accuracy Drop-7 | 40 | |
| Offensive Language Identification | OLID (test) | CACC87.33 | 33 | |
| Text Classification | OLID (test) | Delta CACC1.79 | 18 | |
| Backdoor Defense | OLID (test) | ΔCACC2.21 | 12 | |
| Backdoor Trigger Detection | OLID | Precision0.43 | 10 | |
| Offensive language identification | OLID English Sub-task A 1.0 (test) | Macro F10.735 | 9 | |
| Offensive language target identification | OLID English 2019 (test) | Macro F155.7 | 5 | |
| Categorization of offensive language type | OLID English Sub-task B | Macro F10.619 | 5 | |
| Safety Evaluation | OLID | F1 Score73 | 3 | |
| Text Anomaly Detection | OLID | AUPRC0.1581 | 2 | |
| Text Classification | OLID | Delta ASR- | 0 | |
| Backdoor Defense | OLID | Delta CACC- | 0 |