| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Tweet Classification | TweetEval 1.0 (test) | Emoji (M-F1) 34.2 | 18 |
| Twitter Text Classification | TweetEval latest (test) | Emoji 0.297 | 9 |
| Text Classification | TweetEval Offensive (test) | Accuracy 69.14 | 8 |
| Text Classification | TweetEval Hate (test) | Accuracy 55.72 | 8 |
| Pruning | TweetEval T-Sentiment (test) | AU-MSE 1.23 | 8 |
| Pruning | TweetEval T-Hate (test) | AU-MSE 4.88 | 8 |
| Pruning | TweetEval T-Emotions (test) | AU-MSE 1.47 | 8 |
| Irony Detection | TweetEval irony (test) | Accuracy 84.18 | 7 |
| Detection | TweetEval offensive | Macro F1 68.3 | 6 |
| Detection | TweetEval irony | Macro F1 62.7 | 6 |
| Detection | TweetEval hate | Macro F1 61.2 | 6 |
| Detection | TweetEval stance-feminist (test) | Macro F1 41.3 | 6 |
| Safety Evaluation | TweetEval | F1 72 | 3 |
| Detection | TweetEval stance-atheism | Macro F1 27.4 | 3 |
| Detection | TweetEval stance-atheism (TW-A) (test) | Macro-F1 0.285 | 3 |
| Detection | TweetEval-offensive (Tw-O) (test) | Macro F1 56.3 | 3 |
| Detection | TweetEval-hate (Tw-H) (test) | Macro F1 59 | 3 |