| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multilabel Classification | Gpt4o 0.7 | Weighted F1 0.474 | 3 |
| Binary Classification | Gpt4o 0.7 | Weighted F1 82.9 | 3 |
| Multilabel Classification | Gpt4o 0.5 | Weighted F1 0.489 | 3 |
| Binary Classification | Gpt4o 0.5 | Weighted F1 83.2 | 3 |
| Multiclass Classification | Gpt4o 0.7 | Weighted F1 47.6 | 2 |
| Multiclass Classification | Gpt4o 0.5 | Weighted F1 0.481 | 2 |