| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Harmful content detection | CADD | Accuracy90.19 | 8 | |
| Harmful content detection | CADD DeepSeek generations | Accuracy60.13 | 4 | |
| Harmful content detection | CADD Llama-3.1 generations | Accuracy69.18 | 4 | |
| Harmful content detection | CADD GPT-4o generations | Accuracy64.84 | 4 | |
| Toxic-neutral pair quality evaluation | Translated CADD | Overall Score2.963 | 1 |