| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | ToxiGen | Safety93.1 | 71 | |
| Toxicity Detection | ToxiGen | Score81.4 | 25 | |
| Toxicity Generation | ToxiGen | ToxiGen Score1,633 | 24 | |
| Toxicity Classification | Toxigen | Accuracy60.41 | 22 | |
| Harmlessness | Toxigen | Toxigen (%)100 | 17 | |
| Detoxification | ToxiGen (test) | MTV97.4 | 16 | |
| Influence Estimation | ToxiGen (test) | Spearman Correlation0.44 | 14 | |
| Bias Detection | Toxigen (test) | Accuracy90.3 | 12 | |
| Safety Evaluation | ToxiGen Pretrained Evaluation | Toxicity Rate14.53 | 12 | |
| Toxicity Detection | TOXIGEN (val) | AUC96 | 8 | |
| Misuse Detection | ToxiGen Homophobia (external) | TPR98 | 1 | |
| Misuse Detection | ToxiGen Ethnoracial (external) | TPR91 | 1 | |
| Detoxification Dataset Quality Evaluation | ToxiGen 500 neutral-toxic pairs | Overall O.2.475 | 1 |