| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language model detoxification | RealToxicityPrompts (test) | Distinct-191.3 | 54 | |
| Toxicity Mitigation | RealToxicityPrompts challenging | Avg Toxicity (Max)6.2 | 46 | |
| Detoxification | RealToxicityPrompts challenging | Max Toxicity0.062 | 32 | |
| Toxicity Evaluation | RealToxicityPrompts | Toxicity Score0 | 29 | |
| Toxicity Mitigation | REALTOXICITYPROMPTS | Toxicity21.24 | 24 | |
| Detoxification | RealToxicityPrompts | Avg Max Toxicity0.27 | 22 | |
| Spoofing attack traceability | RealToxicityPrompts (test) | AUC90.11 | 20 | |
| Toxicity evaluation | RealToxicityPrompts 1K non-toxic prompts, 1K toxic prompts | Count of Non-Toxic Samples5 | 14 | |
| Toxicity Mitigation | RealToxicityPrompts (test) | Full Toxicity10.1 | 14 | |
| Toxicity Generation | RealToxicityPrompts (test) | Perspective API Score9.2 | 12 | |
| Toxicity Analysis | RealToxicityPrompts Nontoxic | Exp. Max. Toxicity0.22 | 10 | |
| Controlled Text Generation | RealToxicityPrompts 10K nontoxic prompts | Avg Max Toxicity30.2 | 9 | |
| Toxic Language Suppression | RealToxicityPrompts 10K nontoxic prompts GPT2-large generation (test) | Max Toxicity0.172 | 7 | |
| Detoxification | REALTOXICITYPROMPTS (test) | Toxicity Score (Avg)0.081 | 5 | |
| Toxic Output Mitigation | RealToxicityPrompts 1.0 (Toxic) | Toxicity0.299 | 5 | |
| Toxic Output Mitigation | RealToxicityPrompts 1.0 (Random) | Toxicity0.122 | 5 | |
| Open-ended generation | RealToxicityPrompts Non-toxic prompts (test) | Toxicity Probability7.38 | 4 | |
| Toxicity avoidance | RealToxicityPrompts | Avg Max Toxicity Score0.265 | 4 | |
| Toxicity Generation | RealToxicityPrompts 100k prompts | Toxicity Score (Basic)10.4 | 4 | |
| Safety Evaluation | RealToxicityPrompts (test) | Safety Score96 | 3 | |
| Toxicity Evaluation | RealToxicityPrompts responses | Classifier Score21 | 3 | |
| Open-ended generation | RealToxicityPrompts Toxic prompts (test) | Toxicity Probability74.29 | 2 | |
| Text-to-image generation | REALTOXICITYPROMPTS | Inappropriate Probability10 | 2 |