| Dataset Name | SOTA Method | Metric | Value | | Updated |
|---|---|---|---|---|---|
| RealToxicityPrompts | | Toxicity Score | 0 | 29 | 4d ago |
| BOLD 23679 prompts (test) | DeStein | Avg Toxicity (Max) | 0.02 | 18 | 4d ago |
| AttaQ | RAD | Max Toxicity Score | 0.04 | 14 | 4d ago |
| BOLD | RAD | Avg Toxicity (Max) | 0.022 | 14 | 4d ago |
| AttaQ 1402 prompts (test) | RAD | Max Toxicity Score | 0.042 | 14 | 4d ago |
| RealToxicityPrompts 1K non-toxic prompts, 1K toxic prompts | | Count of Non-Toxic Samples | 5 | 14 | 4d ago |
| RealToxicity | Model Surgery | RealTox | 5.17 | 8 | 4d ago |
| RealToxicityPrompts responses | CodeGen-Multi-16B | Classifier Score | 21 | 3 | 4d ago |