| Dataset Name | SOTA Method | Metric | Value | Count | Updated |
|---|---|---|---|---|---|
| Harmful Prompts Curated April 13, 2023 | | Bad Bot Rate | 0 | 61 | 4d ago |
| curated dataset (test) | | Bad Bot Rate | 0 | 11 | 4d ago |
| Synthetic dataset (held-out) | | Good Bot Rate | 100 | 8 | 4d ago |
| sexual-content prompts | gpt-5-thinking | Non-Unsafe Rate | 99.5 | 4 | 4d ago |
| abuse, disinformation, hate prompts | gpt-5-thinking | Not Unsafe Rate | 99.9 | 4 | 4d ago |
| violence prompts | gpt-5-thinking | Non-Unsafe Rate | 99.9 | 4 | 4d ago |
| illicit non-violent crime prompts | gpt-5-thinking | Not Unsafe Rate | 99.5 | 4 | 4d ago |