| Dataset Name | SOTA Method | Metric | Score | Trend | Updated |
|---|---|---|---|---|---|
| Aegis 2.0 | MCE | F1 Score | 80 | 10 | 4d ago |
| AIR-Bench Text + Image (test) | Aetheria | Precision | 83 | 8 | 4d ago |
| AIR-Bench Image Only (test) | | Precision | 94 | 8 | 4d ago |
| AIR-Bench Text Only (test) | | Precision | 94 | 8 | 4d ago |
| Hateful Memes seen (test) | PALI-X-VPD (specialist w/ CoT) | AUC-ROC | 89.2 | 7 | 4d ago |
| Production Traffic (live traffic) | Two-Stage Cascade | Refusal Rate (Prod) | 3.6 | 4 | 4d ago |
| Laion5B UnsafeBench (test) | GGuard | Hate | 72 | 4 | 4d ago |
| Content Moderation Korean (test) | CulturePark | Abusive Rate | 64.7 | 4 | 4d ago |
| Content Moderation Chinese (test) | SeaLLM | Bias | 23.7 | 3 | 4d ago |
| Lexica UnsafeBench (test) | GGuard | Hate Safety Score | 65 | 2 | 4d ago |
| Content Moderation Arabic (test) | CulturePark | Hate Accuracy | 55.8 | 2 | 4d ago |