| OpenAI Content Moderation | DPR | Average F1 Score83.1 | | 30 | 12d ago |
| MMDS (test) | LLaVAShield-7B | Accuracy95.76 | | 27 | 1mo ago |
| Aegis 2.0 | Qwen3Guard-4B-Gen-strict | F1 Score86.1 | | 21 | 12d ago |
| Multi-category Chinese content moderation dataset 1.0 (test) | CARO (Llama3-8B) | Politics Accuracy89.7 | | 15 | 5d ago |
| Fine-grained Moderation Dataset | CHAIRO (Ours) | Average F189.2 | | 11 | 5d ago |
| UnsafeBench Sexual category (test) | KidsNanny (Stage 1+2) | Accuracy81.4 | | 8 | 1mo ago |
| AIR-Bench Text + Image (test) | Aetheria | Precision83 | | 8 | 1mo ago |
| AIR-Bench Image Only (test) | | Precision94 | | 8 | 1mo ago |
| AIR-Bench Text Only (test) | | Precision94 | | 8 | 1mo ago |
| Hateful Memes seen (test) | PALI-X-VPD (specialist w/ CoT) | AUC-ROC89.2 | | 7 | 1mo ago |
| ILION-Bench v2 (test) | ILION Gate | Accuracy86.1 | | 4 | 1mo ago |
| Production Traffic (live traffic) | Two-Stage Cascade | Refusal Rate (Prod)3.6 | | 4 | 1mo ago |
| Laion5B UnsafeBench (test) | GGuard | Hate72 | | 4 | 1mo ago |
| Content Moderation Korean (test) | CulturePark | Abusive Rate64.7 | | 4 | 1mo ago |
| Business Content Moderation (evaluation set) | Xuanwu VL-2B | Ad Recall99.38 | | 3 | 18d ago |
| Content Moderation Parallel Fan-Out 4/5 Down M6 | ReAct | LLM Calls10 | | 3 | 1mo ago |
| Content Moderation Parallel Fan-Out Cascading M5 | ReAct | LLM Calls9 | | 3 | 1mo ago |
| Content Moderation Parallel Fan-Out Multi Down M4 | ReAct | LLM Calls9 | | 3 | 1mo ago |
| Content Moderation Parallel Fan-Out Toxicity Risk M3 | ReAct | LLM Calls6 | | 3 | 1mo ago |
| Content Moderation Parallel Fan-Out M2: Image Down | ReAct | LLM Calls7 | | 3 | 1mo ago |
| Content Moderation Parallel Fan-Out M1: Happy Path | ReAct | LLM Calls6 | | 3 | 1mo ago |
| Content Moderation Chinese (test) | SeaLLM | Bias23.7 | | 3 | 1mo ago |
| Toxic-Chat Out-of-Distribution | CARO | Pornography Score60.8 | | 2 | 5d ago |
| OpenAI Out-of-Distribution | CARO | Pornography Score82.6 | | 2 | 5d ago |
| Aegis In-Distribution | CARO | Pornography Score75 | | 2 | 5d ago |