Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Unsafe Content Categorization on OpenAI Moderation
Loading...
88.35
Accuracy
Gemini2.5-Flash
17.3804
35.8052
54.23
72.6548
Dec 29, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini2.5-Flash
2025.12
88.35
GPT4o-mini
2025.12
84.87
ProGuard-3B
2025.12
78.93
ProGuard-7B
2025.12
76.63
LlamaGuard4-12B
2025.12
69.92
LlamaGuard2-8B
2025.12
69.16
LlamaGuard3-11B-Vision
2025.12
69.16
LlamaGuard3-8B
2025.12
59.96
LlamaGuard-7B
2025.12
20.11
Feedback
Search any
task
Search any
task