Out-of-Taxonomy Risk Detection on ProGuard Text-Image
Top result: ProGuard-7B, 60.25 F1 Score (%)
[Chart: F1 Score (%) over time; data point dated Dec 29, 2025]
Updated 3d ago
Evaluation Results
Method            Date      F1 Score (%)
ProGuard-7B       2025.12   60.25
ProGuard-3B       2025.12   50.21
GPT4o-mini        2025.12   42.99
Gemini2.5-Flash   2025.12   30.44