Safety Classification on OS Bench
Top result: CLUE, Recall 0.936
[Chart: Recall over time, data point dated Dec 31, 2024. Selectable metrics: Recall, Accuracy, F1 Score.]
Evaluation Results
| Method                          | Configuration (truncated) | Date    | Recall | Accuracy | F1 Score |
|---------------------------------|---------------------------|---------|--------|----------|----------|
| CLUE                            | Model Architecture=LLa... | 2024.12 | 0.936  | 0.862    | 0.871    |
| LAION-AI NSFW Detector          | Model Architecture=CLI... | 2024.12 | 0.416  | 0.609    | 0.515    |
| LAION-AI NSFW Detector          | Model Architecture=CLI... | 2024.12 | 0.399  | 0.609    | 0.505    |
| Q16                             | Model Architecture=CLI... | 2024.12 | 0.320  | 0.608    | 0.449    |
| Q16                             | Model Architecture=CLI... | 2024.12 | 0.297  | 0.625    | 0.441    |
| Stable Diffusion Safety Checker | Model Architecture=CLI... | 2024.12 | 0.264  | 0.622    | 0.410    |
| LLaVA Guard                     | Prompt=Default Prompt,... | 2024.12 | 0.261  | 0.612    | 0.401    |
| LLaVA Guard                     | Prompt=Modified Prompt... | 2024.12 | 0.243  | 0.599    | 0.377    |
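
The page does not say how Recall, Accuracy, and F1 Score are computed. As a minimal sketch, assuming the standard binary definitions with "unsafe" as the positive class (an assumption, not confirmed by the page), the three columns can be derived from confusion-matrix counts as follows. The function name and the example counts are hypothetical and are not taken from the benchmark.

```python
def classification_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Recall, accuracy, and F1 for a binary classifier.

    Assumes "unsafe" is the positive class (an assumption, not stated on the page).
    tp/fp/fn/tn are true-positive, false-positive, false-negative, true-negative counts.
    """
    total = tp + fp + fn + tn
    # Recall: share of truly unsafe items that were flagged.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Precision: share of flagged items that were truly unsafe (needed for F1).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Accuracy: share of all items classified correctly.
    accuracy = (tp + tn) / total if total else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"recall": recall, "accuracy": accuracy, "f1": f1}


# Hypothetical counts for illustration only.
example = classification_metrics(tp=80, fp=10, fn=20, tn=90)
print(example)
```

Under these definitions a method can have moderate accuracy but low recall when it correctly passes most safe items yet misses many unsafe ones, which is one possible reading of the gap between the Accuracy and Recall columns above.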