Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Content Moderation on UnsafeBench
Loading...
76.7
Accuracy
Tool-MCoT
47.476
55.063
62.65
70.237
Mar 15, 2026
Accuracy
F1-Score
Updated 9d ago
Evaluation Results
Method
Method
Links
Accuracy
F1-Score
Tool-MCoT
Fine-tuning Protocol=GRPO
2026.03
76.7
77.3
Tool-MCoT
Fine-tuning Protocol=R...
2026.03
74.3
75.7
Gemma3-4B-it
Fine-tuning Protocol=LoRA
2026.03
70.9
74.1
Gemma3-4B-it
Fine-tuning Protocol=Base
2026.03
48.6
52.3
Feedback
Search any
task
Search any
task