Content Moderation on OpenAI Out-of-Distribution
[Chart: Pornography Score. Highlighted entry: CARO, 82.6 (Apr 12, 2026). Selectable metrics: Pornography Score, Violence Score, Bias Score, Harmlessness Score, Average Score.]
Updated 5d ago
Evaluation Results
Method               Details                    Date     Pornography Score  Violence Score  Bias Score  Harmlessness Score  Average Score
CARO                 backbone=Qwen2.5-7B-In...  2026.04  82.6               32.3            44.1        84                  74.2
Qwen2.5-7B-Instruct  model_status=Base Model    2026.04  75.3               23              42.1        81.7                70.8