
OOD safety category inference (Stage 2) on OpenAI Moderation

Mean Reward: 36.45

Gemini2.5-Flash

Dec 29, 2025
Updated 1mo ago

Evaluation Results

Method Links
36.45
29.05
28.31
2025.12
8.87