Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Safety Evaluation on Image input safety evaluation set

98.6Hate Safety Acc

gpt-5-thinking-nano

92.46494.05795.6597.243Dec 19, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
98.697.398.698.693.996.3
98.498.498.299.599.499.8
97.198.298.698.698.799.2
92.79595.693.992.797.8