Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Response Harmfulness Detection on SPA-VL-Eval

74.73F1 Score

GuardReasoner-Omni 2B

51.683657.666863.6569.6332Feb 3, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
74.73
2026.02
72.62
2026.02
72.13
2026.02
72.01
2026.02
52.57