Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Binary Safe/Unsafe Classification on R-Judge (test)

57.8Accuracy

BraveGuard-Qwen3-Guard-8B

39.91244.55649.253.844May 31, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
57.891.269.7
2026.05
54.440.648.5
2026.05
53.710069.5
40.65.59