Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary Safe/Unsafe Classification on ASSE-Safety (test)
Loading...
67.4
Accuracy
BraveGuard-Qwen3-Guard-8B
42.44
48.92
55.4
61.88
May 31, 2026
Accuracy
Recall
F1 Score
Updated 1d ago
Evaluation Results
Method
Method
Links
Accuracy
Recall
F1 Score
BraveGuard-Qwen3-Guard-8B
Backbone=Qwen3-Guard,...
2026.05
67.4
63.9
68.2
Llama3.1-8B-Instruct
Parameters=8B
2026.05
55.2
98.3
70.6
Qwen3-Guard
2026.05
48.2
15.8
25.1
NemoGuard
2026.05
43.4
30.3
36.9
Feedback
Search any
task
Search any
task