Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Malicious Action Detection on WebArena
Loading...
7.46
TPR
GPT-OSS-Safeguard 20B
-0.184
1.8005
3.785
5.7695
Oct 3, 2025
TPR
FPR
Updated 1mo ago
Evaluation Results
Method
Method
Links
TPR
FPR
GPT-OSS-Safeguard 20B
2025.10
7.46
9.32
AprielGuard
2025.10
3.19
0.62
Qwen-3-Guard
2025.10
0.99
0.8
Granite-Guardian 3.3-8B
2025.10
0.11
0.06
Feedback
Search any
task
Search any
task