Privacy Violation Detection on CMPL Insurance Single-turn (test)
[Figure: chart of rbypass and UT; plotted entry: NeuroFilter, Jan 21, 2026]
Evaluation Results
Method        Setting                    Date      rbypass   UT
NeuroFilter   Model Scale=7B Model,...   2026.01   0         0
SAE-based     Model Scale=7B Model,...   2026.01   6         96
Llama Guard   Model Scale=7B Model,...   2026.01   100       0