Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Discriminatory Behaviour Detection on PKU-SafeRLHF
Loading...
96
Accuracy
Dual-agent
91.2
93.6
96
98.4
Dec 1, 2025
Accuracy
Precision
Recall
F1 Score
ROC AUC
Spearman Rho
Pearson r
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Precision
Recall
F1 Score
ROC AUC
Spearman Rho
Pearson r
Dual-agent
Backbone=DeepSeek-R1
2025.12
96
95.1
97
96
98.2
0.929
0.938
Feedback
Search any
task
Search any
task