Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Generative AI Output Safety on HarmBench and AdvBench (test)
Loading...
82.88
Safe Rate
Reliable Consensus Sampling
-1.8072
20.1789
42.165
64.1511
Dec 31, 2025
Safe Rate
Abstention Rate
Latency
Updated 4d ago
Evaluation Results
Method
Method
Links
Safe Rate
Abstention Rate
Latency
Reliable Consensus Sampling
adversary_condition=f...
2025.12
82.88
0
24.26
Reliable Consensus Sampling
adversary_condition=f...
2025.12
81.49
0
22.21
Reliable Consensus Sampling
adversary_condition=AVG
2025.12
77.42
0
21.63
Reliable Consensus Sampling
adversary_condition=Co...
2025.12
67.9
0
18.42
Consensus Sampling
adversary_condition=f...
2025.12
20.39
71.79
22.6
Consensus Sampling
adversary_condition=f...
2025.12
15.83
72.52
26.73
Consensus Sampling
adversary_condition=AVG
2025.12
12.56
75.75
23.59
Consensus Sampling
adversary_condition=Co...
2025.12
1.45
82.95
21.43
Feedback
Search any
task
Search any
task