Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Response Generation on AdvBench
Loading...
0.95
Win Rate
Chain-of-Thought
0.9188
0.9269
0.935
0.9431
Jun 26, 2025
Win Rate
Tie Rate
Loss Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
Tie Rate
Loss Rate
Chain-of-Thought
Main Model=Vicuna, Inf...
2025.06
0.95
0.04
0.01
Best-of-N
Main Model=Vicuna, Inf...
2025.06
0.95
0.03
0.02
Multi-Agent Debate
Main Model=Vicuna, Inf...
2025.06
0.92
0.06
0.02
Feedback
Search any
task
Search any
task