Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General LLM Evaluation on XSTest
Loading...
23.1
Refusal Rate
MTSA-T3
22.664
25.607
28.55
31.493
May 22, 2025
Refusal Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Refusal Rate
MTSA-T3
Target Model=Zephyr-7B...
2025.05
23.1
MTSA-T3
Target Model=Llama2-7B...
2025.05
25.2
Baseline
Target Model=Zephyr-7B...
2025.05
28.3
Baseline
Target Model=Llama2-7B...
2025.05
34
Feedback
Search any
task
Search any
task