Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Privacy on Privacy
Loading...
100
RR
No Control
34.6256
51.5978
68.57
85.5422
Nov 4, 2024
RR
Updated 4d ago
Evaluation Results
Method
Method
Links
RR
No Control
Model=Llama2-13B (Chat)
2024.11
100
SAC Single-task
Model=Llama2-13B (Chat)
2024.11
100
SAC Multi-task
Model=Llama2-13B (Chat)
2024.11
100
SAC Single-task
Model=Qwen2-72B (Instr...
2024.11
99.29
SAC Multi-task
Model=Qwen2-72B (Instr...
2024.11
98.93
No Control
Model=Qwen2-72B (Instr...
2024.11
98.57
SAC Single-task
Model=Qwen2-7B (Instruct)
2024.11
93.93
SAC Multi-task
Model=Qwen2-7B (Instruct)
2024.11
87.5
No Control
Model=Qwen2-7B (Instruct)
2024.11
37.14
Feedback
Search any
task
Search any
task