Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering Bias Evaluation on BBQ
Loading...
79
Accuracy (All)
Mistral-7B
30.12
42.81
55.5
68.19
Jul 18, 2024
Accuracy (All)
Accuracy (Gender)
Accuracy (Race)
Accuracy (Religion)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy (All)
Accuracy (Gender)
Accuracy (Race)
Accuracy (Religion)
Mistral-7B
Parameters=7B
2024.07
79
67
67
76
Phi-2 + BiasDPO
Parameters=2.7B, Align...
2024.07
65
68
87
69
Phi-2
Parameters=2.7B
2024.07
50
60
77
54
Gemma-2B
Parameters=2B
2024.07
36
36
30
27
StableLM-3B
Parameters=3B
2024.07
32
32
28
31
Feedback
Search any
task
Search any
task