Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Fairness Evaluation on CrowS-Pair En
Loading...
65.057
Stereotype Score
Full Precision
60.15756
61.42953
62.7015
63.97347
Jan 17, 2026
Stereotype Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Stereotype Score
Full Precision
Model=Llama-3.1-8B-Ins...
2026.01
65.057
AWQ-trust
Model=Llama-3.1-8B-Ins...
2026.01
64.58
AWQ
Model=Llama-3.1-8B-Ins...
2026.01
64.341
AWQ
Model=Gemma-7B-Instruc...
2026.01
62.552
Full Precision
Model=Qwen-2.5-7B-Inst...
2026.01
61.956
AWQ
Model=Qwen-2.5-7B-Inst...
2026.01
61.896
AWQ-trust
Model=Gemma-7B-Instruc...
2026.01
61.598
Full Precision
Model=Gemma-7B-Instruc...
2026.01
60.942
AWQ-trust
Model=Qwen-2.5-7B-Inst...
2026.01
60.346
Feedback
Search any
task
Search any
task