Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Evaluation on Stereotype (test)
Loading...
100
Stereotype Score
SFT
92.2312
94.2481
96.265
98.2819
Feb 8, 2026
Stereotype Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Stereotype Score
SFT
Base Model=Qwen2.5-7B-...
2026.02
100
SFT + OGPSA
Base Model=Qwen2.5-7B-...
2026.02
100
SFT-DPO + OGPSA
Base Model=Qwen2.5-7B-...
2026.02
100
SFT-DPO + General Data
Base Model=Llama3.1-8B...
2026.02
94.44
SFT + OGPSA
Base Model=Llama3.1-8B...
2026.02
92.53
Feedback
Search any
task
Search any
task