Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Stereotyping Evaluation on Counterfactual Open-Ended (OCF)
Loading...
2.5
Stereotype Fraction
GPT-4o-mini
2.292
3.696
5.1
6.504
Jul 15, 2024
Stereotype Fraction
Co-occurrence Bias
Stereotypical Association
Updated 1mo ago
Evaluation Results
Method
Method
Links
Stereotype Fraction
Co-occurrence Bias
Stereotypical Association
GPT-4o-mini
Model=GPT-4o-m
2024.07
2.5
62
30.2
Gemini-Pro
Model=Gem-Pro
2024.07
3.1
44.3
23.1
Gemini-Flash
Model=Gem-Fl
2024.07
4.3
38.9
25.5
Gemini-Flash-Lite
Model=Gem-Fl-Lt
2024.07
5.6
53.1
29.5
GPT-4o
Model=GPT-4o
2024.07
7.7
48.7
28.1
Feedback
Search any
task
Search any
task