Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Stereotyping Evaluation on DialogSum
Loading...
5.6
Stereotype Fraction
GPT-4o-mini
5.468
6.359
7.25
8.141
Jul 15, 2024
Stereotype Fraction
Co-occurrence Bias
Stereotypical Association
Updated 1mo ago
Evaluation Results
Method
Method
Links
Stereotype Fraction
Co-occurrence Bias
Stereotypical Association
GPT-4o-mini
Model=GPT-4o-m
2024.07
5.6
0.543
30.9
Gemini-Flash-Lite
Model=Gem-Fl-Lt
2024.07
7.2
0.504
30
Gemini-Flash
Model=Gem-Fl
2024.07
8.3
0.413
29
GPT-4o
Model=GPT-4o
2024.07
8.9
0.559
29.6
Gemini-Pro
Model=Gem-Pro
2024.07
8.9
0.496
30.5
Feedback
Search any
task
Search any
task