Stereotyping Evaluation on RealToxicityPrompts RTP-C Toxic
[Chart: Stereotype Fraction. GPT-4o: 8.2 (Jul 15, 2024). Metrics: Stereotype Fraction, Co-occurrence Bias, Stereotypical Association.]
Updated 1mo ago
Evaluation Results
Method             Label            Date     Stereotype Fraction  Co-occurrence Bias  Stereotypical Association
GPT-4o             Model=GPT-4o     2024.07  8.2                  0.593               0.352
GPT-4o-mini        Model=GPT-4o-m   2024.07  10.2                 0.598               0.337
Gemini-Pro         Model=Gem-Pro    2024.07  13.3                 0.61                0.317
Gemini-Flash       Model=Gem-Fl     2024.07  14.7                 0.785               0.371
Gemini-Flash-Lite  Model=Gem-Fl-Lt  2024.07  16.2                 0.647               0.33