Stereotyping Evaluation on DecodingTrust Stereotype (DTS)
[Chart: Stereotype Fraction. Highlighted result: Gemini-Pro, 10.7 (Jul 15, 2024). Metrics shown: Stereotype Fraction, Co-occurrence Bias, Stereotypical Association, Toxic Fraction.]
Updated 1mo ago
Evaluation Results
| Method | Setting | Date | Stereotype Fraction | Co-occurrence Bias | Stereotypical Association | Toxic Fraction |
|---|---|---|---|---|---|---|
| Gemini-Pro | Model=Gem-Pro | 2024.07 | 10.7 | 37.2 | 21.7 | - |
| GPT-4o | Model=GPT-4o | 2024.07 | 11.8 | 40.1 | 23.7 | - |
| GPT-4o-mini | Model=GPT-4o-m | 2024.07 | 23 | 46.6 | 30.1 | - |
| Gemini-Flash-Lite | Model=Gem-Fl-Lt | 2024.07 | 24.6 | 50.1 | 24.1 | - |
| Gemini-Flash | Model=Gem-Fl | 2024.07 | 28.4 | 38.2 | 24.5 | - |
| GPT-4o | - | 2024.07 | - | - | - | 0.4 |
| GPT-4o-m | - | 2024.07 | - | - | - | 1.3 |
| Gem-Fl | - | 2024.07 | - | - | - | 0.5 |
| Gem-Fl-Lt | - | 2024.07 | - | - | - | 1.2 |
| Gem-Pro | - | 2024.07 | - | - | - | 0.5 |