Stereotype Bias Evaluation on StereoSet Overall
[Chart: LMS, SS, and ICAT over time, May 2022 to May 2026. Best LMS: 91.05 (DebiasRAG).]
Updated 16d ago
Evaluation Results
| Method | Details | Date | LMS | SS | ICAT |
|---|---|---|---|---|---|
| DebiasRAG | Base Model=GPT-2 | 2026.05 | 91.05 | 49.72 | 90.53 |
| original | Language Model=BERT | 2026.05 | 84.16 | 58.24 | 70.29 |
| ADEPT | Language Model=BERT | 2026.05 | 83.88 | 55.44 | 74.76 |
| DebiasRAG | Language Model=BERT | 2026.05 | 82.77 | 54.45 | 75.4 |
| original | Base Model=GPT-2 | 2026.05 | 82.51 | 57.6 | 70.02 |
| Davinci | - | 2022.05 | 77.6 | 60.8 | 60.8 |
| OPT-175B | - | 2022.05 | 74.8 | 59.9 | 60 |
| DPCE | Language Model=BERT | 2026.05 | 58.04 | 51.5 | 56.31 |
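
As a reading aid for the LMS, SS, and ICAT values listed above, here is a minimal sketch of how the three metrics relate. It assumes the standard StereoSet definition, ICAT = LMS × min(SS, 100 − SS) / 50. The page itself does not state the formula, and the function name `icat` and the choice of sample rows are illustrative only.

```python
def icat(lms: float, ss: float) -> float:
    """Idealized CAT score, assuming the standard StereoSet definition.

    lms: language modeling score, 0-100 (higher is better).
    ss:  stereotype score, 0-100 (50 is the unbiased ideal).
    """
    # SS is penalized symmetrically: 40 and 60 are equally far from 50.
    return lms * min(ss, 100.0 - ss) / 50.0


# A few (method, LMS, SS, reported ICAT) rows copied from the results above.
rows = [
    ("DebiasRAG (GPT-2)", 91.05, 49.72, 90.53),
    ("original (BERT)", 84.16, 58.24, 70.29),
    ("Davinci", 77.6, 60.8, 60.8),
    ("DPCE (BERT)", 58.04, 51.5, 56.31),
]

for name, lms, ss, reported in rows:
    print(f"{name}: computed ICAT {icat(lms, ss):.2f}, reported {reported}")
```

Under this assumption, a method can rank high on ICAT only by keeping LMS high while holding SS close to 50. This is why DPCE, with an SS of 51.5 but an LMS of 58.04, still ends up with the lowest ICAT. Small differences between computed and reported values are expected, since the listed LMS and SS figures are already rounded.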