
Stereotyping Evaluation on Counterfactual Open-Ended (OCF)

2.5 Stereotype Fraction

GPT-4o-mini

2.2923.6965.16.504
Jul 15, 2024
Updated 1mo ago

Evaluation Results

Method    Links
2024.07   2.5   62     30.2
2024.07   3.1   44.3   23.1
2024.07   4.3   38.9   25.5
2024.07   5.6   53.1   29.5
2024.07   7.7   48.7   28.1