
Gender Bias Evaluation on RealWorldQuestioning Health Recommendations

Shannon Entropy (T-test Statistic): 0.75

ChatGPT-3.5-turbo

May 24, 2025
Updated 1mo ago

Evaluation Results

Method  Links
2025.05  0.75  0.45  -0.49  0.61  1.21  0.22
2025.05  -0.11  0.9  -0.06  0.94  -0.44  0.65
2025.05  -0.17  0.85  -1.07  0.28  -0.52  0.6
2025.05  -1.29  0.19  -0.1  0.91  -0.74  0.45