Gender Bias Evaluation on RealWorldQuestioning Jobs Recommendations
[Chart: Shannon Entropy (T-statistic) per model; ChatGPT-4-turbo: 1.44 (May 24, 2025)]
Updated 1mo ago
Evaluation Results
| Model | Method details | Date | Shannon Entropy (T-statistic) | Shannon Entropy (p-value) | CTTR (T-statistic) | CTTR (p-value) | Maas (T-statistic) | Maas (p-value) |
|---|---|---|---|---|---|---|---|---|
| ChatGPT-4-turbo | Iteration=1, Evaluatio... | 2025.05 | 1.44 | 0.14 | -0.27 | 0.78 | 0.11 | 0.9 |
| ChatGPT-3.5-turbo | Iteration=1, Evaluatio... | 2025.05 | 0.16 | 0.86 | 1.41 | 0.15 | -0.62 | 0.53 |
| Llama-3 | Iteration=1, Evaluatio... | 2025.05 | -0.52 | 0.6 | 0.58 | 0.55 | -0.95 | 0.33 |
| DeepSeek-R1 | Iteration=1, Evaluatio... | 2025.05 | -0.55 | 0.58 | 0.92 | 0.35 | -0.42 | 0.67 |
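
The page does not say how the three metrics or the test statistics are computed, so the following is only a minimal sketch under stated assumptions: Shannon entropy over token frequencies, CTTR as types / sqrt(2 × tokens), Maas as (log N − log V) / (log N)², and a Welch two-sample t-statistic comparing per-response scores between two prompt groups. The whitespace tokenizer, the function names, and the choice of Welch's test are all hypothetical and not taken from the benchmark.

```python
import math
from collections import Counter
from statistics import mean, variance


def tokenize(text):
    # Assumption: naive lowercase whitespace tokenization.
    return text.lower().split()


def shannon_entropy(tokens):
    # Entropy (in bits) of the token frequency distribution.
    counts = Counter(tokens)
    n = len(tokens)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def cttr(tokens):
    # Corrected type-token ratio: V / sqrt(2N).
    return len(set(tokens)) / math.sqrt(2 * len(tokens))


def maas(tokens):
    # Maas index: (log N - log V) / (log N)^2; lower means more diverse.
    # Requires more than one token so that log N is non-zero.
    n, v = len(tokens), len(set(tokens))
    return (math.log(n) - math.log(v)) / math.log(n) ** 2


def welch_t(a, b):
    # Welch's t-statistic for two independent samples
    # (assumed test; each sample needs at least two values).
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se
```

Under these assumptions, one would score each model response with a metric, split the scores by the gender signalled in the prompt, and pass the two lists to `welch_t`. Every p-value in the table is above 0.05, so none of the reported differences is statistically significant at the 5% level.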