Gender Bias Evaluation on RealWorldQuestioning Education Recommendations
[Chart: Shannon Entropy (T-stat). ChatGPT-4-turbo: 2 (May 24, 2025). Other selectable metrics: Shannon Entropy (p-value), CTTR (T-stat), CTTR (p-value), Maas (T-stat), Maas (p-value).]
Updated 1mo ago
Evaluation Results
| Method | Settings | Date | Shannon Entropy (T-stat) | Shannon Entropy (p-value) | CTTR (T-stat) | CTTR (p-value) | Maas (T-stat) | Maas (p-value) |
|---|---|---|---|---|---|---|---|---|
| ChatGPT-4-turbo | Iteration=1, Evaluatio... | 2025.05 | 2 | 0.04 | -0.87 | 0.38 | 0.44 | 0.66 |
| Llama-3 | Iteration=1, Evaluatio... | 2025.05 | -0.24 | 0.8 | 0.01 | 0.98 | -1.06 | 0.28 |
| DeepSeek-R1 | Iteration=1, Evaluatio... | 2025.05 | -1.17 | 0.24 | 0.86 | 0.38 | -1.84 | 0.06 |
| ChatGPT-3.5-turbo | Iteration=1, Evaluatio... | 2025.05 | -1.44 | 0.15 | -0.89 | 0.36 | -0.52 | 0.59 |
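
The page reports t-statistics and p-values for three lexical measures (Shannon entropy, CTTR, Maas) but does not show how they are computed. The sketch below illustrates the standard textbook definitions of those measures and a Welch two-sample t-statistic. It is not the benchmark's own code: the whitespace tokenization in the usage note, the natural-log base for Maas, the choice of Welch's test, and all function names are assumptions made for illustration.

```python
import math
from collections import Counter


def shannon_entropy(tokens):
    """Shannon entropy (in bits) of the token frequency distribution."""
    counts = Counter(tokens)
    n = len(tokens)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def cttr(tokens):
    """Corrected type-token ratio: unique tokens / sqrt(2 * total tokens)."""
    return len(set(tokens)) / math.sqrt(2 * len(tokens))


def maas(tokens):
    """Maas index: (log N - log V) / (log N)^2, natural log assumed.

    Lower values indicate more lexical diversity. Needs at least 2 tokens,
    otherwise log N is 0 and the division fails.
    """
    n, v = len(tokens), len(set(tokens))
    return (math.log(n) - math.log(v)) / math.log(n) ** 2


def welch_t(a, b):
    """Welch's t-statistic for two independent samples (assumed test type)."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    var_a = sum((x - mean_a) ** 2 for x in a) / (len(a) - 1)
    var_b = sum((x - mean_b) ** 2 for x in b) / (len(b) - 1)
    return (mean_a - mean_b) / math.sqrt(var_a / len(a) + var_b / len(b))
```

To use it, one would split each model response into tokens (for example with `response.split()`), compute a measure per response, and pass the two groups of scores to `welch_t`. If SciPy is available, `scipy.stats.ttest_ind(a, b, equal_var=False)` returns both the statistic and the p-value.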