Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Bias quantification on Us vs. Them Critical texts
Loading...
0.536
Pearson Correlation Coefficient (r)
mini_listwiseBT
0.4268
0.45515
0.4835
0.51185
Dec 16, 2025
Pearson Correlation Coefficient (r)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pearson Correlation Coefficient (r)
mini_listwiseBT
Model=GPT-5-mini, Rati...
2025.12
0.536
mini_direct
Model=GPT-5-mini, Rati...
2025.12
0.505
mini_listwiseElo
Model=GPT-5-mini, Rati...
2025.12
0.5
nano_listwiseBT
Model=GPT-5-nano, Rati...
2025.12
0.46
nano_listwiseElo
Model=GPT-5-nano, Rati...
2025.12
0.431
Feedback
Search any
task
Search any
task