Bias Evaluation on PhysicalAppearance
[Chart: Mean Improvement. Top entry: iPASa, 0.07 (Sep 25, 2025). Metrics shown: Mean Improvement, 95% CI Lower Bound, P-Value.]
Updated 16d ago
Evaluation Results
| Method | Configuration | Date | Mean Improvement | 95% CI Lower Bound | P-Value |
|---|---|---|---|---|---|
| iPASa | Model=DeepSeek-R1-Dist... | 2025.09 | 0.07 | 0.06 | 0 |
| iPASwo | Model=DeepSeek-R1-Dist... | 2025.09 | 0.07 | 0.05 | 0 |
| iPASwo | Model=Llama-3.1-8B-Ins... | 2025.09 | 0.06 | 0.05 | 0 |
| iPASa | Model=Llama-3.1-8B-Ins... | 2025.09 | 0.05 | 0.04 | 0 |
| iPASa | Model=Nous-Hermes-2-Mi... | 2025.09 | 0.04 | 0.03 | 0 |
| PASf | Model=Nous-Hermes-2-Mi... | 2025.09 | 0.04 | 0.03 | 0 |
| iPASwo | Model=Nous-Hermes-2-Mi... | 2025.09 | 0.03 | 0.02 | 0 |
| PASf | Model=Llama-3.1-8B-Ins... | 2025.09 | 0.03 | 0.01 | 0 |
| PASf | Model=DeepSeek-R1-Dist... | 2025.09 | 0.02 | 0 | 0.01 |
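
The page does not state how Mean Improvement, the 95% CI Lower Bound, or the P-Value are computed. As a rough illustration only, the sketch below shows one common way such figures could be derived from paired per-item scores, using a normal approximation. The function name `paired_improvement_stats` and the sample scores are hypothetical and are not taken from the benchmark.

```python
import math
from statistics import NormalDist, mean, stdev


def paired_improvement_stats(baseline, treated, confidence=0.95):
    """Summarize per-item differences (treated - baseline).

    Assumption: a normal approximation is used; the benchmark's actual
    procedure is not documented on the page.
    Returns (mean_improvement, ci_lower_bound, one_sided_p_value).
    """
    if len(baseline) != len(treated) or len(baseline) < 2:
        raise ValueError("need two equal-length samples with at least 2 items")

    diffs = [t - b for b, t in zip(baseline, treated)]
    mean_improvement = mean(diffs)
    std_error = stdev(diffs) / math.sqrt(len(diffs))

    # Lower end of the two-sided confidence interval around the mean.
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2)
    ci_lower = mean_improvement - z_crit * std_error

    # One-sided p-value for the null hypothesis "no improvement".
    if std_error == 0:
        p_value = 0.0 if mean_improvement > 0 else 1.0
    else:
        p_value = 1.0 - NormalDist().cdf(mean_improvement / std_error)

    return mean_improvement, ci_lower, p_value


# Made-up scores, for illustration only.
baseline_scores = [0.50, 0.60, 0.55, 0.40]
treated_scores = [0.60, 0.65, 0.70, 0.45]
improvement, lower, p = paired_improvement_stats(baseline_scores, treated_scores)
print(f"mean improvement={improvement:.4f}, CI lower={lower:.4f}, p={p:.4f}")
```

Under this reading, a row whose 95% CI Lower Bound is above 0 indicates an improvement that is unlikely to be due to chance, while a lower bound of 0 (as in the last row) marks a borderline result.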