Relevance Assessment Label Alignment on Deep Learning 2020
[Chart: Cohen's Kappa (κ); best result 0.46 (query-contrastive), Apr 5, 2026. Selectable metrics: Cohen's Kappa (κ), MAE, Label Distribution (Categories 0 to 3).]
Updated 12d ago
Evaluation Results
| Method | Configuration | Date | Cohen's Kappa (κ) | MAE | Label Distribution (Category 0) | Label Distribution (Category 1) | Label Distribution (Category 2) | Label Distribution (Category 3) |
|---|---|---|---|---|---|---|---|---|
| query-contrastive | Model=gpt-oss-120b, q=... | 2026.04 | 0.46 | 0.45 | 48 | 39 | 10 | 4 |
| query | Model=gpt-oss-120b, q=1 | 2026.04 | 0.4 | 0.48 | 48 | 42 | 8 | 3 |
| TREC Topic (Baseline) | Model=TREC Human Judgm... | 2026.04 | 0.39 | 0.61 | 39 | 39 | 9 | 12 |
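For readers unfamiliar with the metrics reported above, the following is a minimal sketch of how Cohen's Kappa (κ), MAE, and a per-category label distribution could be computed from two lists of graded relevance labels (0 to 3). It assumes unweighted κ and a distribution expressed in percent; the page does not say which κ variant or which units the benchmark uses, and the function names and sample labels are hypothetical, not taken from the benchmark.

```python
from collections import Counter


def cohens_kappa(reference, predicted):
    """Unweighted Cohen's kappa between two equal-length label sequences.

    Assumption: the benchmark's exact kappa variant is not stated on the page;
    this is the plain (unweighted) definition.
    """
    if len(reference) != len(predicted) or not reference:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(reference)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(r == p for r, p in zip(reference, predicted)) / n
    # Chance agreement: product of the two raters' marginal label frequencies.
    ref_counts, pred_counts = Counter(reference), Counter(predicted)
    expected = sum(ref_counts[k] * pred_counts[k] for k in ref_counts) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


def mean_absolute_error(reference, predicted):
    """Mean absolute difference between numeric relevance grades."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("label sequences must be non-empty and equal in length")
    return sum(abs(r - p) for r, p in zip(reference, predicted)) / len(reference)


def label_distribution(labels, categories=(0, 1, 2, 3)):
    """Share of each category among the labels, in percent (assumed unit)."""
    counts = Counter(labels)
    return {c: 100.0 * counts[c] / len(labels) for c in categories}


# Hypothetical labels for eight query-document pairs (illustration only).
human_labels = [0, 0, 1, 1, 2, 3, 0, 1]
model_labels = [0, 1, 1, 1, 2, 2, 0, 0]

kappa = cohens_kappa(human_labels, model_labels)
mae = mean_absolute_error(human_labels, model_labels)
distribution = label_distribution(model_labels)
```

Unweighted κ treats every disagreement equally, whereas MAE penalizes a 0-versus-3 disagreement more than a 0-versus-1 one, which is one reason the two columns can rank methods differently.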