Relevance Assessment Label Alignment on DL 2019
Best Cohen's Kappa (κ): 0.46 (query-contrastive)
[Chart: Cohen's Kappa (κ) by date; point shown: query-contrastive, Apr 5, 2026. Other plotted metrics: Mean Absolute Error (MAE), Label Distribution (0), (1), (2), (3).]
Updated 12d ago
Evaluation Results
| Method | Configuration | Date | Cohen's Kappa (κ) | Mean Absolute Error (MAE) | Label Distribution (0) | Label Distribution (1) | Label Distribution (2) | Label Distribution (3) |
|---|---|---|---|---|---|---|---|---|
| query-contrastive | Model=gpt-oss-120b, q=... | 2026.04 | 0.46 | 0.54 | 38 | 45 | 11 | 7 |
| query-contrastive | Model=gpt-oss-120b, q=... | 2026.04 | 0.44 | 0.54 | 39 | 45 | 12 | 5 |
| TREC Topic (Baseline) | Model=TREC Human Judgm... | 2026.04 | 0.41 | 0.67 | 31 | 40 | 13 | 16 |
| query | Model=gpt-oss-120b, q=1 | 2026.04 | 0.37 | 0.58 | 40 | 49 | 8 | 3 |
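For readers unfamiliar with the reported metrics, the following is a minimal sketch of how Cohen's Kappa, MAE, and a label distribution can be computed for two sets of graded relevance labels on a 0 to 3 scale. It is an illustration under assumptions, not the benchmark's own scoring code: the page does not say whether κ is unweighted, weighted, or computed on binarized labels, so the plain unweighted form is used here, and the label distribution is assumed to be a percentage per label. The function names and the sample labels are hypothetical.

```python
from collections import Counter


def cohens_kappa(a, b):
    """Unweighted Cohen's kappa between two equal-length label sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: product of the two raters' marginal label frequencies.
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        # Both raters used one identical label throughout; kappa is undefined,
        # and this sketch reports perfect agreement by convention.
        return 1.0
    return (observed - expected) / (1.0 - expected)


def mean_absolute_error(a, b):
    """Mean absolute difference between two equal-length graded label sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("label sequences must be non-empty and equal in length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def label_distribution(labels, scale=(0, 1, 2, 3)):
    """Percentage of items assigned to each label on the given scale."""
    counts = Counter(labels)
    return {k: 100.0 * counts[k] / len(labels) for k in scale}


# Hypothetical labels, invented for illustration only.
human = [0, 1, 2, 3, 0, 1, 1, 0]
model = [0, 1, 1, 3, 0, 2, 1, 0]

kappa = cohens_kappa(human, model)
mae = mean_absolute_error(human, model)
dist = label_distribution(model)
```

With these sample labels the two sequences agree on 6 of 8 items and the chance agreement is 20/64, so κ = (0.75 - 0.3125) / (1 - 0.3125) = 7/11, and MAE is 2/8 = 0.25.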