Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Relevance Judgment Agreement on TREC-DL 2023 (test)
Loading...
0.418
Cohen's Kappa
UMBRELA
0.26304
0.30327
0.3435
0.38373
Jan 8, 2026
Cohen's Kappa
Updated 4d ago
Evaluation Results
Method
Method
Links
Cohen's Kappa
UMBRELA
adaptation=score thres...
2026.01
0.418
Rank1-14B
adaptation=score thres...
2026.01
0.406
Rank1-32B
adaptation=score thres...
2026.01
0.392
Rank1-7B
adaptation=score thres...
2026.01
0.386
monoT5 3B
adaptation=score thres...
2026.01
0.336
RankLLaMA-13B
adaptation=score thres...
2026.01
0.314
RankLLaMA-7B
adaptation=score thres...
2026.01
0.311
monoT5 large
adaptation=score thres...
2026.01
0.295
monoT5 base
adaptation=score thres...
2026.01
0.269
Feedback
Search any
task
Search any
task