Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Model Ranking on GovReport ROUGE-L (test)
Loading...
0.823
Kendall Tau (τ)
Baseline
0.79908
0.80529
0.8115
0.81771
Jan 20, 2026
Kendall Tau (τ)
Number of Items
Percentage Used (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Kendall Tau (τ)
Number of Items
Percentage Used (%)
Baseline
Ranking Strategy=Baseline
2026.01
0.823
-
-
Adaptive Multi-Model Ranking
Ranking Strategy=Adaptive
2026.01
0.8
98
0.025
Feedback
Search any
task
Search any
task