Model Ranking on FLORES COMET (test)
[Chart: Kendall Tau by method. Top entry: Adaptive Multi-Model Ranking, 0.677 (Jan 20, 2026).]
Evaluation Results
Method                        | Ranking Strategy | Links   | Kendall Tau | Items Count | Percentage Used
Adaptive Multi-Model Ranking  | Adaptive         | 2026.01 | 0.677       | 101         | 2.5
Baseline                      | Baseline         | 2026.01 | 0.503       | -           | -