Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Summarization (Groundedness) on TL;DR
Loading...
0.46
Kendall's Tau
Static Jury
0.4288
0.4369
0.445
0.4531
Dec 1, 2025
Kendall's Tau
Std Dev
Updated 4d ago
Evaluation Results
Method
Method
Links
Kendall's Tau
Std Dev
Static Jury
Jury Configuration=Ave...
2025.12
0.46
0.05
Static Jury
Jury Configuration=Wei...
2025.12
0.46
0.05
Static Jury
Jury Configuration=Wei...
2025.12
0.46
0.06
Static Jury
Jury Configuration=Ave...
2025.12
0.45
0.06
Jury-on-Demand
Jury Configuration=Dyn...
2025.12
0.43
0.05
Feedback
Search any
task
Search any
task