Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Summarization on Cluster-level summaries (held-out set)
Loading...
46
ROUGE-L
Hybrid pipeline
21.04
27.52
34
40.48
Jan 27, 2026
ROUGE-L
BERTScore (F1)
Updated 1mo ago
Evaluation Results
Method
Method
Links
ROUGE-L
BERTScore (F1)
Hybrid pipeline
2026.01
46
91
Prompt-only LLM
2026.01
34
86
Rule-based
2026.01
22
75
Feedback
Search any
task
Search any
task