Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MultiNews

Benchmarks

Task NameDataset NameSOTA ResultTrend
SummarizationMultiNews
F1 Score34.79
28
SummarizationMultiNews (test)
Comprehensiveness4.98
24
Document SummarizationMultiNews
ASR87
14
SummarizationMultiNews
ROUGE Score24.6
10
SummarizationMultiNews
ROUGE-1 Std Dev0.11
8
Text Summarization Hallucination EvaluationMultiNews
Accuracy19
6
Latency EvaluationMultiNews
End-to-End Latency3.86
6
SummarizationMultiNews LongBench (test)
ROUGE-1 Score48.49
3
SummarizationMultiNews
Accuracy25.3
2
Showing 9 of 9 rows