
SamSum

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Dialogue Summarization | SAMSum (test) | ROUGE-2: 33 | 80 |
| Abstractive Summarization | SAMSum | ROUGE-2: 28.97 | 73 |
| Summarization | SamSum | PRR: -0.113 | 66 |
| Selective Generation | SamSum | ROC-AUC: 82.1 | 66 |
| Abstractive dialogue summarization | SAMSum (test) | ROUGE-L: 52.7 | 53 |
| Few-shot Learning | SAMSum | Score: 41.62 | 40 |
| Summarization | SAMSum LongBench | ROUGE-L: 43.57 | 30 |
| Summarization | SAMSum | ROUGE Score: 27.2 | 30 |
| Summarization | SAMSum Full 2019 | F1 Score: 37 | 30 |
| Summarization | SAMSum | BERTScore F1: 91.3 | 30 |
| Factual Consistency Evaluation | SAMSum | Spearman Correlation: 46.7 | 30 |
| Answer Accuracy | Samsum | BRT Accuracy: 39.7 | 26 |
| Factual Consistency Evaluation | SamSum (test) | Pearson Correlation Coefficient: 44.6 | 22 |
| Meeting Summarization | SamSum | HPI: 6.4347 | 22 |
| Selective Prediction | SAMSum | PRR: 32.9 | 20 |
| Summarization | SAMSum | AlignScore: 89.5 | 19 |
| Summarization | SamSum (test) | ROUGE-1: 53.4 | 18 |
| Selective Generation | SamSum | PRR (ROUGE-L): 48.6 | 14 |
| Language Modeling | SAMSum | Perplexity: 31.18 | 13 |
| Summarization Faithfulness | SAMSum | SummaC: 41.08 | 12 |
| Abstractive Summarization | SAMSum sampled (test) | ROUGE Score: 26.88 | 12 |
| Faithfulness Evaluation | SAMSum (test) | SummaC: 29.58 | 12 |
| Summarization | SAMSum | Completeness: 4.98 | 12 |
| Summarization | SAMSum | ROUGE-L: 31.46 | 12 |
| Dialogue Summarization | SAMSum 1.0 (test) | R1: 51 | 11 |
Showing 25 of 57 rows