SamSum

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Dialogue Summarization | SAMSum (test) | ROUGE-2: 33 | 80 |
| Abstractive Summarization | SAMSum | ROUGE-2: 28.97 | 73 |
| Abstractive dialogue summarization | SAMSum (test) | ROUGE-L: 52.7 | 53 |
| Few-shot Learning | SAMSum | Score: 41.62 | 40 |
| Summarization | SAMSum Full 2019 | F1 Score: 37 | 30 |
| Summarization | SAMSum | BERTScore F1: 91.3 | 30 |
| Factual Consistency Evaluation | SAMSum | Spearman Correlation: 46.7 | 30 |
| Factual Consistency Evaluation | SamSum (test) | Pearson Correlation Coefficient: 44.6 | 22 |
| Meeting Summarization | SamSum | HPI: 6.4347 | 22 |
| Summarization | SAMSum | AlignScore: 89.5 | 19 |
| Summarization | SamSum (test) | ROUGE-1: 53.4 | 18 |
| Language Modeling | SAMSum | Perplexity: 31.18 | 13 |
| Summarization Faithfulness | SAMSum | SummaC: 41.08 | 12 |
| Abstractive Summarization | SAMSum sampled (test) | ROUGE Score: 26.88 | 12 |
| Faithfulness Evaluation | SAMSum (test) | SummaC: 29.58 | 12 |
| Summarization | SAMSum | Completeness: 4.98 | 12 |
| Summarization | SAMSum | ROUGE-L: 31.46 | 12 |
| Dialogue Summarization | SAMSum 1.0 (test) | R1: 51 | 11 |
| Output OOD Detection | Samsum | AUROC: 99.99 | 10 |
| Dialogue Summarization | SAMSum | ROUGE-2: 29.88 | 10 |
| Summarization | Samsum | PPL: 4.02 | 9 |
| Input OOD Detection | Samsum | AUROC: 1 | 8 |
| Factual Consistency Evaluation | SamSum | Pearson Correlation Coefficient: 47.7 | 8 |
| Factual Consistency Evaluation | SamSum | Kendall's Tau: 38.2 | 8 |
| Abstractive Summarization | SAMSum (val) | ROUGE-1: 53.8 | 8 |
Showing 25 of 36 rows