DialogSum

Benchmarks

Task Name                   Dataset Name                   SOTA Result           Trend
Reasoning evaluation        DialogSum                      Reasoning 99.1        33
Summarization               DIALOGSUM                      ROUGE-L 51.6          27
Dialogue Summarization      DialogSum                      R-L 39.4              15
Summarization               DialogSum 1.5k examples (val)  ROUGE-L 39.1          11
Summarization               DIALOGSUM                      Std Dev ROUGE-1 0.83  8
Controllable Summarization  DialogSum                      Extent 20.45          7
Dialogue Summarization      DialogSum 50 samples (test)    Informativeness 4.03  3