Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Summarization on CNN/DailyMail human evaluation (100 samples)
Loading...
43
Relevance Score
Soft Layer-Specific Multi-Task Summarization (MTL)
21.16
26.83
32.5
38.17
May 28, 2018
Relevance Score
Readability Score
Overall Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Relevance Score
Readability Score
Overall Score
Soft Layer-Specific Multi-Task Summarization (MTL)
Comparison Pair=MTL vs...
2018.05
43
40
83
Soft Layer-Specific Multi-Task Summarization (MTL)
Comparison Pair=MTL VS...
2018.05
39
33
72
Non-distinguishable
Comparison Pair=MTL vs...
2018.05
35
36
71
Non-distinguishable
Comparison Pair=MTL VS...
2018.05
32
29
61
Pointer-Coverage (See et al., 2017)
Comparison Pair=MTL VS...
2018.05
29
38
67
Baseline
Comparison Pair=MTL vs...
2018.05
22
24
46
Feedback
Search any
task
Search any
task