Dialogue Summarization on DialogSum (test)
[Chart: Informativeness over time. Plotted point: Gold, 4.03 (Sep 2, 2022). Selectable metrics: Informativeness, Factual Consistency.]
Updated 1mo ago
Evaluation Results
Method                          Date      Informativeness   Factual Consistency
Gold (Source=Human Reference)   2022.09   4.03              4.21
SICK++                          2022.09   3.79              3.97
BART-xsum                       2022.09   3.71              3.68