Dialogue Summarization on SAMSum 200 samples (test)
[Chart: Faithfulness by method; top entry ChatGPT, 4.94 (Oct 17, 2023). Selectable metrics: Faithfulness, Fluency, Informativeness, Conciseness.]
Updated 1mo ago
Evaluation Results
| Method        | Evaluator | Date    | Faithfulness | Fluency | Informativeness | Conciseness |
|---------------|-----------|---------|--------------|---------|-----------------|-------------|
| ChatGPT       | ChatGPT   | 2023.10 | 4.94         | 4.94    | 4.78            | 4.89        |
| InstructDS    | ChatGPT   | 2023.10 | 4.6          | 4.82    | 3.78            | 4.92        |
| Human-written | ChatGPT   | 2023.10 | 4.49         | 4.81    | 3.74            | 4.95        |
| Flan-UL2      | ChatGPT   | 2023.10 | 4.45         | 4.78    | 3.52            | 4.91        |
| BART          | ChatGPT   | 2023.10 | 4.22         | 4.8     | 3.37            | 4.93        |
| Alpaca        | ChatGPT   | 2023.10 | 3.59         | 4.07    | 3.19            | 4.89        |