Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Summarization on SAMSum 200 samples (test)
Loading...
4.94
Faithfulness
ChatGPT
3.536
3.9005
4.265
4.6295
Oct 17, 2023
Faithfulness
Fluency
Informativeness
Conciseness
Updated 4d ago
Evaluation Results
Method
Method
Links
Faithfulness
Fluency
Informativeness
Conciseness
ChatGPT
Evaluator=ChatGPT
2023.10
4.94
4.94
4.78
4.89
InstructDS
Evaluator=ChatGPT
2023.10
4.6
4.82
3.78
4.92
Human-written
Evaluator=ChatGPT
2023.10
4.49
4.81
3.74
4.95
Flan-UL2
Evaluator=ChatGPT
2023.10
4.45
4.78
3.52
4.91
BART
Evaluator=ChatGPT
2023.10
4.22
4.8
3.37
4.93
Alpaca
Evaluator=ChatGPT
2023.10
3.59
4.07
3.19
4.89
Feedback
Search any
task
Search any
task