Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Perspective-specific Summarization on PUMA human evaluation (25 threads)
Loading...
93.56
Perspective Accuracy
GPT-4
70.3576
76.3813
82.405
88.4287
Jun 13, 2024
Perspective Accuracy
Fluency
Coherence
Consistency
Extractiveness
Capturing Perspective
Faithfulness
Updated 4d ago
Evaluation Results
Method
Method
Links
Perspective Accuracy
Fluency
Coherence
Consistency
Extractiveness
Capturing Perspective
Faithfulness
GPT-4
setting=zero-shot
2024.06
93.56
3.63
3.88
3.55
3.38
3.95
3.66
Reference
2024.06
92.65
4.42
4.29
4.21
4.1
4.53
4.75
PLASMA
2024.06
87.27
3.83
3.76
3.62
3.55
3.89
3.98
Flan-T5
2024.06
71.25
3.39
3.7
3.4
3.48
3.76
3.81
Feedback
Search any
task
Search any
task