
PUMA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Summarization | PUMA (test) | ROUGE-1 Recall: 30.16 | 11 |
| Perspective-specific Summarization | PUMA human evaluation (25 threads) | Perspective Accuracy: 93.56 | 4 |