Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Summarization on SAMSum Multiple Client (test)
Loading...
49.99
ROUGE-1 (Client 1)
Conf
45.1332
46.3941
47.655
48.9159
May 26, 2026
ROUGE-1 (Client 1)
ROUGE-1 (Client 2)
ROUGE-1 (Client 3)
ROUGE-1 (Client 4)
ROUGE-1 (Client 5)
Average ROUGE-1
Average PGR
Updated 5d ago
Evaluation Results
Method
Method
Links
ROUGE-1 (Client 1)
ROUGE-1 (Client 2)
ROUGE-1 (Client 3)
ROUGE-1 (Client 4)
ROUGE-1 (Client 5)
Average ROUGE-1
Average PGR
Conf
Description=Confidence...
2026.05
49.99
50.09
51.98
48.19
48.17
49.68
98.24
PT
Description=Ceiling pe...
2026.05
49.51
49.98
48.91
50.21
50.11
49.74
-
VisSup
Description=Weak-to-st...
2026.05
48.51
50.6
50.35
49.14
47.47
49.21
84.41
W2S
Description=Weak-to-st...
2026.05
48.02
49.26
50.11
48.06
46.39
48.37
59.71
GRAD-TRANSFORMER
Description=Learning t...
2026.05
47.92
50.5
48.83
49.37
48.45
49.01
78.53
PS
Description=TinyLM fin...
2026.05
45.32
47.7
45.31
46.12
47.25
46.34
-
Feedback
Search any
task
Search any
task