Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Controllable Summarization on DialogSum
Loading...
20.45
Extent
Llama-3.2-Instruct
-0.818
4.7035
10.225
15.7465
Sep 30, 2025
Extent
Length Score
Specificity Score
Top Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Extent
Length Score
Specificity Score
Top Score
Llama-3.2-Instruct
Params=1B
2025.09
20.45
15.65
52.43
81.5
Llama-3.3-Instruct
Params=70B
2025.09
14.91
2.26
20.82
82.9
PACO
Params=1B, Base Model=...
2025.09
14.17
6.28
28.48
82.5
Qwen2.5-Instruct
Params=7B
2025.09
12.08
5.2
26.62
81.7
PACO
Params=7B, Base Model=...
2025.09
8.71
3.3
19.14
82
PACO
Params=70B, Base Model...
2025.09
8.35
1.56
10.2
82.8
Reference summary
Params=-
2025.09
0
0
0
81.7
Feedback
Search any
task
Search any
task