Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-Document Summarization on WCEP10
Loading...
29.77
ROUGE-1
DelimScaling (Qwen2.5-7B)
8.762
14.216
19.67
25.124
Feb 2, 2026
ROUGE-1
ROUGE-2
ROUGE-L
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-1
ROUGE-2
ROUGE-L
DelimScaling (Qwen2.5-7B)
Model Size=7B, Base Mo...
2026.02
29.77
11.7
20.35
Qwen2.5-7B
Model Size=7B, Configu...
2026.02
29.74
11.59
20.3
DelimScaling (Qwen2.5-3B)
Model Size=3B, Base Mo...
2026.02
27.52
9.99
18.47
Qwen2.5-3B
Model Size=3B, Configu...
2026.02
27.3
9.75
18.42
DelimScaling (Phi-1.5)
Model Size=1.5B, Base...
2026.02
9.8
1.49
8.09
Phi-1.5
Model Size=1.5B, Confi...
2026.02
9.57
1.45
7.94
Feedback
Search any
task
Search any
task