Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Summarization on Summarization (Rouge-L)
Loading...
18.4
Rouge-L
Baichuan2-Turbo-192K
1.552
5.926
10.3
14.674
Nov 15, 2023
Rouge-L
Updated 4d ago
Evaluation Results
Method
Method
Links
Rouge-L
Baichuan2-Turbo-192K
Context Window Size=192K
2023.11
18.4
ChatGLM3-6B-32K
Context Window Size=32...
2023.11
17.8
ChatGLM2-6B-32K
Context Window Size=32...
2023.11
16.1
GPT3.5-Turbo-16K
Context Window Size=16K
2023.11
16
Our model
Context Window Size=8K
2023.11
15.6
Vicuna-v1.5-7B-16K
Context Window Size=16...
2023.11
15.1
Qwen-14B-Chat
Context Window Size=8K...
2023.11
13.9
InternLM-7B-8K
Context Window Size=8K...
2023.11
12.4
Longchat-v1.5-7B-32K
Context Window Size=32...
2023.11
9.9
Xgen-7B-8K
Context Window Size=8K...
2023.11
2.2
Feedback
Search any
task
Search any
task