Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Single-document Question Answering on Single-doc QA
Loading...
62.3
F1
ChatGLM3-6B-32K
12.9
25.725
38.55
51.375
Nov 15, 2023
F1
Updated 4d ago
Evaluation Results
Method
Method
Links
F1
ChatGLM3-6B-32K
Context Window Size=32...
2023.11
62.3
GPT3.5-Turbo-16K
Context Window Size=16K
2023.11
61.2
Baichuan2-Turbo-192K
Context Window Size=192K
2023.11
44.7
Vicuna-v1.5-7B-16K
Context Window Size=16...
2023.11
43
Our model
Context Window Size=8K
2023.11
34.4
InternLM-7B-8K
Context Window Size=8K...
2023.11
33.6
ChatGLM2-6B-32K
Context Window Size=32...
2023.11
32.8
Qwen-14B-Chat
Context Window Size=8K...
2023.11
31.4
Longchat-v1.5-7B-32K
Context Window Size=32...
2023.11
29.1
Xgen-7B-8K
Context Window Size=8K...
2023.11
14.8
Feedback
Search any
task
Search any
task