Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Abstract Retrieval on Synthetic Tasks (EM)
Loading...
98.5
Exact Match
Our model
-3.004
23.348
49.7
76.052
Nov 15, 2023
Exact Match
Updated 4d ago
Evaluation Results
Method
Method
Links
Exact Match
Our model
Context Window Size=8K
2023.11
98.5
ChatGLM3-6B-32K
Context Window Size=32...
2023.11
94
Baichuan2-Turbo-192K
Context Window Size=192K
2023.11
90
GPT3.5-Turbo-16K
Context Window Size=16K
2023.11
77.5
ChatGLM2-6B-32K
Context Window Size=32...
2023.11
64.5
Qwen-14B-Chat
Context Window Size=8K...
2023.11
40
Longchat-v1.5-7B-32K
Context Window Size=32...
2023.11
7.6
Vicuna-v1.5-7B-16K
Context Window Size=16...
2023.11
5
Xgen-7B-8K
Context Window Size=8K...
2023.11
3.5
InternLM-7B-8K
Context Window Size=8K...
2023.11
0.9
Feedback
Search any
task
Search any
task