Long-context language understanding on LongBench Zh out-of-domain
[Chart: SingleDoc Acc by date. Highlighted point: 61.2 (Original Prompt), Mar 19, 2024. Selectable metrics: SingleDoc Acc, MultiDoc Acc, Summarization Acc, FewShot Score, Synthetic Score, Average Score.]
Updated 4d ago
Evaluation Results
Method           Details                     Date     SingleDoc Acc  MultiDoc Acc  Summarization Acc  FewShot Score  Synthetic Score  Average Score
Original Prompt  Tokens=14,940               2024.03  61.2           28.7          16                 29.2           77.5             42.5
LLMLingua-2      Type=Task(Question)-Ag...   2024.03  46.7           23            15.3               32.8           72.6             38.1
LLMLingua        Type=Task(Question)-Ag...   2024.03  35.2           20.4          11.8               24.3           51.4             28.6
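The page does not say how Average Score is derived. The reported values are consistent with an unweighted mean of the five task columns, rounded to one decimal place. The sketch below checks that assumption against the numbers listed above. The names `rows`, `mean_score`, and `check_averages` are illustrative and not part of the benchmark or of any library.

```python
# Scores copied from the results above, in column order:
# SingleDoc Acc, MultiDoc Acc, Summarization Acc, FewShot Score, Synthetic Score,
# followed by the reported Average Score.
rows = {
    "Original Prompt": ([61.2, 28.7, 16.0, 29.2, 77.5], 42.5),
    "LLMLingua-2": ([46.7, 23.0, 15.3, 32.8, 72.6], 38.1),
    "LLMLingua": ([35.2, 20.4, 11.8, 24.3, 51.4], 28.6),
}


def mean_score(scores):
    """Unweighted mean of the per-task scores (assumed aggregation rule)."""
    return sum(scores) / len(scores)


def check_averages(table):
    """Return {method: (computed_mean, matches_reported)} for each row."""
    result = {}
    for method, (scores, reported) in table.items():
        computed = round(mean_score(scores), 1)
        # Allow for rounding noise when comparing with the reported value.
        result[method] = (computed, abs(computed - reported) < 0.05)
    return result


for method, (computed, ok) in check_averages(rows).items():
    print(f"{method}: computed mean {computed}, matches reported: {ok}")
```

All three rows match under this assumption, so the Average Score column can be read as a plain mean that weights each task category equally.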