Long-context Question Answering on L-Eval
[Chart: Coursera QA score by method; highlighted point: LongLLMLingua, 30.2 (Mar 20, 2026). Selectable metrics: Coursera QA, QuALITY, SFictionQA, TPO, LongFQA, Legal Contract QA, Average Score.]
Updated 26d ago
Evaluation Results
| Method | Details | Date | Coursera QA | QuALITY | SFictionQA | TPO | LongFQA | Legal Contract QA | Average Score |
|---|---|---|---|---|---|---|---|---|---|
| LongLLMLingua | Backbone=Qwen3-8B, Inp... | 2026.03 | 30.2 | 49 | 69.5 | 72.9 | 15.2 | 10.9 | 41.3 |
| BEAVER | Backbone=Qwen3-8B, Inp... | 2026.03 | 28.3 | 57.4 | 73.4 | 80.3 | 17.7 | 13.4 | 45.1 |
| LLMLingua-2 | Backbone=Qwen3-8B, Inp... | 2026.03 | 25.3 | 53 | 74.2 | 78.4 | 16.9 | 12.8 | 43.5 |
| LLMLingua | Backbone=Qwen3-8B, Inp... | 2026.03 | 24.9 | 48 | 60.9 | 66.6 | 13.6 | 11.2 | 37.5 |
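
The page does not say how Average Score is computed. The sketch below assumes it is the unweighted mean of the six task scores, rounded to one decimal; that rule, and the names `SCORES`, `LISTED_AVERAGE`, `mean_score`, and `ranking`, are illustrative assumptions rather than anything the leaderboard documents. Under that assumption the recomputed values match the listed ones for three methods, while LLMLingua-2 comes out at 43.4 against a listed 43.5, which would be consistent with the site averaging unrounded task scores.

```python
# Task scores copied from the results listed above (all rows: Backbone=Qwen3-8B).
# Order: Coursera QA, QuALITY, SFictionQA, TPO, LongFQA, Legal Contract QA.
SCORES = {
    "LongLLMLingua": [30.2, 49.0, 69.5, 72.9, 15.2, 10.9],
    "BEAVER":        [28.3, 57.4, 73.4, 80.3, 17.7, 13.4],
    "LLMLingua-2":   [25.3, 53.0, 74.2, 78.4, 16.9, 12.8],
    "LLMLingua":     [24.9, 48.0, 60.9, 66.6, 13.6, 11.2],
}

# Average Score column exactly as listed on the page.
LISTED_AVERAGE = {
    "LongLLMLingua": 41.3,
    "BEAVER": 45.1,
    "LLMLingua-2": 43.5,
    "LLMLingua": 37.5,
}


def mean_score(values):
    """Unweighted mean rounded to one decimal (assumed aggregation rule)."""
    return round(sum(values) / len(values), 1)


def ranking(scores):
    """Method names sorted by recomputed mean, best first."""
    return sorted(scores, key=lambda m: mean_score(scores[m]), reverse=True)


if __name__ == "__main__":
    for method in ranking(SCORES):
        recomputed = mean_score(SCORES[method])
        listed = LISTED_AVERAGE[method]
        print(f"{method:15s} recomputed={recomputed:5.1f} listed={listed:5.1f}")
```

The ordering by recomputed mean (BEAVER, LLMLingua-2, LongLLMLingua, LLMLingua) is the same as the ordering by the listed Average Score, so the 0.1 discrepancy does not change the ranking.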