Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Single-Doc Question Answering on LongBench-E

43.3F1 Score

Teacher Model (w/ Context)

0.76411.80722.8533.893Oct 23, 2025
Updated 22d ago

Evaluation Results

MethodLinks
2025.10
43.3
2025.10
39.7
2025.10
35.9
2025.10
33.3
2025.10
32.5
2025.10
30.5
2025.10
24.2
2025.10
20.7
2025.10
19.5
2025.10
19.1
2025.10
17.9
2025.10
17.2
2025.10
12.5
2025.10
10.1
2025.10
8.9
2025.10
5.6
2025.10
2.4