Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-context downstream tasks on LongBench-E (test)

49.92S-Doc QA Perf

Elastic Attention

41.246443.498245.7548.0018Jan 24, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
49.920.6448.920.6330.140.7367.990.74540.6460.70.7953.350.69
2026.01
49.40.7152.940.6930.30.868.550.7949.660.7256.490.8752.710.77
2026.01
48.75-51.85-30.26-68.16-56-55.81-53.28-
2026.01
48.650.745.210.729.930.767.350.754.560.756.470.751.820.7
2026.01
47.690.743.050.730.020.767.860.753.560.761.070.752.110.7
2026.01
46.150.6446.540.6528.190.7267.520.7148.070.6562.950.7851.510.69
2026.01
46.050.746.880.728.30.766.610.748.660.762.240.751.340.7
2026.01
45.57-51.59-28.34-66.64-50.16-61.2-52.16-
2026.01
45.450.744.520.728.160.766.420.747.50.762.380.750.670.7
2026.01
44.40.6839.420.7128.490.8265.260.7644.350.7454.290.8747.590.76
2026.01
44.010.7549.990.7628.30.8366.230.850.950.7760.570.8651.660.8
2026.01
43.77-46.3-30.08-67.32-42.03-64.3-50.73-
2026.01
43.69-38.48-28.46-66.21-49.59-54.38-48.45-
2026.01
43.3-34.97-29.95-64.19-38.54-59.58-46.68-
2026.01
42.20.6638.860.6828.50.7665.730.7348.430.7154.340.8248.080.73
2026.01
42.2-42.33-29.58-64.55-45.87-59.58-49.03-
2026.01
41.730.735.720.728.470.764.590.747.170.753.910.746.950.7
2026.01
41.580.737.580.728.530.764.80.747.230.753.20.747.190.7