Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context understanding on LongBench-E 1.0 (test)

44.56Qasper Score

Exact (Baseline)

19.142425.741232.3438.9388Feb 11, 2025
Updated 24d ago

Evaluation Results

MethodLinks
2025.02
44.5650.6569.1460.3921.319.5275.3381.1443.172299.6750.935.2251.77
2025.02
43.3952.6364.0653.7128.0722.474.6788.7544.8122.339963.6946.354.14
2025.02
42.8748.5452.0538.631.3122.0771.6791.8542.3620.3798.1349.6242.7350.17
2025.02
40.1443.1764.4658.0622.2620.327380.6841.0722.339244.9532.4348.84
2025.02
39.2843.8264.8257.8423.119.57381.6339.7229244.9732.1948.76
2025.02
37.1837.1547.4937.8227.5621.0668.6790.4836.1320.3396.2645.7237.0946.38
2025.02
37.0741.6961.1149.9629.1822.6771.6787.8940.342284.335842.5749.88
2025.02
37.0241.9661.7450.929.2622.6471.6788.141.1423.6787.6758.6443.6350.62
2025.02
36.2146.7866.6457.0216.3516.0770.3377.5341.082299.3349.0435.6248.77
2025.02
35.7537.0446.3736.2427.0920.846990.8837.8820.3996.6548.4541.446.77
2025.02
34.4746.3367.7855.9215.1615.3969.3363.2440.3622.6799.3348.3234.3547.13
2025.02
33.9142.5549.0936.1320.4817.676291.740.2320.3398.8646.739.8646.12
2025.02
33.8639.7547.1235.9620.0317.7863.6790.7640.2120.498.9745.2539.5145.64
2025.02
32.9547.5361.9650.3420.2918.2860.6788.7542.9722.6799.3361.3245.8450.22
2025.02
31.7646.662.835019.0817.686585.5242.612299.3360.6244.449.8
2025.02
20.7332.6249.9342.3921.6318.6959.6774.9629.5711.676346.1632.2538.71
2025.02
20.6530.7139.1432.4323.118.75883.8728.8520.3697.2633.6930.4639.79
2025.02
20.1234.3551.8448.2319.0917.16151.1428.5223.3341.3339.1926.5435.52