Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench v1 (test)

30.7NrtvQA Score

Top-k

13.5417.99522.4526.905Apr 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.04
30.744.755.45546.531.734.825.126.871.592.244.87.810067.163.649.8
2026.04
30.644.756.355.245.930.634.624.426.7739243.56.710062.556.449
2026.04
30.245.554.955.546.731.335.225.227.272.591.743.88.499.565.158.849.5
2026.04
3044.756.55545.43033.524.326.57391.342.46.510061.556.748.6
2026.04
28.743.352.255.245.128.427.22422.869.591.141.26.2995954.447.3
2026.04
27.443.255.755.344.431.233.423.726.272.590.341.96.699.561.751.647.8
2026.04
27.240.752.855.145.629.727.62325.472.588.940.45.694.560.851.546.3
2026.04
26.833.149.342.827.318.83324.227.17186.242.82.98756.954.342.2
2026.04
26.231.348.94026.219.633.324.227.17386.643.32.382.95957.342.6
2026.04
26.135.247.551.645.628.122.922.522.853.590.1406.78557.748.942.8
2026.04
25.539.951.851.439.525.733.923.525.965.584377.899.547.444.844.6
2026.04
25.23148.440.42618.331.523.126.97186.342.83.985.154.753.241.7
2026.04
25.130.548.740.426.718.730.32226.170.585.742.43.470.253.651.540.4
2026.04
23.92947.440.523.518.330.621.82670.585.941.53.556.553.250.439
2026.04
23.727.346.352.340.824.219.222.619.22984.1398.697.557.356.240.8
2026.04
212846.538.419.517.530.823.325.87083.739.52.58549.448.339.1
2026.04
2125.842.518.120.31629.521.727.17085.839.72.575.554.249.237.4
2026.04
18.315.138.330.419.213.816.521.419.93281.738.1459.14951.831.8
2026.04
1815.241.530.71713.221.621.623.15885.840.62.141.551.447.233
2026.04
1723.225.72129.36.820.817.520.745.584.341.2571.559.547.533.5
2026.04
16.522.843.834.719.216.624.42224.870.581.738.63.135.236.538.433
2026.04
14.212.326.423.214.810.117.519.819.85180.539.9415.65245.427.8