Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench v1 (test)

30.7NrtvQA Score

Top-k

13.5417.99522.4526.905Apr 12, 2026Apr 16, 2026Apr 20, 2026Apr 25, 2026Apr 29, 2026May 3, 2026May 8, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.04
30.744.755.45546.531.734.825.126.871.592.244.87.810067.163.649.8
2026.04
30.644.756.355.245.930.634.624.426.7739243.56.710062.556.449
2026.04
30.245.554.955.546.731.335.225.227.272.591.743.88.499.565.158.849.5
2026.04
3044.756.55545.43033.524.326.57391.342.46.510061.556.748.6
2026.04
28.743.352.255.245.128.427.22422.869.591.141.26.2995954.447.3
2026.05
27.9738.6849.9649.9738.6925.2828.0524.5725.757488.7245.836.59561.6761.8546.41
2026.05
27.5735.1748.2351.4336.725.428.6624.3725.6175.589.0146.09695.560.3460.746.02
2026.04
27.443.255.755.344.431.233.423.726.272.590.341.96.699.561.751.647.8
2026.05
27.2136.4149.9650.5437.3325.0425.8223.8724.772.588.9445.0169660.5862.0845.75
2026.04
27.240.752.855.145.629.727.62325.472.588.940.45.694.560.851.546.3
2026.05
2737.9649.8850.5637.6328.232.3325.0526.577688.4847.88597.559.3560.446.86
2026.04
26.833.149.342.827.318.83324.227.17186.242.82.98756.954.342.2
2026.05
26.4137.7548.951.5435.2625.7430.9724.2526.077688.746.256.0695.559.1860.7146.21
2026.05
26.3136.4949.9551.7637.3727.0727.1124.8425.017389.0345.835.598.561.0362.3546.32
2026.04
26.231.348.94026.219.633.324.227.17386.643.32.382.95957.342.6
2026.04
26.135.247.551.645.628.122.922.522.853.590.1406.78557.748.942.8
2026.05
26.0637.2649.6250.0736.5826.9930.4825.1426.2375.589.0846.775.596.559.1460.5846.34
2026.05
26.0535.6350.0550.3936.4427.427.2423.9725.227389.1945.84698.561.4261.5546.12
2026.05
26.0433.0650.7550.2935.8126.5825.0323.3923.6571.588.8646.03697.560.7462.2545.47
2026.05
2635.6849.8949.1135.525.9927.7824.4825.2273.588.5846.469759.4960.645.7
2026.05
25.9138.1349.6252.0739.0328.134.1125.7326.477688.5947.467.019861.5862.3447.51
2026.05
25.8932.9948.5950.3936.7226.2424.8623.8124.176989.2845.39697.560.2161.0345.13
2026.05
25.8536.0548.2249.3136.5924.7527.3423.5925.267188.7545.375.594.559.7659.8845.11
2026.05
25.6133.7648.5848.6536.1925.3725.3924.1424.147089.1845.52597.559.359.6944.88
2026.04
25.539.951.851.439.525.733.923.525.965.584377.899.547.444.844.6
2026.04
25.23148.440.42618.331.523.126.97186.342.83.985.154.753.241.7
2026.04
25.130.548.740.426.718.730.32226.170.585.742.43.470.253.651.540.4
2026.05
24.8231.2150.2250.793523.723.7523.5523.176689.6143.95.594.559.3759.0444.01
2026.05
24.7130.1948.3149.0834.6926.0323.1323.4622.196688.9344.574.596.559.3859.6743.83
2026.05
24.729.4647.5146.4834.3722.7323.423.2322.736089.0744.994.59657.7858.1942.82
2026.05
24.3729.0847.6848.1434.6624.4523.1723.2422.375788.8544.36696.559.3257.9842.95
2026.04
23.92947.440.523.518.330.621.82670.585.941.53.556.553.250.439
2026.04
23.727.346.352.340.824.219.222.619.22984.1398.697.557.356.240.8
2026.05
23.6232.348.5550.2335.4126.9624.2222.2323.5867.588.6345.195.59259.0158.343.95
2026.05
22.825.8347.5547.3835.2324.0321.5421.5921.615589.0544.8448856.5956.6141.35
2026.05
21.9723.3146.2846.6734.5221.4720.9221.8519.714989.1942.636.591.556.4155.640.47
2026.05
21.929.8348.3247.2534.7623.2721.7322.3721.1847.588.3643.159357.6256.3941.35
2026.05
21.2522.9242.344.7533.3421.242121.3719.344.589.3542.474.59155.9854.1239.34
2026.04
212846.538.419.517.530.823.325.87083.739.52.58549.448.339.1
2026.04
2125.842.518.120.31629.521.727.17085.839.72.575.554.249.237.4
2026.05
20.2724.940.424533.5421.4419.8321.2618.4539.587.9440.6638353.1951.5137.74
2026.04
18.315.138.330.419.213.816.521.419.93281.738.1459.14951.831.8
2026.04
1815.241.530.71713.221.621.623.15885.840.62.141.551.447.233
2026.05
17.5317.7832.3640.9430.8419.3416.2720.3313.773887.6438.8576.548.4647.7234.45
2026.05
17.3318.4334.3342.1331.1621.0816.7920.6414.233888.1839.613.58351.2848.9335.54
2026.04
1723.225.72129.36.820.817.520.745.584.341.2571.559.547.533.5
2026.04
16.522.843.834.719.216.624.42224.870.581.738.63.135.236.538.433
2026.04
14.212.326.423.214.810.117.519.819.85180.539.9415.65245.427.8