Long-context language understanding on LongBench (16 Subtask Metrics + Avg)

Best result: Full KV, 31.42 NQA (May 21, 2026)
Updated 11d ago

Evaluation Results

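| Method | Links | Date | NQA | T2 | T3 | T4 | T5 | T6 | T7 | T8 | T9 | T10 | T11 | T12 | T13 | T14 | T15 | T16 | Avg |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|  |  | 2026.05 | 31.42 | 43.15 | 55.84 | 51.72 | 41.56 | 30.24 | 34.62 | 27.58 | 25.41 | 78.52 | 90.12 | 49.82 | 7.15 | 99.2 | 54.32 | 55.76 | 48.53 |
|  |  | 2026.05 | 31.34 | 34.28 | 55.43 | 50.46 | 41.56 | 30.12 | 27.42 | 27.84 | 25.41 | 71.75 | 90.12 | 49.82 | 7.15 | 99.2 | 54.32 | 54.62 | 47.15 |
|  |  | 2026.05 | 30.62 | 46.45 | 57.34 | 58.21 | 50.12 | 32.42 | 35.15 | 25.54 | 27.91 | 73.42 | 92.68 | 43.12 | 8.42 | 100 | 62.14 | 52.58 | 49.76 |
|  |  | 2026.05 | 30.4 | 30.24 | 54.16 | 49.1 | 41.56 | 30.24 | 25.26 | 27.58 | 25.32 | 69.14 | 90.12 | 45.98 | 7.15 | 99.2 | 54.32 | 54.11 | 45.77 |
|  |  | 2026.05 | 30.39 | 32.97 | 51.93 | 50.66 | 41.15 | 30.85 | 27.72 | 28.09 | 24.45 | 69.02 | 90.12 | 49.82 | 7.15 | 99.2 | 54.32 | 53.59 | 46.82 |
|  |  | 2026.05 | 30.27 | 33.38 | 52.55 | 51.54 | 41.5 | 30.61 | 27.84 | 28.57 | 24.77 | 65.68 | 90.12 | 49.82 | 7.15 | 99.2 | 54.32 | 53.69 | 46.09 |
|  |  | 2026.05 | 29.9 | 30.23 | 50.58 | 50.18 | 40.73 | 30.24 | 25.78 | 27.58 | 24.45 | 66.06 | 90.12 | 48.89 | 7.15 | 99.2 | 54.22 | 53.84 | 45.54 |
|  |  | 2026.05 | 29.54 | 30.6 | 52.57 | 51.92 | 41.98 | 31.12 | 27.37 | 30.36 | 24.93 | 66.83 | 90.12 | 49.82 | 7.15 | 99.2 | 53.69 | 52.74 | 45.24 |
|  |  | 2026.05 | 29.4 | 29.9 | 43.15 | 47.35 | 39.02 | 29.07 | 27.52 | 28.79 | 23.65 | 59.02 | 90.12 | 49.82 | 7.14 | 99.2 | 54.32 | 54.4 | 43.68 |
|  |  | 2026.05 | 29.1 | 35.29 | 55.22 | 54.98 | 49.39 | 31.72 | 27.3 | 25.24 | 25.65 | 66.55 | 92.5 | 39.95 | 8.21 | 99.64 | 61.64 | 52.19 | 47.19 |
|  |  | 2026.05 | 29.06 | 35.17 | 53.61 | 51.97 | 41.09 | 29 | 27.6 | 28.28 | 23.44 | 61.86 | 90.12 | 49.82 | 7.15 | 99.2 | 53.54 | 53.77 | 45.04 |
|  |  | 2026.05 | 28.7 | 29.09 | 50.29 | 50.31 | 40.2 | 29.52 | 24.56 | 27.47 | 23.36 | 61.12 | 90.02 | 49.43 | 7.15 | 99.2 | 53.45 | 53.41 | 44.83 |
|  |  | 2026.05 | 28.62 | 34.38 | 52.19 | 55.73 | 48.27 | 31.95 | 27.07 | 24.99 | 25.67 | 63.75 | 92.24 | 41.79 | 8.03 | 99.89 | 61.1 | 54.94 | 46.98 |
|  |  | 2026.05 | 28.61 | 31.25 | 50.45 | 50.26 | 40.17 | 29.2 | 25.97 | 28.33 | 24.55 | 65.41 | 90.12 | 49.82 | 7.15 | 99.2 | 53.07 | 52.56 | 45.65 |
|  |  | 2026.05 | 28.54 | 31.34 | 54.42 | 53.82 | 49.11 | 31.45 | 24.52 | 25.12 | 24.84 | 64.12 | 92.42 | 38.82 | 8.12 | 99.5 | 61.45 | 51.84 | 46.21 |
|  |  | 2026.05 | 28.39 | 30.11 | 50.16 | 49.79 | 39.87 | 29.21 | 25.88 | 28.73 | 24.2 | 62.66 | 90.12 | 48.36 | 7.15 | 99.2 | 53.63 | 52.85 | 44.87 |
|  |  | 2026.05 | 28.12 | 31.42 | 50.84 | 55.12 | 47.82 | 31.82 | 25.12 | 24.84 | 25.12 | 61.42 | 92.12 | 41.42 | 7.92 | 99.85 | 60.84 | 54.91 | 46.17 |
|  |  | 2026.05 | 28.1 | 28.98 | 49.41 | 50.08 | 39.82 | 29.05 | 24.27 | 27.13 | 22.57 | 61.24 | 90.12 | 47.58 | 7.15 | 99.2 | 53.77 | 53.21 | 44.52 |
|  |  | 2026.05 | 27.87 | 27.77 | 49.21 | 49.58 | 39.61 | 27.88 | 23.14 | 26.96 | 22.51 | 58.87 | 89.86 | 46.94 | 7.15 | 99.2 | 52.42 | 52.55 | 43.84 |
|  |  | 2026.05 | 27.78 | 34.04 | 52.09 | 55.92 | 47.83 | 30.92 | 26.47 | 24.78 | 25.24 | 60.39 | 91.49 | 42.13 | 7.99 | 99.38 | 60.41 | 53.26 | 46.27 |
|  |  | 2026.05 | 27.4 | 26.36 | 48.27 | 49.38 | 38.44 | 27.73 | 22.27 | 27.14 | 21.24 | 58.18 | 89.68 | 48.14 | 7.15 | 98.51 | 51.25 | 51.98 | 43.32 |
|  |  | 2026.05 | 27.33 | 31.35 | 50.75 | 54.86 | 46.82 | 30.05 | 24.65 | 25.2 | 23.96 | 58.07 | 91.5 | 41.81 | 7.35 | 99.28 | 59.07 | 54.47 | 45.41 |
|  |  | 2026.05 | 27.19 | 36.96 | 53.19 | 56.46 | 47.23 | 29.1 | 25.99 | 24.28 | 23.64 | 56.22 | 90.76 | 41.63 | 7.96 | 99.28 | 58.75 | 48.71 | 45.46 |
|  |  | 2026.05 | 26.95 | 32.63 | 50.77 | 55.39 | 47.18 | 30.26 | 25.37 | 24.36 | 24.76 | 59.61 | 91.66 | 40.63 | 7.68 | 99.35 | 60.46 | 56.58 | 45.84 |
|  |  | 2026.05 | 26.88 | 30.13 | 50.48 | 55.21 | 47.12 | 30.45 | 23.82 | 24.54 | 24.41 | 56.42 | 91.12 | 41.82 | 7.84 | 99.18 | 59.87 | 53.25 | 45.16 |
|  |  | 2026.05 | 26.81 | 30.38 | 50.42 | 50.29 | - | 26.41 | 21.51 | 26.44 | 20.35 | 49.52 | 88.62 | 48.05 | 7.15 | 99.08 | 50.68 | 52.02 | 42.91 |
|  |  | 2026.05 | 26.64 | 31.29 | 50.37 | 54.75 | 46.73 | 29.03 | 24.1 | 24.11 | 23.85 | 57.3 | 90.8 | 40.01 | 7.32 | 99.09 | 58.99 | 57.39 | 45.13 |
|  |  | 2026.05 | 26.58 | 31.09 | 42.48 | 51.81 | 45.42 | 29.36 | 25.18 | 24.75 | 23.45 | 52.99 | 91.66 | 42.99 | 8.43 | 99.27 | 59.56 | 49.26 | 44.02 |
|  |  | 2026.05 | 26.52 | 28.19 | 49.36 | 54.12 | 46.12 | 29.54 | 22.45 | 25.12 | 23.12 | 54.85 | 91.24 | 41.52 | 7.12 | 99.12 | 58.42 | 54.47 | 44.46 |
|  |  | 2026.05 | 26.4 | 23.96 | 36.53 | 44.67 | 37.02 | 27.16 | 21.75 | 27.3 | 20.65 | 48.85 | 89.65 | 49.82 | 7.14 | 98.98 | 52.07 | 51.73 | 41.48 |
|  |  | 2026.05 | 26.19 | 29.88 | 49.45 | 54.82 | 46.52 | 29.82 | 23.41 | 24.12 | 24.12 | 56.84 | 91.45 | 40.12 | 7.52 | 99.21 | 60.12 | 56.57 | 45.01 |
|  |  | 2026.05 | 25.84 | 28.45 | 49.12 | 54.12 | 46.12 | 28.42 | 22.12 | 23.84 | 23.12 | 54.41 | 90.45 | 39.45 | 7.12 | 98.92 | 58.42 | 57.37 | 44.21 |
|  |  | 2026.05 | 25.12 | 31.62 | 50.71 | 55.45 | 45.58 | 27.21 | 20.81 | 23.54 | 21.21 | 46.51 | 89.65 | 40.71 | 7.68 | 98.85 | 56.81 | 46.51 | 43 |
|  |  | 2026.05 | 24.89 | 24.18 | 35.89 | 42.39 | 34.08 | 25.27 | 22.04 | 24.89 | 19.79 | 49.97 | 83.46 | 46.39 | 7.15 | 98.04 | 49.26 | 49.35 | 39.51 |
|  |  | 2026.05 | 24.82 | 24.81 | 36.54 | 49.24 | 43.52 | 28.12 | 21.15 | 24.42 | 21.65 | 44.82 | 91.24 | 42.92 | 8.41 | 98.85 | 58.51 | 47.92 | 42.35 |
|  |  | 2026.05 | 23.48 | 25.18 | 35.89 | 46.7 | 40.04 | 26.21 | 21.44 | 22.17 | 20.71 | 46.02 | 85 | 39.5 | 7.5 | 98.33 | 55.4 | 45.72 | 39.95 |
|  |  | 2026.05 | 22.92 | 19.95 | 31.35 | 39.87 | 31.95 | 23.67 | 18.88 | 23.56 | 17.94 | 43.97 | 81.54 | 45.15 | 7.15 | 97.54 | 47.63 | 47.41 | 37.53 |
|  |  | 2026.05 | 22.01 | 21.14 | 31.82 | 44.51 | 38.12 | 25.02 | 18.84 | 21.52 | 19.34 | 40.82 | 83.54 | 38.81 | 7.32 | 98 | 54.12 | 44.42 | 38.08 |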