Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language modeling on LongBench (test)

50.87Qasper Score

Full-context

7.470818.737930.00541.2721Jun 9, 2025Jul 29, 2025Sep 18, 2025Nov 7, 2025Dec 28, 2025Feb 16, 2026Apr 8, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
50.8730.2131.8979.8292.7453.1569.8775.43-58.43--38.6538.7617.5610055.8234.2167.5453.92
2026.04
50.2329.3430.5678.4592.1252.4568.9866.54-56.78--36.2337.9816.1299.554.0533.1263.5652.89
2026.04
50.1530.1231.3479.4392.6653.0168.7274.56-57.67--37.4238.5616.6710055.1934.1264.8953.67
2026.04
50.1229.6730.7878.8992.1252.3468.4573.12-57.23--36.4538.3415.9899.554.4933.5663.2352.12
2026.04
49.8728.4530.2378.1292.1551.6767.9866.87-55.43--33.5637.9814.3290.552.5732.5659.1252.34
2026.04
49.8528.9230.1577.4591.5651.8967.8972.12-56.78--35.6737.8915.4399.553.8833.1262.4551.34
2026.04
49.7628.6530.2377.2391.5652.1268.1265.67-55.98--35.1237.5615.3499.553.4632.8763.1252.45
2026.04
49.6529.8731.2379.1292.4552.7868.5673.34-56.34--36.7838.3415.8310054.6233.1564.2152.24
2026.04
49.1227.8929.5677.3491.5650.8966.7865.43-54.37--32.1837.1513.4590.551.6230.8457.6551.23
2026.04
49.1227.8729.3477.4591.8948.7866.1255.67-51.23--34.8736.9812.1296.551.0129.8957.1251.23
2026.04
48.3427.1228.7876.2391.4548.1265.3453.23-49.87--33.4536.2310.8794.549.9429.1256.2350.12
2026.04
42.7625.9823.8973.2390.5649.8766.2365.78-46.54--28.2331.879.1288.548.0629.8749.1247.34
2026.03
41.5425.7127.737688.5947.5161.4862.6829.0752.8849.3739.0128.5835.07698.548.11---
2026.03
41.525.3327.597588.7547.8161.4462.329.6253.0548.4938.728.1630.6769847.65---
2026.03
34.2421.6125.8972.587.6940.9653.9447.7625.9342.3446.2233.6522.826.96.58842.31---
2026.03
33.3623.223.2572.589.5645.1456.955.6227.652.2348.4838.1127.123.645.59644.89---
2026.03
33.1823.1223.217388.7944.8858.0558.1227.7251.5947.5837.627.7323.9859644.97---
2026.03
32.323.2226.677490.4842.2959.1354.0225.5639.7143.5635.4921.1428.584.869.7541.92---
2026.03
31.3922.9526.547390.6142.459.1154.8425.2639.643.8235.4621.5927.075.0269.541.76---
2026.04
22.1226.8929.8774.8793.1249.6566.2362.45-35.23--20.8736.5611.8985.543.5628.9826.5426.12
2026.03
20.5122.6822.837190.4839.8658.6454.0623.9936.0642.4731.3420.2821.365.8369.2539.42---
2026.03
20.22222.837190.8640.0360.1655.9123.3637.3642.3733.1819.9621.765.8369.539.77---
2026.04
19.3427.4530.6574.2392.6548.9864.1261.23-32.67--16.4336.8711.2382.54227.5623.4522.56
2026.03
18.5621.4824.5760.588.738.5558.5252.7323.0929.0236.0128.0715.723.315.3768.0437.01---
2025.06
9.7921.233.476687.8941.3766.5359.94------------
2025.06
9.5820.871.936687.7241.1366.5759.75------------
2025.06
9.5221.283.516687.7241.6966.6659.82------------
2025.06
9.2620.530.976687.4242.6166.2259.67------------
2025.06
9.1420.630.8565.8887.2141.4466.1859.55------------