Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context evaluation on LongBench (test)

32.85NarQA Score

xKV

6.985213.700120.41527.1299Mar 10, 2025
Updated 16d ago

Evaluation Results

MethodLinks
2025.03
32.8545.6254.6951.7436.3828.3331.3224.4922.6126.2610041.3
2025.03
31.8344.3652.5656.3848.3527.2328.6325.0623.4924.899.542.02
2025.03
30.7348.8656.4557.7750.8228.3434.3425.326.9129.2210044.43
2025.03
30.6446.6455.4258.1747.7628.2334.5125.6125.4323.5110043.27
2025.03
30.3445.5653.6257.9247.526.9628.9925.3323.9524.9199.542.23
2025.03
29.9341.2947.2160.1651.9832.7927.2122.8819.864510043.48
2025.03
29.8441.4951.2857.2742.5427.5433.2825.0123.925.8898.1741.47
2025.03
29.3947.2752.559.6756.0134.5833.3223.1424.0435.0910045
2025.03
29.2143.7848.5860.9253.2933.6833.2323.2423.5343.2110044.79
2025.03
28.7842.9847.4358.7952.4232.6933.0223.6723.1838.629843.6
2025.03
28.6844.5147.4360.8851.9132.1729.9523.0521.7945.6710044.19
2025.03
27.8841.8447.6652.7348.7628.4628.7122.2421.1833.186537.97
2025.03
26.4335.7838.446.1745.0823.7924.6222.4116.9839.5263.534.79
2025.03
26.2529.3234.5649.9942.4720.9226.2921.922.6323.8793.535.61
2025.03
24.1334.9927.4747.5545.1321.4426.7619.3421.1544.112931.01
2025.03
18.1120.4127.5138.821.3720.1921.0920.8519.622.759829.88
2025.03
15.5238.3637.1430.0419.722.0619.5222.3518.9122.828530.13
2025.03
7.9816.479.4221.7815.4610.558.837.943.8411.32510.78