Long-context Understanding on LongBench LLaMA-3.1-8B-Instruct (test)

32.8 NrtvQA

StructKV

[Chart: NrtvQA score by date; Apr 8, 2026]
Updated 9d ago

Evaluation Results

Method  Links
2026.04  32.8   44.5   55.25  55.8   46.9   30.85  32.1   25.35  25.9   73.65  92.1   43.25  8.85  99.5   63.2   56.8   48.97
2026.04  32.45  41.88  55.12  55.2   46.85  30.55  29.6   25.15  24.6   73.2   92.45  43     8.6   99.5   62.4   56.15  48.61
2026.04  31.51  40.18  41     55.83  44.3   30.92  28.88  24.58  22.57  70.5   91.28  42.64  7.93  99.5   62.32  56.71  46.92
2026.04  31.15  44.11  54.55  55.47  45.14  31.14  31.22  24.94  24.09  70.5   91.9   42.23  7.96  99.5   63.55  57.91  48.46
2026.04  30.54  40.75  54.57  54.33  46.3   30.55  28.15  24.29  21.91  73     92.38  42.96  7.36  99.5   60.11  54.79  47.59
2026.04  30.26  43.7   54.83  54.39  46.42  30.42  30.28  24.88  22.93  73.5   91.97  43.07  7.61  99.5   61.61  55.67  48.19
2026.04  30.21  45.53  55.01  56.01  46.65  31.28  35.13  25.28  27.25  73     91.64  43.8   8.91  99.5   63.38  56.64  49.33
2026.04  27.99  28.89  34.42  51.21  41.03  24.43  28.34  22.08  22.16  64     89.96  42.27  8.66  85     62.02  56.85  43.08
2026.04  26.05  26.29  33.68  48.07  37.93  25.03  25.37  21.49  19.97  58.5   88.46  42.61  7.91  88     61.07  54.94  41.59
2026.04  26.02  30.74  47.3   37.97  55.97  20.64  31.19  20.92  21.06  61.5   92.75  40.92  6.18  97     31.99  41.12  41.42
2026.04  24.36  21.07  39.73  25.99  43.92  35.78  28.94  21.42  17.62  61     91.53  40.88  4.76  85     59.95  44.38  40.4
2026.04  12.1   35.54  37.58  17.66  33.71  6.56   27.84  21.62  21.04  64     91.89  43.31  3.65  70.08  64.92  57.9   38.09
2026.04  9.8    41.09  41     19.2   33.92  6.83   32.26  21.5   25.62  66     91.27  42.41  4.31  68.25  62.02  54.08  38.72
2026.04  9.32   34.96  37.08  19.65  32.65  6.91   31.29  21.01  25.59  66.5   91.03  42.16  3.41  65.75  58.27  51.48  37.32