Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench (Mistral-8B-Instruct)

30.21NrtvQA

StructKV

22.212424.288726.36528.4413Apr 8, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
30.2146.8851.5661.4357.5538.2332.124.0225.9874.592.0445.371010065.0256.550.68
2026.04
30.0646.5451.3362.3656.7438.6431.1223.9925.8674.592.0445.471010065.0256.1550.61
2026.04
28.0643.8150.5663.4659.4339.2530.5923.3122.0370.590.8443.265.59727.5646.0346.32
2026.04
27.7540.6144.0452.1948.7830.6829.423.1425.357290.6643.267.57061.9959.9145.45
2026.04
27.2843.3144.5753.4149.9330.8830.4723.5525.877291.1644.047.57165.0266.4746.65
2026.04
26.8243.5549.5360.1851.7236.3826.9423.5521.277292.0444.011010065.0266.4749.34
2026.04
26.7245.3449.8361.1152.0736.5829.3623.9923.6574.592.0445.37910065.5565.6550.05
2026.04
25.946.0551.2961.2450.2136.1528.6523.8422.577692.0446.21010064.6864.6849.97
2026.04
25.737.9739.7360.6256.0437.2228.6521.1819.576389.3642.796.585.559.9641.7944.72
2026.04
25.3143.7150.1161.3150.8536.7326.1523.4520.547592.0445.11010062.996549.27
2026.04
24.9237.640.3448.9240.8422.2627.2322.9922.2568.591.0645.027.59766.9164.7445.51
2026.04
24.5547.9952.2160.4352.6435.2832.3624.1626.6474.592.0445.94910067.0667.1950.75
2026.04
23.2535.9128.2651.4641.1324.0826.8821.3821.4367.591.8844.6557263.1763.2142.57
2026.04
22.5233.0227.6948.6837.6621.2623.5521.0918.3661.591.3843.4566362.8462.5540.28