Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context understanding on LongBench (Specific Subtask Scores)

56.02MQA-E Score

CSAttention

30.529637.147343.76550.3827Mar 30, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.03
56.0262.0130.4626.3831.1171.544.1691.9599.555.9433.663.334517.6352.04
2026.03
55.5462.8729.9127.1630.8972.543.7591.6510056.1635.2664.894617.1652.41
2026.03
53.6763.3726.0524.8836.1871.544.388.5410059.433.3569.1347.514.3152.3
2026.03
53.2163.7426.1824.637.057245.188.6210059.1832.6368.894614.2952.25
2026.03
52.9657.5730.1416.6728.697140.0191.829955.2234.0160.524316.3849.79
2026.03
52.1157.3226.3118.9428.085844.2287.998.551.8824.3255.67409.1246.6
2026.03
51.9860.3526.9821.936.97242.884.110058.7333.0961.014414.3950.59
2026.03
50.2153.1927.7426.5726.57046.389.049751.0834.2264.324715.6849.92
2026.03
50.157.926.0123.9934.26144.58598.554.1229.9963.013113.2148.04
2026.03
49.9252.9425.5627.0626.170.545.9190.599749.3432.8863.984616.4449.92
2026.03
48.7853.1625.8614.519.2704265.059638.523.3961.11387.8343.1
2026.03
45.8738.9126.0123.3421.427145.0290.159534.9831.2955.042914.0844.37
2026.03
45.5739.5922.5726.0422.37142.1888.628935.2229.6864.014615.0145.49
2026.03
45.3250.0725.8821.0331.95943.980.29147.9327.6248.3225.513.4843.65
2026.03
40.1740.0129.2123.9428.086241.190.329753.3628.3257.9623.516.545.11
2026.03
39.5634.3226.9621.7828.484742.1189.268751.8325.2155.422115.1841.79
2026.03
37.2630.4321.0725.3317.016341.9884.775231.5622.9259.91316.0437.45
2026.03
31.5131.7719.6221.8615.636141.6884.14229.6925.6753.01345.8135.53