Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context Language Understanding on LongBench (17-Metric Performance Profile)

29.7NrtvQA

Standard

12.43616.91821.425.882Oct 17, 2024
Updated 14d ago

Evaluation Results

MethodLinks
2024.10
29.740.553.45029.132.934.925.427.77689.147.3598.560.462.147.6
2024.10
294153.650.527.532.334.825.427.37689.347.3697.559.961.347.4
2024.10
26.830.438.444.32118.624.92126.275.589.246.36.58960.660.642.5
2024.10
26.336.83448.12517.52821.525.571.592.844.811.56741.568.541.3
2024.10
25.844.346.949.329.420.828.422.126.97492.343.911.56843.669.843.6
2024.10
25.646.451.449.828.828.732.222.427.673.592.945.71268.541.669.744.8
2024.10
25.436.234.444.322.71525.820.226.266.591.143.611.56841.967.140
2024.10
25.145.238.446.224.917.829.122.327.17186.741.310.16735.654.440.1
2024.10
23.432.839.644.722.220.128.823.32773.590.641.93.67258.151.340.8
2024.10
23.218.335.743.720.914.524.122.3267191.141.46.96760.253.438.7
2024.10
22.232.144.841.72320.324.821.3266586.740.43.54652.847.937.4
2024.10
2019.626.237.518.713.323.82223.872.59041.56.76655.247.636.5
2024.10
19.730.335.629.515.520.324.821.3266586.740.43.845.152.847.935.3
2024.10
19.517.526.136.416.112.122.821.425.46686.440.13.570.759.754.236.1
2024.10
19.121.636.927.78.66.527.120.8266483.641.32.97.560.654.931.8
2024.10
17.410.918.411.56.715.923.820.125.574.584.537.43.264.148.545.331.7
2024.10
15.915.72725.56.54.321.919.623.36283.239.91.90.56053.528.7
2024.10
15.818.330.127.374.722.720.225.16282.839.62.11.259.453.629.5
2024.10
13.115.226.923.15.54.421.119.924.26182.838.92.145952.228.3
2024.10
13.113.730.315.64.79.821.520.924.36383.135.12.26.153.446.527.7