Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language modeling on InfiniteBench (Accuracy and Sparsity)

18En. Sum Accuracy

XAttn

14.77615.61316.4517.287Feb 26, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.02
1871.0616.571.3222.371.2813.572.47868.3123.473.0332.970.7599.868.9299.568.5637.170.63
2026.02
1802.2048.9027.503.8018.5031.70100096.9038.60
2026.02
1877.31.974.6847.674.7330.576.493.874.2417.278.0633.770.9810071.7510072.0939.275.04
2026.02
17.975.3919.275.7824.575.711.577.0912.772.9222.877.6632.975.1383.174.1599.273.763675.29
2026.02
17.590.85290.8944.589.82886.693.886.632087.4532.387.3610085.5499.790.3338.688.39
2026.02
1791.7816.692.227.192.251291.51191.0624.490.9634.391.5498.693.999.893.8237.992.11
2026.02
16.7018.7027.1013.5013.2024.1033.1099.3099.3038.30
2026.02
16.785.9117.885.2324.585.3412.584.1611.384.821.883.7632.886.8210087.698.885.137.485.41
2026.02
16.466.311.967.0141.566.9716.569.143.766.692071.4730.661.8910062.298162.1234.665.99
2026.02
14.982.51.982.664584.941683.293.983.281785.813481.6499.881.819982.3736.883.14