Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context Language Modeling on LongBench 16K context length

13.3NrtvQA Score

UltraLLaDA + BA-Att

3.7326.2168.711.184May 19, 2026
Updated 13d ago

Evaluation Results

MethodLinks
2026.05
13.313.627.920.313.89.88.712.620.87.912.78.375.590.935.534.41.382.370.96963.137.2
2026.05
12.114.527.419.913.4119.720.420.87.814.77.579.59236.240.50.781.372.966.256.937.2
2026.05
4.920.421.820.16.383.711.217.45.414.46.77160.133.622.32.128.468.267.960.831.3
2026.05
4.320.321.519.97.57.93.510.816.85.814.27.561.561.528.719.31.927.756.861.95528.8
2026.05
4.120.421.820.86.38.13.410.517.35.414.36.773.563.632.221.51.929.570.96860.631.5
2026.05
4.113.925.220.16.26.85.212.919.97167.47585.233.328.52.181.464.664.563.534.9