Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context reasoning on RULER (Context Length Sweep 8K-64K)

75.3RULER Performance (8K Context)

HyLo-Llama-14MLA14M2

-2.49217.70437.958.096Apr 27, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
75.349.716.60.4
2026.04
74.258.633.510.7
2026.04
73.962.6-33.1
2026.04
73.269.762.952
2026.04
71.765.4-46.6
2026.04
71.360.143.914.1
2026.04
71.24527.114.1
2026.04
71.145.619.20.2
2026.04
71.145.619.20.2
2026.04
68.262.155.746.3
2026.04
66.953.241.431.6
2026.04
65.739.525.211.4
2026.04
65.456.449.942.3
2026.04
63.543.630.317.4
2026.04
63.543.630.317.4
2026.04
61.553.748.141.6
2026.04
60.30.50.10.1
2026.04
59.853.842.530.5
2026.04
590.30.10
2026.04
5952.545.538.8
2026.04
58.741.127.514.6
2026.04
56.54938.427.8
2026.04
55.111.92.40.8
2026.04
53.346.740.437.9
2026.04
53.110.620.5
2026.04
52.548.344.540.8
2026.04
42.50.40.50.3
2026.04
37100
2026.04
36.431.323.816.4
2026.04
35.113.36.34.2
2026.04
18.9310
2026.04
12.36.83.70.1
2026.04
3.5000
2026.04
2.9000
2026.04
0.500.10