Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context evaluation on RULER 64k

100VT Score

Llama-3.1-8B

-4235077Mar 10, 2025May 21, 2025Aug 1, 2025Oct 12, 2025Dec 23, 2025Mar 5, 2026May 16, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2025.05
100100100100100100969910014.892605285.6-------
2025.03
97.29100100-98.9697.92-97.6698.96-85.4283.3359.3891.89-------
2026.05
96.8--------65.2884--86.23100-63.469.8399.597.699.66
2025.03
95100100-98.9694.79-97.66100-68.7583.3358.3389.68-------
2026.05
94.8--------36.5663.8--70.687.93-54.662.5780.475.978.86
2026.05
94.6--------65.181.4--85.4999.93-62.670.0399.6597.598.6
2026.02
93.32------------69.4371.3643.6-----
92.72--------51.0480.07--77.77100-5868.6396.3555.597.6
2026.02
91.96------------63.5367.7430.9-----
2026.05
91.52--------65.9683.8--85.1199.73-62.869.3398.9595.298.73
2025.03
91.46100100-10096.88-94.2798.96-77.0883.3357.2989.93-------
2026.02
90.92------------59.1166.220.2-----
2026.05
90--------60.0872--75.8198.73-47.768.1591.1560.594
2025.05
881001001001009668979819.662.7605680.4-------
2025.03
86.6710096.88-97.9297.92-96.6296.09-78.4778.1356.2588.5-------
2026.02
86.64------------71.9869.8859.41-----
2025.05
85.61001009610096289910018.457.3564875.7-------
2026.05
83.48--------61.8482.73--65.6186.53-42.829.6782.2581.140.07
2025.05
8088100609272093908.470.6525266-------
2025.03
77.7190.7590.63-96.8887.5-85.4294.27-81.9483.3357.2984.57-------
2025.05
74.4968408832040690.841.3564848.4-------
2025.03
73.96100100-8.3370.83-78.1252.6-75.3562.558.3368-------
2026.05
69.6--------59.9262--70.5376.53-5664.2798.1597.650.66
2026.02
58.52------------55.0266.1340.4-----
2026.02
56.4------------45.4260.5519.3-----
2025.05
52.8366012684841637073.3564839.2-------
2025.05
42.4100100921008428629033.648284865.8-------
2025.05
42.4100100100100242840670.857.3322455.1-------
2025.05
31.23256436160739666.6203626.9-------
2026.02
31------------44.2562.9638.8-----
2025.05
28.810010010096122441651.258.7282452.2-------
2026.02
28.08------------43.1861.7639.7-----
2025.05
24.810010010096562439760.873.3322457.3-------
2025.05
22.480280204011115.648244422.9-------
2025.05
20728405220028301.656242031.3-------
2025.05
1668922068400244210.862.6283639-------
2025.05
15.28840160057065.3282019.1-------
2025.05
12.81001001001009248618815.688324868.1-------
2025.05
10.416200288053050.6242414.5-------
2025.05
6.410010092100842853893249.3324862.6-------
2025.05
2.4840800530.872282812.2-------
2025.05
0040400101.274.616289.9-------
2025.05
0000800300.462.620168.5-------