Long-context language understanding on L-Eval

[Chart: Coursera score over time. Labeled point: NTK at 58.28; x-axis date: Feb 4, 2025.]

Evaluation Results

[Method and Links cells and the column headers other than Coursera were not preserved; "-" marks an empty cell.]

Date     Coursera  Col 2  Col 3  Col 4  Col 5  Col 6  Col 7
2025.02  58.28     -      -      -      59.38  1.11   39.59
2025.02  57.7      80     63.86  81.04  64.06  5.56   58.7
2025.02  56.54     74     65.35  84.06  66.41  8.89   59.21
2025.02  56.4      81     59.4   79.18  64.06  5.56   57.6
2025.02  55.96     -      -      -      62.5   5.56   41.34
2025.02  55.38     -      -      -      64.06  5.56   41.67
2025.02  55.23     79     64.36  79.18  67.97  5.56   58.55
2025.02  55.23     -      -      -      67.19  7.78   43.4
2025.02  54.65     -      -      -      65.63  6.67   42.32
2025.02  54.51     80     64.36  77.7   67.97  5.56   58.35
2025.02  54.36     -      -      -      64.06  5.56   41.33
2025.02  54.36     -      -      -      64.06  6.67   41.7
2025.02  54.17     74     65.35  84.06  68.75  7.78   59.1
2025.02  54.17     -      -      -      66.41  7.78   42.79
2025.02  54.07     75     64.36  82.16  67.97  1.11   57.44
2025.02  53.92     -      -      -      65.63  3.33   40.96
2025.02  53.34     75     59.41  81.41  60.94  4.44   55.76
2025.02  53.34     -      -      -      61.72  5.56   40.21
2025.02  53.2      75     63.37  81.04  63.28  2.22   56.35
2025.02  53.05     -      -      -      60.16  4.44   39.22
2025.02  53.05     73     59.41  79.55  68.75  5.56   56.55
2025.02  52.91     82     61.39  79.93  67.19  2.22   57.6
2025.02  52.62     77     63.37  81.04  60.16  3.33   56.25
2025.02  52.33     76     63.86  82.16  71.88  3.33   58.26
2025.02  52.03     -      -      -      42.97  0      31.67
2025.02  52.03     -      -      -      52.34  2.22   35.53
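
The final number in each row appears to be the mean of the scores present in that row, with empty cells skipped. The page does not say this; it is an observation from the numbers. The sketch below checks it for three rows, assuming each row reads as a Coursera score, five further scores (three of which may be empty), and a final value. The names `ROWS`, `mean_of_present`, and `matches_reported` are hypothetical helpers introduced for illustration.

```python
from statistics import mean

# Three rows read off the leaderboard as (scores, final value).
# None stands for an empty ("-") cell. The split of each fused row
# into separate numbers is an assumption, not stated by the page.
ROWS = [
    ([58.28, None, None, None, 59.38, 1.11], 39.59),
    ([57.7, 80, 63.86, 81.04, 64.06, 5.56], 58.7),
    ([52.03, None, None, None, 42.97, 0], 31.67),
]


def mean_of_present(scores):
    """Average the scores in a row, skipping empty cells."""
    present = [s for s in scores if s is not None]
    return mean(present)


def matches_reported(scores, reported, tol=0.005):
    """True if the row mean agrees with the reported value to two decimals."""
    return abs(mean_of_present(scores) - reported) <= tol + 1e-9


if __name__ == "__main__":
    for scores, reported in ROWS:
        computed = round(mean_of_present(scores), 2)
        print(computed, reported, matches_reported(scores, reported))
```

Under this reading, rows with empty cells are averaged over three scores and complete rows over six, so their final values are not directly comparable.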