
Language Modeling on LAMBADA (Accuracy)

79.4 Accuracy

LLaMA-2 70B

[Chart: results over time, Jul 15, 2025 to Apr 12, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy
2026.03   79.4
2026.03   79.4
2026.03   76.6
2026.03   76.6
2026.03   75.5
2026.03   75.5
2026.03   75.3
2026.03   75.3
2026.03   73.2
2026.03   73.2
2026.03   70.7
2026.03   70.7
2026.04   64.6
2026.04   64.6
2026.04   64.6
2026.04   64.6
2026.04   64.6
2026.04   64.5
2026.04   64.5
2026.04   62.9
2025.07   62.1
2025.07   61
2025.07   58.4
2025.07   53.5
2025.07   52.3
2025.07   51.5
2026.03   51.2
2026.03   51.2
2026.04   50.2
2026.04   49.64
2026.04   48.8
2026.04   48.57
2026.04   48.4
2026.04   48.04
2026.04   47.95
2026.03   47.7
2026.03   47.7
2026.04   47.53
2026.04   47.41
2026.04   47.28
2026.04   47
2026.04   47
2026.04   46.6
2026.04   46.26
2026.04   45.9
2026.04   45.42
2026.04   45.12
2026.04   45.02
2026.04   44.7
2026.04   43.39
2025.07   43.2
2025.07   42.9
2026.04   39.03
2026.04   38.55
2026.04   38.21
2026.04   38.17
2026.04   37.68
2026.04   36.81
2025.07   35.2
2026.04   33.2
2025.07   32.9
2026.03   32.6
2026.03   32.6
2025.07   28.5
2026.04   27.71
2025.07   25
2025.07   23
2026.04   18
2026.04   16.2
2026.04   15.4
2026.04   15.1
2026.04   12.8
2026.04   12.6
2026.04   11.93
2025.07   7.1
2026.04   6.09