General Language Modeling on BIG-Bench

85.6 Accuracy

TALE

[Chart: Accuracy over time; y-axis ticks 66.672, 71.586, 76.5, 81.414; x-axis label Oct 26, 2025]
Updated 22d ago

Evaluation Results

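| Method | Date | Accuracy | | | Links |
|---|---|---|---|---|---|
| | 2025.10 | 85.6 | 0.25 | -14.4 | |
| | 2025.10 | 81.6 | 0.14 | -19.9 | |
| | 2025.10 | 81.6 | - | -19.9 | |
| | 2025.10 | 79.2 | - | - | |
| | 2025.10 | 77.2 | - | - | |
| | 2025.10 | 76.4 | - | -32.2 | |
| | 2025.10 | 75.4 | 0.22 | -28 | |
| | 2025.10 | 75 | 0.25 | -27.1 | |
| | 2025.10 | 72.6 | - | -33.8 | |
| | 2025.10 | 71 | - | -45.1 | |
| | 2025.10 | 70.4 | - | - | |
| | 2025.10 | 67.4 | - | - | |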