Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalization on LLaMA Evaluation Expanded Languages 3.1-8B

69.37Overall Score

DeltaMoE

58.50261.323564.14566.9665May 18, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.05
69.37
2026.05
68.39
2026.05
66.88
2026.05
66.39
2026.05
65.81
2026.05
64.91
2026.05
64.38
2026.05
58.92