Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Evaluation on Instruction Tuning Suite (BIG-bench Hard, MMLU, TyDi QA, MGSM)

74.1Average Score

Flan-PaLM 2 (L)

57.9862.16566.3570.535May 17, 2023
Updated 4d ago

Evaluation Results

MethodLinks
74.1
2023.05
69.3
66.1
58.6