Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Evaluation on Instruction Tuning Suite (BIG-bench Hard, MMLU, TyDi QA, MGSM)
Loading...
74.1
Average Score
Flan-PaLM 2 (L)
57.98
62.165
66.35
70.535
May 17, 2023
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Score
Flan-PaLM 2 (L)
2023.05
74.1
PaLM 2 (L)
2023.05
69.3
Flan-U-PaLM-540B
2023.05
66.1
U-PaLM-540B
2023.05
58.6
Feedback
Search any
task
Search any
task