General Language Capability on BIG-bench 57 Task

Accuracy (Weighted): 48.7

GAL 120B

Updated 5mo ago

Evaluation Results

Method	Date	Links	Accuracy (Weighted)	Accuracy (Unweighted)
GAL 120B	2022.11		48.7	45.3
GAL 30B	2022.11		46.6	42.7
OPT 175B	2022.11		43.4	42.6
BLOOM 176B	2022.11		42.6	42.2
OPT 30B	2022.11		39.6	38