Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

About

Recent breakthroughs in large language models (LLMs) have centered on a handful of data-rich languages. What does it take to broaden access to breakthroughs beyond first-class citizen languages? Our work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages, of which over 50% are considered lower-resourced. Aya outperforms mT0 and BLOOMZ on the majority of tasks while covering double the number of languages. We introduce extensive new evaluation suites that broaden the state of the art for multilingual evaluation across 99 languages, including discriminative and generative tasks, human evaluation, and simulated win rates that cover both held-out tasks and in-distribution performance. Furthermore, we conduct detailed investigations on the optimal finetuning mixture composition, data pruning, as well as the toxicity, bias, and safety of our models. We open-source our instruction datasets and our model at https://hf.co/CohereForAI/aya-101

Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Machine Translation | WMT En-Ja 2023 (test) | COMET | 84.6 | 12 |
| Machine Translation (XX to En) | Flores-200 (test) | COMET | 86.3 | 8 |
| Machine Translation (En to XX) | Flores-200 (test) | COMET | 84.1 | 8 |
| Machine Translation | WMT23 En-De (test) | BLEU | 25.1 | 8 |
| Machine Translation (En to XX) | WMT23 (test) | COMET | 80.8 | 8 |
| Machine Translation (XX to En) | WMT23 (test) | COMET | 79.7 | 8 |
| Machine Translation | WMT23 En-Zh (test) | BLEU | 25.4 | 8 |
| Machine Translation | WMT23 En-Ru (test) | BLEU | 22.1 | 8 |
| Machine Translation | WMT23 En-Uk (test) | BLEU | 19.7 | 8 |
| Machine Translation | WMT23 En-XX Average (test) | BLEU | 21.3 | 8 |
Showing 10 of 12 rows
