Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

About

Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages. What does it take to broaden access to breakthroughs beyond first-class citizen languages? Our work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages of which over 50% are considered as lower-resourced. Aya outperforms mT0 and BLOOMZ on the majority of tasks while covering double the number of languages. We introduce extensive new evaluation suites that broaden the state-of-art for multilingual eval across 99 languages -- including discriminative and generative tasks, human evaluation, and simulated win rates that cover both held-out tasks and in-distribution performance. Furthermore, we conduct detailed investigations on the optimal finetuning mixture composition, data pruning, as well as the toxicity, bias, and safety of our models. We open-source our instruction datasets and our model at https://hf.co/CohereForAI/aya-101

Ahmet \"Ust\"un, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker• 2024

Related benchmarks

Task	Dataset	Result
Machine Translation	FLORES	Average Score83.85	47
Machine Translation (En→X)	Multilingual Intersection High Resource	COMET-2287	27
Machine Translation (En→X)	Multilingual Intersection Medium Resource	COMET-22 Score87.54	27
Machine Translation	FLORES High Resource	En->X BLEU22.75	27
Machine Translation	FLORES Medium Resource	BLEU (En→X)20.02	27
Machine Translation (X→En)	Multilingual Intersection High Resource	COMET-22 Score86.55	27
Machine Translation (X→En)	Multilingual Intersection Medium Resource	COMET-22 Score87.32	27
Machine Translation (X→En)	Multilingual Intersection Low Resource	COMET-22 Score85.71	24
Machine Translation (X→Zh)	Multilingual Intersection High Resource	COMET-22 Score83.29	24
Machine Translation (X→Zh)	Multilingual Intersection Medium Resource	COMET-22 Score83.26	24

Showing 10 of 61 rows

Other info

Follow for update

@wizwand_team Discord