Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Aya 23: Open Weight Releases to Further Multilingual Progress

About

This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (\"Ust\"un et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modeling capabilities to approximately half of the world's population. The Aya model covered 101 languages whereas Aya 23 is an experiment in depth vs breadth, exploring the impact of allocating more capacity to fewer languages that are included during pre-training. Aya 23 outperforms both previous massively multilingual models like Aya 101 for the languages it covers, as well as widely used models like Gemma, Mistral and Mixtral on an extensive range of discriminative and generative tasks. We release the open weights for both the 8B and 35B models as part of our continued commitment for expanding access to multilingual progress.

Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Jon Ander Campos, Yi Chern Tan, Kelly Marchisio, Max Bartolo, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Aidan Gomez, Phil Blunsom, Marzieh Fadaee, Ahmet \"Ust\"un, Sara Hooker• 2024

Related benchmarks

TaskDatasetResultRank
Mean Accuracy EvaluationNORMAD-ETI (test)
Accuracy65.8
75
Aspect Sentiment Quad Prediction (ASQP)Czech monolingual
F1 Score61.85
26
Aspect Sentiment Triplet Extraction (ASTE)Czech monolingual
F1 Score69.87
26
Aspect-Category-Opinion-Sentiment (ACOS)Czech monolingual
F1 Score57.46
26
Machine Translation (other-language-to-English)Neko (test)
EXACT12.92
22
Machine TranslationNeko (test)
XCOMET79.27
17
Machine TranslationWMT En-Ja 2023 (test)
COMET86.5
12
Medical ReasoningMMLU-Pro Biology English
Accuracy49.1
11
Machine TranslationWMT23 En-Zh (test)
BLEU44.5
8
Machine TranslationWMT23 En-XX Average (test)
BLEU29.3
8
Showing 10 of 20 rows

Other info

Follow for update