Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EuroLLM-9B: Technical Report

About

This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, including tokenizer design, architectural specifications, data filtering, and training procedures. We describe the pre-training data collection and filtering pipeline, including the creation of EuroFilter, an AI-based multilingual filter, as well as the design of EuroBlocks-Synthetic, a novel synthetic dataset for post-training that enhances language coverage for European languages. Evaluation results demonstrate EuroLLM-9B's competitive performance on multilingual benchmarks and machine translation tasks, establishing it as the leading open European-made LLM of its size. To support open research and adoption, we release all major components of this work, including the base and instruction-tuned models, the EuroFilter classifier, and the synthetic post-training dataset.

Pedro Henrique Martins, Jo\~ao Alves, Patrick Fernandes, Nuno M. Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M. Alves, Jos\'e Pombal, Nicolas Boizard, Manuel Faysse, Pierre Colombo, Fran\c{c}ois Yvon, Barry Haddow, Jos\'e G. C. de Souza, Alexandra Birch, Andr\'e F. T. Martins• 2025

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningWinoGrande
Accuracy73.2
1085
ReasoningBBH
Accuracy47.6
672
Instruction FollowingIFEval
IFEval Accuracy49.4
625
Multitask Language UnderstandingMMLU
Accuracy52.7
413
Commonsense ReasoningWinoGrande
Accuracy50.6
372
Question AnsweringTriviaQA
Accuracy54.9
238
Science Question AnsweringARC-C
Accuracy71.2
193
Safety EvaluationAdvBench--
117
Truthfulness EvaluationTruthfulQA
Accuracy29.6
103
Social Commonsense ReasoningSIQA
Accuracy40.8
89
Showing 10 of 45 rows

Other info

Follow for update