Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EuroLLM-22B: Technical Report

About

This report presents EuroLLM-22B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-22B's development, including tokenizer design, architectural specifications, data filtering, and training procedures. Across a broad set of multilingual benchmarks, EuroLLM-22B demonstrates strong performance in reasoning, instruction following, and translation, achieving results competitive with models of comparable size. To support future research, we release our base and instruction-tuned models, our multilingual web pretraining data and updated EuroBlocks instruction datasets, as well as our pre-training and evaluation codebases.

Miguel Moura Ramos, Duarte M. Alves, Hippolyte Gisserot-Boukhlef, Jo\~ao Alves, Pedro Henrique Martins, Patrick Fernandes, Jos\'e Pombal, Nuno M. Guerreiro, Ricardo Rei, Nicolas Boizard, Amin Farajian, Mateusz Klimaszewski, Jos\'e G. C. de Souza, Barry Haddow, Fran\c{c}ois Yvon, Pierre Colombo, Alexandra Birch, Andr\'e F. T. Martins• 2026

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval--
850
ReasoningBBH--
507
Instruction FollowingIFEval--
292
Science Question AnsweringARC Challenge
Accuracy89.8
234
Mathematical ReasoningGSM8K
Math Score85.5
171
General KnowledgeMMLU
MMLU General Knowledge Accuracy69.8
170
Common Sense ReasoningHellaSwag
Accuracy69.7
164
Mathematical ReasoningMGSM
Accuracy76.1
114
Scientific Question AnsweringGPQA Diamond
Accuracy26.8
64
Scientific ReasoningARC Challenge
Accuracy82.7
56
Showing 10 of 37 rows

Other info

Follow for update