EuroLLM-22B: Technical Report
About
This report presents EuroLLM-22B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-22B's development, including tokenizer design, architectural specifications, data filtering, and training procedures. Across a broad set of multilingual benchmarks, EuroLLM-22B demonstrates strong performance in reasoning, instruction following, and translation, achieving results competitive with models of comparable size. To support future research, we release our base and instruction-tuned models, our multilingual web pretraining data and updated EuroBlocks instruction datasets, as well as our pre-training and evaluation codebases.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval | -- | 1036 | |
| Reasoning | BBH | -- | 672 | |
| Instruction Following | IFEval | IFEval Accuracy67.2 | 625 | |
| Science Question Answering | ARC Challenge | Accuracy89.8 | 342 | |
| General Knowledge | MMLU | MMLU General Knowledge Accuracy69.8 | 234 | |
| Common Sense Reasoning | HellaSwag | Accuracy69.7 | 213 | |
| Mathematical Reasoning | GSM8K | Math Score85.5 | 197 | |
| Mathematical Reasoning | MGSM | Accuracy76.1 | 194 | |
| Mathematics | MATH 500 | Pass@154.5 | 95 | |
| Scientific Reasoning | ARC Challenge | Accuracy82.7 | 94 |