Granite Embedding Multilingual R2 Models
About
We introduce the multilingual Granite Embedding R2 models, a family of encoder-based embedding models for enterprise-scale dense retrieval across 200+ languages. Extending our English-focused R2 release, these models add enhanced support for 52 languages and programming code, a 32,768-token context window (a 64x expansion over R1), and state-of-the-art overall performance across multilingual and cross-lingual text search, code retrieval, long-document search, and reasoning retrieval datasets. The release consists of two bi-encoder models based on the ModernBERT architecture with an expanded multilingual vocabulary: a 311M-parameter full-size, and a 97M-parameter compact model built via model pruning and vocabulary selection that achieves the highest retrieval score of any open multilingual embedding model under 100M parameters. The full-size also supports Matryoshka Representation Learning for flexible embedding dimensionality. Both models are trained on enterprise-appropriate data with governance oversight, and released under the Apache 2.0 license at https://huggingface.co/collections/ibm-granite, designed to support responsible use and enable unrestricted research and enterprise adoption.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multilingual Retrieval | MTEB Multilingual v2 | nDCG@1065.2 | 40 | |
| Retrieval | MTEB eng v2 | nDCG@1052.6 | 31 | |
| Information Retrieval | LongEmbed | NDCG@1071.7 | 26 | |
| Code Retrieval | MTEB Code | nDCG@1063.9 | 21 | |
| Reasoning Retrieval | Reasoning-as-Retrieval RaR-b | NDCG@1028 | 12 | |
| Encoding Speed | Encoding Speed Benchmark 512 tokens | Encoding Speed (Docs/s)2.53e+3 | 12 |