Granite Embedding Multilingual R2 Models

About

We introduce the multilingual Granite Embedding R2 models, a family of encoder-based embedding models for enterprise-scale dense retrieval across 200+ languages. Extending our English-focused R2 release, these models add enhanced support for 52 languages and programming code, a 32,768-token context window (a 64x expansion over R1), and state-of-the-art overall performance across multilingual and cross-lingual text search, code retrieval, long-document search, and reasoning retrieval datasets. The release consists of two bi-encoder models based on the ModernBERT architecture with an expanded multilingual vocabulary: a 311M-parameter full-size, and a 97M-parameter compact model built via model pruning and vocabulary selection that achieves the highest retrieval score of any open multilingual embedding model under 100M parameters. The full-size also supports Matryoshka Representation Learning for flexible embedding dimensionality. Both models are trained on enterprise-appropriate data with governance oversight, and released under the Apache 2.0 license at https://huggingface.co/collections/ibm-granite, designed to support responsible use and enable unrestricted research and enterprise adoption.

Parul Awasthy, Aashka Trivedi, Yushu Yang, Ken Barker, Yulong Li, Bhavani Iyer, Martin Franz, Juergen Bross, Meet Doshi, Vignesh P, Vishwajeet Kumar, Todd Ward, Abraham Daniels, Madison Lee, Luis Lastras, Jaydeep Sen, Radu Florian• 2026

Related benchmarks

Task	Dataset	Result
Multilingual Retrieval	MTEB Multilingual v2	nDCG@1065.2	40
Retrieval	MTEB eng v2	nDCG@1052.6	31
Information Retrieval	LongEmbed	NDCG@1071.7	26
Code Retrieval	MTEB Code	nDCG@1063.9	21
Reasoning Retrieval	Reasoning-as-Retrieval RaR-b	NDCG@1028	12
Encoding Speed	Encoding Speed Benchmark 512 tokens	Encoding Speed (Docs/s)2.53e+3	12

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord