Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Granite Embedding Multilingual R2 Models

About

We introduce the multilingual Granite Embedding R2 models, a family of encoder-based embedding models for enterprise-scale dense retrieval across 200+ languages. Extending our English-focused R2 release, these models add enhanced support for 52 languages and programming code, a 32,768-token context window (a 64x expansion over R1), and state-of-the-art overall performance across multilingual and cross-lingual text search, code retrieval, long-document search, and reasoning retrieval datasets. The release consists of two bi-encoder models based on the ModernBERT architecture with an expanded multilingual vocabulary: a 311M-parameter full-size, and a 97M-parameter compact model built via model pruning and vocabulary selection that achieves the highest retrieval score of any open multilingual embedding model under 100M parameters. The full-size also supports Matryoshka Representation Learning for flexible embedding dimensionality. Both models are trained on enterprise-appropriate data with governance oversight, and released under the Apache 2.0 license at https://huggingface.co/collections/ibm-granite, designed to support responsible use and enable unrestricted research and enterprise adoption.

Parul Awasthy, Aashka Trivedi, Yushu Yang, Ken Barker, Yulong Li, Bhavani Iyer, Martin Franz, Juergen Bross, Meet Doshi, Vignesh P, Vishwajeet Kumar, Todd Ward, Abraham Daniels, Madison Lee, Luis Lastras, Jaydeep Sen, Radu Florian• 2026

Related benchmarks

TaskDatasetResultRank
Multilingual RetrievalMTEB Multilingual v2
nDCG@1065.2
40
RetrievalMTEB eng v2
nDCG@1052.6
31
Information RetrievalLongEmbed
NDCG@1071.7
26
Code RetrievalMTEB Code
nDCG@1063.9
21
Reasoning RetrievalReasoning-as-Retrieval RaR-b
NDCG@1028
12
Encoding SpeedEncoding Speed Benchmark 512 tokens
Encoding Speed (Docs/s)2.53e+3
12
Showing 6 of 6 rows

Other info

Follow for update