
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

About

Multilingual translation remains a challenging task for large language models (LLMs), which must handle intricate language patterns and avoid the stilted output that often arises in automated translation. In this paper, we introduce Seed-X, a family of open-source LLMs comprising instruct and reasoning models that push the limits of translation capability at a 7B parameter size. The base model is pre-trained on a diverse, high-quality dataset encompassing both monolingual and bilingual content across 28 languages, harnessing the full potential of multilingual data. The instruct model is then fine-tuned to translate with Chain-of-Thought (CoT) reasoning and further enhanced through reinforcement learning (RL) to generalize better across diverse language pairs. Seed-X achieves performance comparable to leading closed-source models, including Gemini-2.5 and GPT-4o, across 28 languages, and significantly outperforms larger open-source models in both automatic metrics and human evaluations. We share the best practices learned through our optimization process and make the parameters publicly available to advance translation research and applications.

Shanbo Cheng, Yu Bao, Qian Cao, Luyang Huang, Liyan Kang, Zhicheng Liu, Yu Lu, Wenhao Zhu, Jingwen Chen, Zhichao Huang, Tao Li, Yifu Li, Huiying Lin, Sitong Liu, Ningxin Peng, Shuaijie She, Lu Xu, Nuo Xu, Sen Yang, Runsheng Yu, Yiming Yu, Liehao Zou, Hang Li, Lu Lu, Yuxuan Wang, Yonghui Wu • 2025

Related benchmarks

Task | Dataset | Metric | Result | Rank
Machine Translation | FLORES+ (test) | spBLEU | 45.48 | 128
Machine Translation | WMT24++ v1.0 (test) | XCOMET Score | 87.71 | 49
Machine Translation (xx -> zh) | FLORES+ latest (test) | spBLEU | 32.51 | 30
Machine Translation | WMT 2025 (test) | XCOMET-XXL | 47.83 | 17
Machine Translation | FLORES-200 ZH ⇔ XX 2022 | XCOMET-XXL | 0.7856 | 17
Machine Translation | FLORES-200 EN ⇔ XX 2022 | XCOMET-XXL | 83.12 | 17
Machine Translation | FLORES-200 XX ⇔ XX 2022 | XCOMET-XXL | 68.96 | 17
Machine Translation | Mandarin ⇔ Minority (test) | XCOMET-XXL | 0.4206 | 16
Machine Translation | FLORES200 EN-FI | chrF++ | 62.57 | 13
Machine Translation | WMT EN–FI 24 | chrF++ | 57.48 | 13
Showing 10 of 17 rows
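
For readers unfamiliar with the chrF++ metric listed in the benchmarks, the following is a minimal sketch of a chrF++-style score: an F-score (recall weighted with beta = 2) averaged over character n-grams of orders 1 to 6 and word n-grams of orders 1 to 2. It is a simplified illustration, not the tooling behind the reported numbers; the function names (ngrams, f_beta, chrf_pp_sketch) are hypothetical, and details such as punctuation handling and smoothing in reference implementations are omitted.

```python
from collections import Counter


def ngrams(seq, n):
    """Count all contiguous n-grams of a string or a list of tokens."""
    return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))


def f_beta(hyp_counts, ref_counts, beta=2.0):
    """F-beta for one n-gram order; None if either side has no n-grams."""
    hyp_total = sum(hyp_counts.values())
    ref_total = sum(ref_counts.values())
    if hyp_total == 0 or ref_total == 0:
        return None
    # Counter intersection keeps the minimum count, i.e. clipped matches.
    matches = sum((hyp_counts & ref_counts).values())
    if matches == 0:
        return 0.0
    precision = matches / hyp_total
    recall = matches / ref_total
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def chrf_pp_sketch(hypothesis, reference, char_order=6, word_order=2, beta=2.0):
    """Simplified chrF++-style score on a 0-100 scale (assumption: orders
    that cannot be formed on either side are skipped, the rest averaged)."""
    scores = []

    # Character n-grams, computed with spaces removed.
    hyp_chars = hypothesis.replace(" ", "")
    ref_chars = reference.replace(" ", "")
    for n in range(1, char_order + 1):
        score = f_beta(ngrams(hyp_chars, n), ngrams(ref_chars, n), beta)
        if score is not None:
            scores.append(score)

    # Word n-grams, using plain whitespace tokenization.
    hyp_words = hypothesis.split()
    ref_words = reference.split()
    for n in range(1, word_order + 1):
        score = f_beta(ngrams(hyp_words, n), ngrams(ref_words, n), beta)
        if score is not None:
            scores.append(score)

    return 100.0 * sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # An exact match scores 100.0; a partial match falls between 0 and 100.
    print(chrf_pp_sketch("the cat sat", "the cat sat"))
    print(chrf_pp_sketch("the cat sat", "the cat sat down"))
```

Because beta = 2 weights recall more heavily than precision, a hypothesis that omits reference content is penalized more than one that adds extra words.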
