Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TIM: Teaching Large Language Models to Translate with Comparison

About

Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks with instruction tuning. However, these models can sometimes struggle with tasks that require more specialized knowledge such as translation. One possible reason for such deficiency is that instruction tuning aims to generate fluent and coherent text that continues from a given instruction without being constrained by any task-specific requirements. Moreover, it can be more challenging for tuning smaller LLMs with lower-quality training data. To address this issue, we propose a novel framework using examples in comparison to teach LLMs to learn translation. Our approach involves presenting the model with examples of correct and incorrect translations and using a preference loss to guide the model's learning. We evaluate our method on WMT2022 test sets and show that it outperforms existing methods. Our findings offer a new perspective on fine-tuning LLMs for translation tasks and provide a promising solution for generating high-quality translations. Please refer to Github for more details: https://github.com/lemon0830/TIM.

Jiali Zeng, Fandong Meng, Yongjing Yin, Jie Zhou• 2023

Related benchmarks

TaskDatasetResultRank
Machine TranslationWMT De-En 22 (test)
COMET78.94
29
Machine TranslationWMT En-De 2022 (test)
COMET74.91
25
Machine Translation (Zh-En)WMT 22 (test)
BLEU24.51
23
Machine TranslationWMT ZH-EN 22
COMET79.33
20
Machine TranslationWMT En-Zh 22 (test)
COMET85.1
18
Machine TranslationWMT 22 (test)
COMET79.67
18
Machine TranslationWMT en→zh (test)
BLEU37.2
16
Machine TranslationWMT de-en
BLEU27.91
16
Machine TranslationWMT EN-DE 2022
COMET2282.56
16
Machine TranslationWMT is-en
BLEU23.42
14
Showing 10 of 14 rows

Other info

Follow for update