
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

About

Large language models (LLMs) have demonstrated multilingual capabilities, yet they are mostly English-centric due to imbalanced training corpora. While prior work has leveraged this bias to enhance multilingual performance through translation, it has been largely limited to natural language processing (NLP) tasks. In this work, we extend the evaluation to real-world user queries and non-English-centric LLMs, offering a broader examination of multilingual performance. Our key contribution lies in demonstrating that while translation into English can boost the performance of English-centric LLMs on NLP tasks, it is not universally optimal. For culture-related tasks that need deep language understanding, prompting in the native language proves more effective because it better captures the nuances of culture and language. Our experiments expose varied behaviors across LLMs and tasks in the multilingual context, underscoring the need for a more comprehensive approach to multilingual evaluation. Therefore, we call for greater efforts in developing and evaluating LLMs that go beyond English-centric paradigms.

Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing • 2024

Related benchmarks

Task                                       Dataset         Result                          Rank
Multiple-choice Question Answering         MMLU-Pro        MMLU-Pro Overall Accuracy 77.7  130
Causal Reasoning                           XCOPA (test)    --                              31
Question Answering                         XQuAD           F1 (de) 90.6                    30
Multiple-choice Question Answering         MMLU-ProX       Average Accuracy 77.7           14
Causal Reasoning                           XCOPA           Accuracy (ZH) 97.8              14
Multiple-choice Question Answering         Global MMLU     Accuracy (ZH) 86.4              14
Multilingual Knowledge Question Answering  MKQA (test)     F1-score (All) 50.3             10
Cross-lingual Natural Language Inference   XNLI (test)     Accuracy (All) 70.5             10
Multilingual Question Answering            M3-Exam (test)  Accuracy (All) 59.9             10
Multiple-choice Question Answering         mCSQA           Accuracy (ZH) 28.2              9
Showing 10 of 13 rows
