Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation

About

Instruction tuning has shown great promise in improving the performance of large language models. However, research on multilingual instruction tuning has been limited due to the scarcity of high-quality instruction-response datasets across different languages. To bridge this gap, we present Bactrian-X, a comprehensive multilingual parallel dataset of 3.4 million instruction-response pairs across 52 languages. Leveraging this dataset, we train a set of adapters using low-rank adaptation (LoRA), which are lightweight components that seamlessly integrate with large language models. These adapters have a substantially lower parameter count than the base model, making them easily replaceable and usable as plug-ins for different languages or language groups. Extensive experiments in various multilingual evaluation settings demonstrate that models derived from LoRA-based training over Bactrian-X outperform both the vanilla models and existing instruction-tuned models. The code and models are publicly available at https://github.com/mbzuai-nlp/bactrian-x

Haonan Li, Fajri Koto, Minghao Wu, Alham Fikri Aji, Timothy Baldwin• 2023

Related benchmarks

TaskDatasetResultRank
Natural Language InferenceXNLI--
111
Commonsense ReasoningXStoryCloze
Average Score52.8
32
Causal ReasoningXCOPA (test)
Accuracy (id)62.8
13
Paraphrase IdentificationPAWS-X (test)--
13
Instruction FollowingVicuna & WizardLM Finnish fi
Win Rate (vs ChatGPT)37.6
9
Instruction FollowingVicuna & WizardLM Thai
Win Rate (vs ChatGPT)41.6
9
Instruction FollowingVicuna & WizardLM Turkish
Win Rate (vs ChatGPT)52
9
Instruction FollowingVicuna & WizardLM Swahili
Win Rate (vs ChatGPT)50
9
Instruction FollowingVicuna & WizardLM Indonesian
Win Rate (vs ChatGPT)39.6
9
Instruction FollowingVicuna & WizardLM Vietnamese / vi
Win Rate (vs ChatGPT)50.7
9
Showing 10 of 28 rows

Other info

Follow for update