Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

About

Large Language Models (LLMs) demonstrate strong performance in real-world applications, yet existing open-source instruction datasets often concentrate on narrow domains, such as mathematics or coding, limiting generalization and widening the gap with proprietary models. To bridge this gap, we introduce Infinity-Instruct, a high-quality instruction dataset designed to enhance both foundational and chat capabilities of LLMs through a two-phase pipeline. In Phase 1, we curate 7.4M high-quality foundational instructions (InfInstruct-F-7.4M) from over 100M samples using hybrid data selection techniques. In Phase 2, we synthesize 1.5M high-quality chat instructions (InfInstruct-G-1.5M) through a two-stage process involving instruction selection, evolution, and diagnostic filtering. We empirically evaluate Infinity-Instruct by fine-tuning several open-source models, including Mistral, LLaMA, Qwen, and Yi, and observe substantial performance gains across both foundational and instruction following benchmarks, consistently surpassing official instruction-tuned counterparts. Notably, InfInstruct-LLaMA3.1-70B outperforms GPT-4-0314 by 8.6\% on instruction following tasks while achieving comparable foundational performance. These results underscore the synergy between foundational and chat training and offer new insights into holistic LLM development. Our dataset\footnote{https://huggingface.co/datasets/BAAI/Infinity-Instruct} and codes\footnote{https://gitee.com/li-touch/infinity-instruct} have been publicly released.

Jijie Li, Li Du, Hanyu Zhao, Bo-wen Zhang, Liangdong Wang, Boyan Gao, Guang Liu, Yonghua Lin• 2025

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval+--
189
Text-to-SQLSpider (dev)--
100
Code GenerationBigCodeBench
Accuracy35.3
59
Text2SQLSpider (test)
Exec Acc (Greedy)76.8
37
Text2SQLBIRD (dev)
Exec Acc (Greedy)46.9
37
Text-to-SQLSpider-Realistic--
33
Text-to-SQLSpider-DK--
26
Text-to-SQLSpider-Syn--
26
Text-to-SQLEHRSQL
Gre Score20.9
22
Code ReasoningCRUXEval
Accuracy44.8
21
Showing 10 of 15 rows

Other info

Follow for update