Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

About

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused training datasets. Addressing this challenge, we propose Key-Point-Driven Data Synthesis (KPDDS), a novel data synthesis framework that synthesizes question-answer pairs by leveraging key points and exemplar practices from authentic data sources. KPDDS ensures the generation of novel questions with rigorous quality control and substantial scalability. As a result, we present KPMath, an extensive synthetic dataset tailored for mathematical reasoning, comprising over 800K question-answer pairs. Utilizing KPMath and augmenting it with additional reasoning-intensive corpora, we create the comprehensive KPMath-Plus dataset. The Qwen1.5-72B model, fine-tuned on KPMath-Plus, achieves 87.0% PASS@1 accuracy on GSM8K and 58.3% on MATH, surpassing competitors in the 7B to 70B range and best commercial models like GPT-4 across multiple math reasoning datasets.

Yiming Huang, Xiao Liu, Yeyun Gong, Zhibin Gou, Yelong Shen, Nan Duan, Weizhu Chen• 2024

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K (test)
Accuracy89.9
751
Mathematical ReasoningMATH
Accuracy48.8
643
Mathematical ReasoningMATH (test)
Overall Accuracy48.8
433
Mathematical ReasoningMATH500 (test)
Accuracy76
381
Mathematical ReasoningMATH
Pass@158.3
112
Mathematical ReasoningAIME 2024 (test)
Accuracy10
103
Mathematical ReasoningSVAMP
Pass@182.1
35
Mathematical ReasoningGSM8K v1 (test)
Accuracy83.4
35
Mathematical ReasoningMAWPS
Pass@195.5
28
Mathematical ReasoningASDIV
Pass@189.2
26
Showing 10 of 12 rows

Other info

Follow for update