Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

About

Data synthesis for training large reasoning models offers a scalable alternative to limited, human-curated datasets, enabling the creation of high-quality data. However, existing approaches face several challenges: (i) indiscriminate generation that ignores the solver's ability and yields low-value problems, or reliance on complex data pipelines to balance problem difficulty; and (ii) a lack of reasoning in problem generation, leading to shallow problem variants. In this paper, we develop a problem generator that reasons explicitly to plan problem directions before synthesis and adapts difficulty to the solver's ability. Specifically, we construct related problem pairs and augment them with intermediate problem-design CoT produced by a reasoning model. These data are used to bootstrap problem-design strategies in the generator. Then, we treat the solver's feedback on synthetic problems as a reward signal, enabling the generator to calibrate difficulty and produce complementary problems near the edge of the solver's competence. Extensive experiments on 10 mathematical and general reasoning benchmarks show that our proposed framework achieves a cumulative average improvement of 3.4%, demonstrating robust generalization across both language and vision-language models.

Yongxian Wei, Yilin Zhao, Zixuan Hu, Li Shen, Xinrui Chen, Runxi Cheng, Sinan Du, Hao Yu, Chun Yuan, Dian Li• 2025

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningMinerva
Pass@1 Accuracy35.79
289
Mathematical ReasoningAIME 2024
Pass@1 Accuracy22.08
236
Mathematical ReasoningGSM8K--
204
Mathematical ReasoningAIME 2025
Pass@1 Accuracy18.17
192
Multimodal ReasoningMathVision
Accuracy30.26
162
Multimodal ReasoningMathVerse
Accuracy46.65
130
Mathematical ReasoningAMC
Pass@1 Accuracy67.42
119
General ReasoningSuper GPQA
Accuracy31.33
99
General ReasoningBBEH
Accuracy11.86
64
Mathematical ReasoningMATH 500
Pass@1 Accuracy86.2
59
Showing 10 of 13 rows

Other info

Follow for update