Theorem-Validated Reverse Chain-of-Thought Problem Generation for Geometric Reasoning
About
Large Multimodal Models (LMMs) face limitations in geometric reasoning due to insufficient Chain of Thought (CoT) image-text training data. While existing approaches leverage template-based or LLM-assisted methods for geometric CoT data creation, they often face challenges in achieving both diversity and precision. To bridge this gap, we introduce a two-stage Theorem-Validated Reverse Chain-of-Thought Reasoning Synthesis (TR-CoT) framework. The first stage, TR-Engine, synthesizes theorem-grounded geometric diagrams with structured descriptions and properties. The second stage, TR-Reasoner, employs reverse reasoning to iteratively refine question-answer pairs by cross-validating geometric properties and description fragments. Our approach expands theorem-type coverage, corrects long-standing misunderstandings, and enhances geometric reasoning. Fine-grained CoT improves theorem understanding and increases logical consistency by 24.5%. Our best models surpass the baselines in MathVista and GeoQA by 10.1% and 4.7%, outperforming advanced closed-source models like GPT-4o.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multimodal Reasoning | WeMath | Accuracy57.59 | 43 | |
| Multimodal Reasoning | MathVista | Pass@162.6 | 30 | |
| Multimodal Reasoning | MathVision | Pass@122.72 | 23 | |
| Multimodal Reasoning | MathVerse | -- | 20 | |
| Multimodal Reasoning | GeoQA | Mean@146.49 | 11 | |
| Multimodal Reasoning | GeomVerse | Mean@13.33 | 11 |