
Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner

About

Chart reasoning presents unique challenges due to its inherent complexity -- requiring precise numerical comprehension, multi-level visual understanding, and logical inference across interconnected data elements. Existing vision-language models often struggle with such reasoning tasks, particularly when handling multi-subchart scenarios and numerical sensitivity. To address these challenges, we introduce Chart-R1, a chart-domain vision-language model that leverages reinforcement fine-tuning for advanced chart reasoning. We first propose a programmatic data synthesis approach to generate high-quality step-by-step reasoning data with verifiable answer formats, covering diverse chart types and complexity levels. Our two-stage training strategy includes: (1) Chart-COT, which decomposes complex reasoning into interpretable subtasks through chain-of-thought supervision, and (2) Chart-RFT, which employs group relative policy optimization with numerically sensitive rewards tailored for chart-specific reasoning. Experiments on open-source benchmarks and our proposed ChartRQA dataset demonstrate that Chart-R1 significantly outperforms existing chart-domain methods and rivals large-scale open/closed-source models.
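To make the Chart-RFT stage more concrete, here is a minimal sketch of two ingredients the abstract names: a numerically sensitive reward and the group-relative advantage used in group relative policy optimization. The abstract does not give the exact reward definition, so `numeric_reward`, its 5% relative tolerance, and the string-match fallback are illustrative assumptions, not the authors' implementation. The advantage computation follows the standard GRPO form, normalizing each reward against the mean and standard deviation of its sampled group.

```python
import math
from statistics import mean, pstdev


def numeric_reward(prediction: str, target: str, rel_tol: float = 0.05) -> float:
    """Hypothetical reward: 1.0 for a correct answer, 0.0 otherwise.

    Numeric answers count as correct when they fall within a relative
    tolerance of the target (the 5% default is an assumption); anything
    that does not parse as a number falls back to a case-insensitive
    string match.
    """
    try:
        pred, gold = float(prediction), float(target)
    except ValueError:
        same = prediction.strip().lower() == target.strip().lower()
        return 1.0 if same else 0.0
    return 1.0 if math.isclose(pred, gold, rel_tol=rel_tol) else 0.0


def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    """Normalize rewards within one group of sampled responses."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps keeps the division finite when every response earns the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled answers to one chart question whose reference answer is "42"
samples = ["42.0", "41", "50", "forty"]
rewards = [numeric_reward(s, "42") for s in samples]
advantages = group_relative_advantages(rewards)
```

In this sketch, responses that beat their group's average receive a positive advantage and the rest a negative one, so no separate value model is needed to rank the samples.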

Lei Chen, Xuanle Zhao, Zhixiong Zeng, Jing Huang, Yufeng Zhong, Lin Ma • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Mathematical Reasoning | MathVista | Score | 67.5 | 385 |
| Chart Question Answering | ChartQA | Accuracy | 91 | 356 |
| Chart-based Question Answering | ChartQA Pro | Accuracy | 44 | 52 |
| Chart Understanding | CharXiv | Reasoning Score | 45.2 | 44 |
| Chart Understanding and Reasoning | CharXiv | Score | 46.2 | 31 |
| Chart Understanding and Reasoning | Evochart | Score | 64.7 | 24 |
| Chart Question Answering | ChartQAPro 67% held-out (test) | Factoid Accuracy | 42.17 | 16 |
| Chart Understanding and Reasoning | ChartMuseum | Score | 31.7 | 13 |
| Chart Understanding and Reasoning | ChartQAPro | Score | 44 | 12 |
| General Perception and Reasoning | MMStar | Score | 61.1 | 11 |
